Article Building Blocks for Foundation Model Training and Inference on AWS amazon • 1 day ago • 11
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling Paper • 2605.08083 • Published 5 days ago • 57
Flow-OPD: On-Policy Distillation for Flow Matching Models Paper • 2605.08063 • Published 5 days ago • 83
Article CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models lablab-ai-amd-developer-hackathon • 4 days ago • 7
Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex Paper • 2605.06139 • Published 6 days ago • 62
StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction Paper • 2605.06642 • Published 6 days ago • 22
Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction Paper • 2605.05242 • Published 10 days ago • 93
Continuous-Time Distribution Matching for Few-Step Diffusion Distillation Paper • 2605.06376 • Published 6 days ago • 25
Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning Paper • 2605.06130 • Published 6 days ago • 92
When to Trust Imagination: Adaptive Action Execution for World Action Models Paper • 2605.06222 • Published 6 days ago • 39
The Granularity Axis: A Micro-to-Macro Latent Direction for Social Roles in Language Models Paper • 2605.06196 • Published 6 days ago • 7
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are about improving tiny models. • 126 items • Updated 1 day ago • 19
SkillOS: Learning Skill Curation for Self-Evolving Agents Paper • 2605.06614 • Published 6 days ago • 38
MiA-Signature: Approximating Global Activation for Long-Context Understanding Paper • 2605.06416 • Published 6 days ago • 53
Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation Paper • 2605.03849 • Published 8 days ago • 122