On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters Paper • 2606.02437 • Published Jun 1 • 237
3DCodeBench: Benchmarking Agentic Procedural 3D Modeling Via Code Paper • 2606.01057 • Published May 31 • 8
SkillAdaptor: Self-Adapting Skills for LLM Agents from Trajectories Paper • 2606.01311 • Published May 31 • 37
SOD: Step-wise On-policy Distillation for Small Language Model Agents Paper • 2605.07725 • Published May 8 • 25
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published May 20 • 207
Zero-Shot Sim-to-Real Robot Learning: A Dexterous Manipulation Study on Reactive Catching Paper • 2605.09789 • Published May 10 • 6
Learning to Communicate Locally for Large-Scale Multi-Agent Pathfinding Paper • 2605.07637 • Published May 12 • 20