Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent Paper • 2606.30616 • Published 3 days ago • 73 • 3
TUA-Bench: A Benchmark for General-Purpose Terminal-Use Agents Paper • 2606.28480 • Published 6 days ago • 44 • 2
The Surprising Effectiveness of Video Diffusion Models for Hand Motion Reconstruction Paper • 2606.30308 • Published 3 days ago • 2 • 2
Walking in the Implicit: Interactive World Exploration via Neural Scene Representation Paper • 2606.30045 • Published 3 days ago • 4 • 2
Illuminating Unified Multimodal Model for Free-form Interleaved Text-Image Generation Paper • 2606.30054 • Published 3 days ago • 1 • 2
Agentic Abstention: Do Agents Know When to Stop Instead of Act? Paper • 2606.28733 • Published 5 days ago • 129 • 9
PolicyGuard: A Dialogue-Grounded Sub-Agent Verifier for Policy Adherence in LLM Agents Paper • 2606.29225 • Published 4 days ago • 5 • 2
GUICrafter: Weakly-Supervised GUI Agent Leveraging Massive Unannotated Screenshots Paper • 2606.29705 • Published 3 days ago • 11 • 2
MIMFlow: Integrating Masked Image Modeling with Normalizing Flows for End-to-End Image Generation Paper • 2606.26016 • Published 8 days ago • 6 • 2
Interleaved Speech Language Models Latently Work In Text Paper • 2606.22473 • Published 11 days ago • 11 • 2
Bridging VideoQA and Video-Guided Agentic Tasks via Generalized Keyframe Extraction Paper • 2606.29445 • Published 4 days ago • 22 • 2
Geometric Stability of Neural Population Codes: Regional Variation, Behavioral Relevance, and Circuit Dependence Paper • 2606.29655 • Published 4 days ago • 1 • 2
One Scene, Two Depths: Probing Geometric Ambiguity in Monocular Foundation Models Paper • 2606.29600 • Published 4 days ago • 2 • 2
A Gravitational Interpretation of Fine-Tuning Reversion Paper • 2606.28525 • Published 6 days ago • 1 • 2
Beyond IID: How General Are Tabular Foundation Models, Really? Paper • 2606.30410 • Published 3 days ago • 37 • 3
SWE-Together: Evaluating Coding Agents in Interactive User Sessions Paper • 2606.29957 • Published 3 days ago • 12 • 2
DreamForge-World 0.1 Preview: A Low-Compute Real-Time Controllable World Model Paper • 2606.30292 • Published 3 days ago • 10 • 2
Focusing on What Matters: Saliency-Harnessing Accurate Routing for Diffusion MoE Paper • 2606.26938 • Published 7 days ago • 3 • 2
Monte Carlo Energy Aggregation for Mobile 3D Gaussian Splatting Paper • 2606.30017 • Published 3 days ago • 16 • 2