Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning Paper • 2511.19900 • Published Nov 25, 2025 • 48
When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought Paper • 2511.02779 • Published Nov 4, 2025 • 58
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use Paper • 2510.05592 • Published Oct 7, 2025 • 106
UQ: Assessing Language Models on Unsolved Questions Paper • 2508.17580 • Published Aug 25, 2025 • 15 • 4
UAlign: Pushing the Limit of Template-free Retrosynthesis Prediction with Unsupervised SMILES Alignment Paper • 2404.00044 • Published Mar 25, 2024
EvoLM: In Search of Lost Language Model Training Dynamics Paper • 2506.16029 • Published Jun 19, 2025