Schoenfeld's Anatomy of Mathematical Reasoning by Language Models Paper • 2512.19995 • Published 15 days ago • 14 • 5
V-REX: Benchmarking Exploratory Visual Reasoning via Chain-of-Questions Paper • 2512.11995 • Published 26 days ago • 9 • 4
Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs Paper • 2511.07419 • Published Nov 10, 2025 • 26 • 2
ChartAB: A Benchmark for Chart Grounding & Dense Alignment Paper • 2510.26781 • Published Oct 30, 2025 • 1
Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs Paper • 2507.07996 • Published Jul 10, 2025 • 34 • 14
Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs Paper • 2507.07996 • Published Jul 10, 2025 • 34 • 14
Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs Paper • 2507.07996 • Published Jul 10, 2025 • 34 • 14
Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs Paper • 2507.07996 • Published Jul 10, 2025 • 34 • 14
Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs Paper • 2507.07996 • Published Jul 10, 2025 • 34 • 14
Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs Paper • 2507.07996 • Published Jul 10, 2025 • 34 • 14
Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test Paper • 2506.21551 • Published Jun 26, 2025 • 28 • 2
FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing Paper • 2506.20911 • Published Jun 26, 2025 • 41 • 2
FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing Paper • 2506.20911 • Published Jun 26, 2025 • 41 • 2
Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test Paper • 2506.21551 • Published Jun 26, 2025 • 28 • 2
Optimizing Length Compression in Large Reasoning Models Paper • 2506.14755 • Published Jun 17, 2025 • 10 • 2
WALL-E 2.0: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents Paper • 2504.15785 • Published Apr 22, 2025 • 22 • 4
WALL-E 2.0: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents Paper • 2504.15785 • Published Apr 22, 2025 • 22 • 4
Exploring Expert Failures Improves LLM Agent Tuning Paper • 2504.13145 • Published Apr 17, 2025 • 12 • 4
Exploring Expert Failures Improves LLM Agent Tuning Paper • 2504.13145 • Published Apr 17, 2025 • 12 • 4
ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness Paper • 2504.10514 • Published Apr 10, 2025 • 48 • 4