Mind's Eye: A Benchmark of Visual Abstraction, Transformation and Composition for Multimodal LLMs Paper • 2604.16054 • Published 6 days ago
Chain-of-Thought Degrades Visual Spatial Reasoning Capabilities of Multimodal LLMs Paper • 2604.16060 • Published 6 days ago
Mind's Eye: A Benchmark of Visual Abstraction, Transformation and Composition for Multimodal LLMs Paper • 2604.16054 • Published 6 days ago
Faithful GRPO: Improving Visual Spatial Reasoning in Multimodal Language Models via Constrained Policy Optimization Paper • 2604.08476 • Published 13 days ago • 8
Faithful GRPO: Improving Visual Spatial Reasoning in Multimodal Language Models via Constrained Policy Optimization Paper • 2604.08476 • Published 13 days ago • 8
Faithful GRPO: Improving Visual Spatial Reasoning in Multimodal Language Models via Constrained Policy Optimization Paper • 2604.08476 • Published 13 days ago • 8
Do You See Me : A Multidimensional Benchmark for Evaluating Visual Perception in Multimodal LLMs Paper • 2506.02022 • Published May 28, 2025
MIR: Methodology Inspiration Retrieval for Scientific Research Problems Paper • 2506.00249 • Published May 30, 2025 • 2