SpaceDG: Benchmarking Spatial Intelligence under Visual Degradation • Paper • 2605.22536 • Published 8 days ago • 28
Conditional Equivalence of DPO and RLHF: Implicit Assumption, Failure Modes, and Provable Alignment • Paper • 2605.20834 • Published 9 days ago • 5
SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise • Paper • 2602.12783 • Published Feb 13 • 246
APEX: Large-scale Multi-task Aesthetic-Informed Popularity Prediction for AI-Generated Music • Paper • 2605.03395 • Published 24 days ago • 5
KWBench: Measuring Unprompted Problem Recognition in Knowledge Work • Paper • 2604.15760 • Published Apr 17 • 2
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model • Paper • 2604.20796 • Published Apr 22 • 242
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability • Paper • 2604.06628 • Published Apr 8 • 326
Can Natural Image Autoencoders Compactly Tokenize fMRI Volumes for Long-Range Dynamics Modeling? • Paper • 2604.03619 • Published Apr 4 • 9
GaussianGPT: Towards Autoregressive 3D Gaussian Scene Generation • Paper • 2603.26661 • Published Mar 27 • 25
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models • Paper • 2603.26164 • Published Mar 27 • 365
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence • Paper • 2603.28032 • Published Mar 30 • 342