Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models Paper • 2604.25636 • Published 5 days ago • 23
HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents Paper • 2604.07430 • Published 25 days ago • 187
HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning Paper • 2603.17024 • Published Mar 17 • 109
SpatialActor Collection Models and datasets of SpatialActor (https://github.com/shihao1895/SpatialActor) • 4 items • Updated Jan 9 • 1
SpatialActor Collection Models and datasets of SpatialActor (https://github.com/shihao1895/SpatialActor) • 4 items • Updated Jan 9 • 1