PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling Paper • 2512.04784 • Published 8 days ago • 23
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published 6 days ago • 163
4DLangVGGT: 4D Language-Visual Geometry Grounded Transformer Paper • 2512.05060 • Published 6 days ago • 18
What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards Paper • 2512.00425 • Published 11 days ago • 47
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper • 2512.02014 • Published 9 days ago • 60
OneThinker: All-in-one Reasoning Model for Image and Video Paper • 2512.03043 • Published 8 days ago • 30
Self-Calibration Collection Efficient Test-Time Scaling via Self-Calibration https://arxiv.org/abs/2503.00031 • 7 items • Updated Jun 8 • 3
PosS-Speculative-Decoding Collection This collection contains models of the paper "PosS:Position Specialist Generates Better Draft for Speculative Decoding" • 9 items • Updated Jun 5 • 2
Guided Self-Evolving LLMs with Minimal Human Supervision Paper • 2512.02472 • Published 8 days ago • 48
Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion Paper • 2512.04926 • Published 6 days ago • 40
PixelDiT: Pixel Diffusion Transformers for Image Generation Paper • 2511.20645 • Published 15 days ago • 26
view article Article Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms 21 days ago • 33