InteractiveOmni: A Unified Omni-modal Model for Audio-Visual Multi-turn Dialogue Paper • 2510.13747 • Published Oct 15, 2025 • 32
view article Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective Jan 27 • 71
Uni-ViGU: Towards Unified Video Generation and Understanding via A Diffusion-Based Video Generator Paper • 2604.08121 • Published 13 days ago • 42
Efficient RL Training for LLMs with Experience Replay Paper • 2604.08706 • Published 13 days ago • 17
Harvey Collection A legal reasoning model specialized in Salvadoran jurisprudence • 4 items • Updated 9 days ago • 1
Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning Paper • 2604.04746 • Published 14 days ago • 70
LongCat-Next: Lexicalizing Modalities as Discrete Tokens Paper • 2603.27538 • Published 23 days ago • 144
Aquiles-Studio Collection High-performance image and video generation models for Aquiles-Image. Faster inference, lower costs • 9 items • Updated Mar 2 • 1
view article Article We’re open-sourcing our text-to-image model and the process behind it Nov 12, 2025 • 97