LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published 1 day ago • 38
LTX-2 Collection LTX-2 base models and accompanying LoRAs and IC-LoRAs • 12 items • Updated 1 day ago • 8
HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming Paper • 2512.21338 • Published 14 days ago • 21
SemanticGen: Video Generation in Semantic Space Paper • 2512.20619 • Published 15 days ago • 89
SiD-DiT Collection Collection of Distilled Flow Matching Models with Score Identity Distillation • 17 items • Updated Nov 29, 2025 • 1
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling Paper • 2512.14614 • Published 22 days ago • 67
Chatterbox Turbo Collection Ultra-Fast, Open-Source Text-to-Speech for Real-Time Voice AI • 3 items • Updated 23 days ago • 14
Distribution Matching Variational AutoEncoder Paper • 2512.07778 • Published about 1 month ago • 28
TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows Paper • 2512.05150 • Published Dec 3, 2025 • 74
Video Foundation Models Collection A list of all the (usable) video generation diffusion models. Models that are not upto current standards are skipped. • 10 items • Updated Dec 3, 2025 • 1
Ovis-Image Collection Ovis-Image is a 7B text-to-image model specifically optimized for high-quality text rendering under stringent computational constraints. • 7 items • Updated Dec 4, 2025 • 6
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published Nov 27, 2025 • 224
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published Dec 2, 2025 • 245
Glance: Accelerating Diffusion Models with 1 Sample Paper • 2512.02899 • Published Dec 2, 2025 • 29
Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation Paper • 2511.20714 • Published Nov 25, 2025 • 48