Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published 28 days ago • 167
SemiCD-VL: Visual-Language Model Guidance Makes Better Semi-supervised Change Detector Paper • 2405.04788 • Published May 8, 2024
Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning Paper • 2506.07227 • Published Jun 8, 2025
FreeSwim: Revisiting Sliding-Window Attention Mechanisms for Training-Free Ultra-High-Resolution Video Generation Paper • 2511.14712 • Published Nov 18, 2025 • 2
FreeSwim: Revisiting Sliding-Window Attention Mechanisms for Training-Free Ultra-High-Resolution Video Generation Paper • 2511.14712 • Published Nov 18, 2025 • 2