GenLCA: 3D Diffusion for Full-Body Avatars from In-the-Wild Videos Paper • 2604.07273 • Published 5 days ago • 4
PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding Paper • 2604.00886 • Published 12 days ago • 6
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published 21 days ago • 123
AVControl: Efficient Framework for Training Audio-Visual Controls Paper • 2603.24793 • Published 18 days ago • 26
Less Gaussians, Texture More: 4K Feed-Forward Textured Splatting Paper • 2603.25745 • Published 17 days ago • 15
4DGS360: 360° Gaussian Reconstruction of Dynamic Objects from a Single Video Paper • 2603.21618 • Published 21 days ago • 15