SkyReels-V4: Multi-modal Video-Audio Generation, Inpainting and Editing model Paper โข 2602.21818 โข Published 16 days ago โข 53
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding Paper โข 2403.12895 โข Published Mar 19, 2024 โข 32