Running on A100 226 Omnilingual ASR Media Transcription 🌍 226 Transcribe audio or video into text in any language
facebook/dinov3-vitb16-pretrain-lvd1689m Image Feature Extraction • 85.7M • Updated Aug 19, 2025 • 212k • 91
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated about 1 month ago • 183k • 1.56k