MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix Paper • 2505.13032 • Published May 19, 2025 • 3
MNV-17: A High-Quality Performative Mandarin Dataset for Nonverbal Vocalization Recognition in Speech Paper • 2509.18196 • Published Sep 19, 2025 • 2
MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization Paper • 2601.01554 • Published 3 days ago • 34
Running Featured 17 MOSS Transcribe Diarize 🏢 17 Transcribe audio/video files with speaker identification