Running on A100 235 Omnilingual ASR Media Transcription 🌍 235 Transcribe audio/video to text in many languages
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated Dec 10, 2025 • 322k • 1.58k
speechbrain/emotion-recognition-wav2vec2-IEMOCAP Audio Classification • Updated Jul 23, 2024 • 535k • 176