Audio Spaces
-
71📈
-
951
Seamless M4T
📞 -
5.07k
MusicGen
🎵Generate music from text descriptions and optional melodies
-
812
Audioldm Text To Audio Generation
🔊Generate audio from text descriptions
-
308
AudioLDM2 Text2Audio Text2Music Generation
🔊Generate audio and waveform video from text
-
222
AudioSep
🐠 -
170
Lp Music Caps
🎵Generate captions for music audio
-
311
Tortoise Tts
🐢Expressive Text-to-Speech
-
22
All In One
📊 -
2.77k
XTTS
🐸Generate speech from text using a reference voice
-
189
Coqui Bark Voice Cloning
🐸 -
367
VALL E X
🎙Generate audio from text using voice prompts
-
193
WavJourney
🔥 -
264
Music To Image
🎶 -
277
MMS
🌍Transform and identify speech with MMS
-
607
ElevenLabs TTS
🗣Generate voice from text using ElevenLabs
-
289
AudioGPT
🚀 -
2.38k
Bark
🐶Generate realistic audio from text
-
36
SpeechT5 Speech Recognition Demo
👩 -
174
CoquiTTS (Official)
🐸 -
2.57k
Whisper
📉Transcribe audio files or YouTube videos into text
-
659
Moe TTS
😊Generate and convert voice with text and audio inputs
-
17
YourTTS
🔥 -
557
Talking Face Generation with Multilingual TTS
👄Generate a talking face video from text in multiple languages
-
562
OpenAI TTS New
📊 -
167
Mustango
🐢 -
55
OWSM Demo
🔊 -
696
StyleTTS 2
🗣Efficient, fast, and natural text to speech with StyleTTS 2!
-
400
HierSpeech++ (Zero-shot TTS)
⚡Generate high-quality speech from text using a prompt audio
-
21
Video2music
📚Generate music for a video based on its content and key
-
187
Whisper Large V2
🤫 -
64
Musicgen Prompt Upsampling
🌖Generate music from text prompts 🎶
-
516
Seamless M4T v2
📞Translate speech and text between languages
-
318
Seamless Streaming
📞Translate text between languages
-
52
Matcha TTS
🍵Generate speech from text with speaker selection
-
275
MusicGen Streaming
🔥Generate music from text prompts
-
412
Resemble Enhance
🚀Enhance and denoise your audio files
-
260
Singing Voice Conversion
🎼Transform your voice into a singer's
-
52
NaturalSpeech2
🎧Generate speech with cloned timbre
-
21
Create Your Own TTS Dataset
🔥 -
Podcast Transcription
🐢 -
1.1k
OpenVoice
🤗Generate voice from text using a reference audio
-
94
M2UGen Demo
💻 -
68
Pheme
📊 -
7
ESPnet2 TTS
📈Convert text to speech in English, Chinese, or Japanese
-
37
Whisper-WebUI
🚀Generate subtitles and translate audio files
-
173
Image2SFX Comparison
👂Generates an audio environment from an image
-
379
WhisperSpeech
🌬 -
144
MetaVoice 1B
🗣A demo of MetaVoice 1B, a new TTS model by MetaVoice.
-
890
TTS Arena V2
🏆Vote on the latest TTS models!
-
173
Whisper Speech X DreamTalk
😽Combine voice cloning and portrait lipsync animation
-
197
Canary 1b
🐤Transcribe and translate audio into text
-
81
SALMONN Audio Questioning
⚡Deeply interrogate audio file content
-
468
MeloTTS
🗣Fast, efficient, & multilingual text-to-speech
-
311
Audio Editing
🎧Edit audio with text prompts
-
18
ChatMusician
💻 -
73
xVASynth TTS
🧝CPU powered, low RTF, emotional, multilingual TTS
-
180
NaturalSpeech3 FACodec
🏃Convert and reconstruct speech files
-
25
Hey Gemma
☎ -
70
Ratchet + Whisper
🗣Convert audio to text
-
3
AutoSubs
📜Automatically add on-screen subs to your videos
-
161
VoiceCraft
📈 -
321
TangoFlux
🚀Text to Audio (Sound SFX) Generator
-
826
Parler-TTS
🥖High-fidelity Text-To-Speech
-
184
Sing an idea ➡️ Music
🔥Bring song ideas to life
-
75
Musicgen Songstarter Demo
👁Generate music using descriptions and optional melody audio
-
145
Whisper JAX
👀Transcribe or translate audio from microphone, file, or YouTube
-
22
AudioLCM
🏢Generate audio from text
-
160
Stable Audio Live Multiplayer
💻Generate audio from text prompts
-
446
Stable Audio Open Zero
🔥Generate audio from text prompts
-
13
Make An Audio 3
🐠Generate audio from text prompts
-
60
Mars5 Space
📉 -
5
Tango Music AF
🎵Text to Music Generator
-
16
Jam
🐠Generate a song from lyrics and style reference
-
107
BigVGAN
🔊Generate high-quality audio from input audio
-
89
SenseVoice
🐠Transcribe audio with emotions and events
-
29
PicoAudio
📈Generate audio from text descriptions with timestamps
-
7
Audio Flamingo Demo
📚 -
29
MusiConGen
🪩 -
20
Mms Zeroshot
🌍Transcribe audio in any language using text data
-
198
GPT SoVITS V2 Pro Plus
🤗Generate speech from text using reference audio
-
274
EzAudio
🟣Generate and edit audio from text prompts
-
214
OpenMusic
🎶Generate music from text descriptions
-
544
Midi Music Generator
🎼Generate MIDI music from prompts
-
986
Whisper Turbo
🤯Transcribe audio or YouTube videos into text
-
338
Realtime Whisper Turbo
🤯Realtime implementation of Whisper large turbo
-
163
Whisper Large V3 Turbo WebGPU
🚀ML-powered speech recognition directly in your browser
-
649
OpenAudio S1
🏆Generate speech from text
-
444
TTS Spaces Arena
🤗Blind vote on HF TTS models!
-
19
Diva Realtime Chat
🗣Generate text responses from audio input
-
2.64k
F5-TTS
🗣F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
-
260
MaskGCT TTS Demo
😻MaskGCT TTS Demo
-
128
MelodyFlow
🎵Generate music from text descriptions
-
146
Fish Agent
💬An end-to-end (e2e) Voice Language Model by Fish Audio.
-
64
Nexa Omni Demo
🎧Generate text from audio input
-
2.98k
Kokoro TTS
❤Upgraded to v1.0!
-
117
Make Custom Voices With KokoroTTS
⚡Make Custom Voices With KokoroTTS
-
310
Llasa 3b Tts
🔥Zero Shot voice cloning with llasa 3b (Unofficial Demo)
-
12
Llasa 1b Multilingual TTS
🌍Generate speech from text with or without cloning a voice
-
343
Kokoro Text-to-Speech (WebGPU)
🗣High-quality speech synthesis powered by Kokoro TTS
-
42
Hibiki Simple
👄High-Fidelity Simultaneous Speech-To-Speech Translation
-
407
Zonos
🌍Generate audio from text with customizable emotions and settings
-
74
Kokoro Web
🗣ML-powered speech synthesis directly in your browser
-
636
Di♪♪Rhythm
🎶Blazingly Fast and Embarrassingly Simple Song Generation
-
22
Audiobox Aesthetics
📚Demo for audiobox-aesthetics
-
229
Spark TTS
🌖A text-to-speech model powered by SparkAudio and Mobvoi.
-
844
Sesame CSM
🌱Conversational speech generation
-
238
Orpheus TTS
🚀Try Orpheus TTS here
-
42
Canary 1B Flash
🐤Canary 1B Flash demo
-
216
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
🎙Generate speech from text using a reference audio
-
6
AudioMorphix
🌊Prepare environment and run Gradio app
-
92
MegaTTS3 Demo
👋 -
155
AudioX
👀Generate audio from text and video prompts
-
100
Vevo for Zero-shot VC, TTS, and More
🐠Controllable Zero-Shot Voice Imitation
-
1.69k
Dia 1.6B
👯Generate realistic dialogue from a script, using Dia!
-
43
Aero 1 Audio Demo
💬Demo for Aero-1-Audio
-
44
Voila Demo
💻Chat with a voice-clone AI
-
585
ACE Step
😻A Step Towards Music Generation Foundation Model
-
2
Audio Difficulty Estimator
🎹Estimate piano difficulty from audio
-
105
TIGER Audio Extractor
✂Extraction & Reconstruction for Efficient Speech Separation
-
14
Music2emo
📊Towards Unified Music Emotion Recognition across Dimensional
-
13
SonicVerse
🖼Generate detailed music descriptions from audio clips
-
39
Auffusion
😻Audio Gen, Audio Style Transfer and Audio InPainting
-
1.56k
Chatterbox TTS
🍿Expressive Zeroshot TTS
-
117
PlayDiffusion
🎨Generate modified audio from text and voice
-
2
Voice Clone Arena
🏆Vote on the latest Voice Clone TTS models!
-
218
Conversational WebGPU
🚀 -
450
Song Generation
🎵Generate a custom song from lyrics and optional prompts
-
54
NotaGen
📊Generate classical sheet music in ABC notation
-
70
Audio Flamingo 3 Demo
🚀Audio Flamingo 3 Demo
-
32
Audio Flamingo 3 Chat
🐠Audio Flamingo 3 demo for multi-turn multi-audio chat
-
6
MSR UTMOS
🐢Multiple sampling rate MOS prediction with SFI conv
-
382
Higgs Audio Demo
🎤Higgs Audio Demo
-
15
sidon_demo_beta
🐋Speech restoration demo of Sidon.
-
63
Canary 1b V2
🐤Transcribe and Translate in 25 European Languages
-
17
SonicMaster – Text-Guided Music Restoration & Mastering
🎧Enhance audio using text prompts
-
6
OLMoASR
🌍Open Models and Data for Training Robust Speech Recognition
-
85
VibeVoice-Large
🏃Generate podcast audio from a script and voice samples
-
10
TaDiCodec TTS AR Qwen2.5 0.5B
📚Generate speech from text with voice cloning
-
8
EchoX
🔥An end-to-end speech large language model.
-
43
VoxCPM 0.5B
🐢Generate expressive speech from text with optional voice cloning
-
34
FireRedTTS2
🔥Long-form multi-speaker dialogue generation
-
3
FireRedASR
🚀FireRedASR Demo
-
438
IndexTTS 2 Demo
🏢Generate expressive speech from text with emotion control
-
5
SongFormer
🎵State-of-the-art music analysis with multi-scale datasets