F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo) Space • F5-TTS • Running on Zero • Featured • 2.82k
MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder Paper • 2505.07916 • Published May 12, 2025 • 134
VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model Paper • 2505.03739 • Published May 6, 2025 • 9