LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM Paper • 2503.04724 • Published 8 days ago • 60
HKUSTAudio/Llasa-1B-multi-speakers-genshin-zh-en-ja-ko Text-to-Speech • Updated 29 days ago • 145 • 2
Running on Zero 6 6 Llasa 1B Finetuned For Two Speakers 🔥 Using dataset shb777/gemini-flash-2.0-speech for finetuning
Running on Zero 10 10 Llasa 1B Multi Speakers Genshin Zh En Ja Ko 🚀 Llasa-1B-Multilingual finetuned using simon3000/genshin-voic
Llasa Collection TTS foundation model compatible with Llama framework (160k hours tokenized speech data released) • 11 items • Updated 22 days ago • 15