Post
787
Added
@amphion
MaskGCT &
@hexgrad
StyleTTS fine tuned model by the name of kokoro to the forked TTS Arena Space. If things keep up from what is seen in the preliminary results, then these two may end up in the TOP 5 of all TTS models. ๐ค๏ธ๐๏ธ
Pendrokar/TTS-Spaces-Arena
Svngoku/maskgct-audio-lab
hexgrad/Kokoro-TTS
I chose @Svngoku 's forked HF space over amphion's due to the overly high ZeroGPU duration demand on the latter. 300s!
amphion/maskgct
Had to remove @mrfakename 's MetaVoice-1B Space from the available models as that space has been down for quite some time. ๐ค๏ธ
mrfakename/MetaVoice-1B-v0.1
I'm close to syncing the code to the original Arena's code structure. Then I'd like to use ASR in order to validate and create synthetic public datasets from the generated samples. And then make the Arena multilingual, which will surely attract quite the crowd!
Pendrokar/TTS-Spaces-Arena
Svngoku/maskgct-audio-lab
hexgrad/Kokoro-TTS
I chose @Svngoku 's forked HF space over amphion's due to the overly high ZeroGPU duration demand on the latter. 300s!
amphion/maskgct
Had to remove @mrfakename 's MetaVoice-1B Space from the available models as that space has been down for quite some time. ๐ค๏ธ
mrfakename/MetaVoice-1B-v0.1
I'm close to syncing the code to the original Arena's code structure. Then I'd like to use ASR in order to validate and create synthetic public datasets from the generated samples. And then make the Arena multilingual, which will surely attract quite the crowd!