Vc Chat
📚
Transcribe and Synthesize Audio in Realtime
Yes! Thanks for letting me know
around 10gb, and around 300 chars is the sweet spot. you can chunk text and do it though
I had a look at both, it seems doable. Ill try follow the repeng example. But its a bit confusing how they generate the dataset