emirhanbilgic/read-my-pdf-outloud

Aug 31

The voice seems to change a bit for every sentence. Is there any way to ensure consistency across each sentence?

Owner Sep 1

Hey John, not completely possible with this model. Giving a name to the description helps but not perfect. You need to train a model with data from only one speaker. I will upload a tutorial soon on how to do it. Will keep you posted.

johnblues

Sep 1

Ok, thank you. I will look forward to that.

emirhanbilgic

Owner Sep 4

Hey John, I've just created a tutorial on how to fine-tune a text-to-speech model in any language. If your dataset consists of only one person's voice, you'll get great voice consistency in the output.

The tutorial video:
https://www.youtube.com/watch?v=TZIBQ24UCgA

Model weights:
https://huggingface.co/emirhanbilgic/speecht5_finetuned_emirhan_tr

Training codes:
https://github.com/emirhanbilgic/Turkish-TTS

Dataset I use:
https://huggingface.co/datasets/erenfazlioglu/turkishvoicedataset

Hugging face demo to try:
https://huggingface.co/spaces/emirhanbilgic/Text-to-speech-Turkish

Spaces:

emirhanbilgic
/

read-my-pdf-outloud

Running

Voice consistency