metadata
license: cc-by-4.0
datasets:
- SPRINGLab/IndicTTS-Hindi
- SPRINGLab/IndicVoices-R_Hindi
language:
- hi
pipeline_tag: text-to-speech
F5 Hindi 24KHz Model
This is a Hindi Text-to-Speech model trained from scratch using the F5 architecure.
Details
- Developed by: SPRING Lab, Indian Institute of Technology, Madras
- Language: Hindi
- License: CC-BY-4.0
Uses
The model was developed and is primarily intended for research purposes.
How to Get Started with the Model
Clone the following github repo and refer to the README: https://github.com/rumourscape/F5-TTS/tree/main
Training Details
The model was trained on 8x A100 40GB GPUs for close to a week. We would like to thank CDAC for providing the compute resources.
Training Data
We used the Hindi subsets of IndicTTS and IndicVoices-R datasets for training this model.
- IndicTTS-Hindi: https://huggingface.co/datasets/SPRINGLab/IndicTTS-Hindi
- IndicVoices-R_Hindi: https://huggingface.co/datasets/SPRINGLab/IndicVoices-R_Hindi