Text-to-Speech
F5-TTS
Hindi
F5-Hindi-24KHz / README.md
rumourscape's picture
Update README.md
418921c verified
|
raw
history blame
1.3 kB
metadata
license: cc-by-4.0
datasets:
  - SPRINGLab/IndicTTS-Hindi
  - SPRINGLab/IndicVoices-R_Hindi
language:
  - hi
pipeline_tag: text-to-speech

F5-TTS Hindi 24KHz Model

This is a Hindi Text-to-Speech model trained from scratch using the F5 architecure.

Details

  • Developed by: SPRING Lab, Indian Institute of Technology, Madras
  • Language: Hindi
  • License: CC-BY-4.0

Uses

The model was developed and is primarily intended for research purposes.

How to Get Started with the Model

Clone the following github repo and refer to the README: https://github.com/rumourscape/F5-TTS/tree/main

Training Details

The model was trained on 8x A100 40GB GPUs for close to a week. We would like to thank CDAC for providing the compute resources.

We used the "small" configuration(151M parameter) model for training according to the F5 paper.

Training Data

We used the Hindi subsets of IndicTTS and IndicVoices-R datasets for training this model.