F5-TTS-Russian / README.md
hotstone228's picture
Update README.md
34b348c verified
metadata
license: cc-by-nc-sa-4.0
datasets:
  - mozilla-foundation/common_voice_17_0
  - bond005/sberdevices_golos_10h_crowd
  - bond005/sova_rudevices
  - Aniemore/resd_annotated
language:
  - ru
base_model:
  - SWivid/F5-TTS

Overview

The F5-TTS model is finetuned specifically for Russian language

License

This model is released under the Creative Commons Attribution Non Commercial Share Alike 4.0 license, which allows for free usage, modification, and distribution

Model Information

Base Model: SWivid/F5-TTS
Total Training Duration: 250.000 steps

Training Configuration:

"exp_name": "F5TTS_Base",
"learning_rate": 1e-05,
"batch_size_per_gpu": 4500,
"batch_size_type": "frame",
"max_samples": 64,
"grad_accumulation_steps": 1,
"max_grad_norm": 1,
"epochs": 144,
"num_warmup_updates": 5838,
"save_per_updates": 11676,
"last_per_steps": 2918,
"finetune": true,
"file_checkpoint_train": "",
"tokenizer_type": "char",
"tokenizer_file": "",
"mixed_precision": "fp16",
"logger": "wandb",
"bnb_optimizer": true

Usage Instructions

Go to base repo

To do

  • Correct stressmarks
  • English support

Other links