---
license: cc-by-nc-sa-4.0
datasets:
- mozilla-foundation/common_voice_17_0
- bond005/sberdevices_golos_10h_crowd
- bond005/sova_rudevices
- Aniemore/resd_annotated
language:
- ru
base_model:
- SWivid/F5-TTS
---

## Overview

This is the F5-TTS model fine-tuned specifically for the Russian language.

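## License

This model is released under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International license (CC BY-NC-SA 4.0), which permits non-commercial use, modification, and distribution, provided that credit is given and derivative works are shared under the same license.
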
## Model Information

**Base Model:** SWivid/F5-TTS

**Total Training Duration:** 250,000 steps

**Training Configuration:**

```json
{
  "exp_name": "F5TTS_Base",
  "learning_rate": 1e-05,
  "batch_size_per_gpu": 4500,
  "batch_size_type": "frame",
  "max_samples": 64,
  "grad_accumulation_steps": 1,
  "max_grad_norm": 1,
  "epochs": 144,
  "num_warmup_updates": 5838,
  "save_per_updates": 11676,
  "last_per_steps": 2918,
  "finetune": true,
  "file_checkpoint_train": "",
  "tokenizer_type": "char",
  "tokenizer_file": "",
  "mixed_precision": "fp16",
  "logger": "wandb",
  "bnb_optimizer": true
}
```

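As a rough reading of the training configuration: with `batch_size_type` set to `"frame"`, `batch_size_per_gpu` is a budget of mel-spectrogram frames rather than a number of utterances. The sketch below converts that budget to seconds of audio and relates the warmup length to the total step count. The sample rate (24 kHz) and hop length (256) are assumptions based on common F5-TTS defaults and are not stated in this card; the helper names are hypothetical.

```python
# Values copied from the training configuration in this card.
BATCH_SIZE_PER_GPU = 4500    # frames, because batch_size_type is "frame"
NUM_WARMUP_UPDATES = 5838
TOTAL_STEPS = 250_000

# ASSUMPTIONS (not stated in this card): common F5-TTS mel settings.
SAMPLE_RATE_HZ = 24_000
HOP_LENGTH = 256


def frames_to_seconds(frames: int,
                      hop_length: int = HOP_LENGTH,
                      sample_rate: int = SAMPLE_RATE_HZ) -> float:
    """Approximate audio duration covered by a number of mel frames."""
    return frames * hop_length / sample_rate


def warmup_fraction(warmup: int = NUM_WARMUP_UPDATES,
                    total: int = TOTAL_STEPS) -> float:
    """Share of all training steps spent in learning-rate warmup."""
    return warmup / total


if __name__ == "__main__":
    print(f"audio per GPU batch: {frames_to_seconds(BATCH_SIZE_PER_GPU):.1f} s")
    print(f"warmup share of training: {warmup_fraction():.2%}")
```

Under these assumptions, 4500 frames corresponds to about 48 seconds of audio per GPU batch, and warmup covers roughly 2.3% of the 250,000 steps.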

## Usage Instructions

For usage instructions, see the [base repository](https://github.com/SWivid/F5-TTS).

## To do

- Correct stress marks
- English support

## Other links

- [GitHub repo](https://github.com/HotDro4illa/F5-TTS)