File size: 1,228 Bytes
2b9adc1 0e8f65d 2b9adc1 9e97798 2b9adc1 34b348c |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 |
---
license: cc-by-nc-sa-4.0
datasets:
- mozilla-foundation/common_voice_17_0
- bond005/sberdevices_golos_10h_crowd
- bond005/sova_rudevices
- Aniemore/resd_annotated
language:
- ru
base_model:
- SWivid/F5-TTS
---
## Overview
The F5-TTS model is finetuned specifically for Russian language
## License
This model is released under the Creative Commons Attribution Non Commercial Share Alike 4.0 license, which allows for free usage, modification, and distribution
## Model Information
**Base Model**: SWivid/F5-TTS
**Total Training Duration:** 250.000 steps
**Training Configuration:**
```json
"exp_name": "F5TTS_Base",
"learning_rate": 1e-05,
"batch_size_per_gpu": 4500,
"batch_size_type": "frame",
"max_samples": 64,
"grad_accumulation_steps": 1,
"max_grad_norm": 1,
"epochs": 144,
"num_warmup_updates": 5838,
"save_per_updates": 11676,
"last_per_steps": 2918,
"finetune": true,
"file_checkpoint_train": "",
"tokenizer_type": "char",
"tokenizer_file": "",
"mixed_precision": "fp16",
"logger": "wandb",
"bnb_optimizer": true
```
## Usage Instructions
Go to [base repo](https://github.com/SWivid/F5-TTS)
## To do
- Correct stressmarks
- English support
# Other links
- [Github repo](https://github.com/HotDro4illa/F5-TTS) |