File size: 1,228 Bytes
2b9adc1
0e8f65d
2b9adc1
 
9e97798
 
 
2b9adc1
 
 
 
34b348c
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
---
license: cc-by-nc-sa-4.0
datasets:
- mozilla-foundation/common_voice_17_0
- bond005/sberdevices_golos_10h_crowd
- bond005/sova_rudevices
- Aniemore/resd_annotated
language:
- ru
base_model:
- SWivid/F5-TTS
---
## Overview
The F5-TTS model is finetuned specifically for Russian language

## License
This model is released under the Creative Commons Attribution Non Commercial Share Alike 4.0 license, which allows for free usage, modification, and distribution

## Model Information
**Base Model**: SWivid/F5-TTS  
**Total Training Duration:** 250.000 steps

**Training Configuration:**
```json
"exp_name": "F5TTS_Base",
"learning_rate": 1e-05,
"batch_size_per_gpu": 4500,
"batch_size_type": "frame",
"max_samples": 64,
"grad_accumulation_steps": 1,
"max_grad_norm": 1,
"epochs": 144,
"num_warmup_updates": 5838,
"save_per_updates": 11676,
"last_per_steps": 2918,
"finetune": true,
"file_checkpoint_train": "",
"tokenizer_type": "char",
"tokenizer_file": "",
"mixed_precision": "fp16",
"logger": "wandb",
"bnb_optimizer": true
```

## Usage Instructions
Go to [base repo](https://github.com/SWivid/F5-TTS)

## To do
- Correct stressmarks
- English support

# Other links
- [Github repo](https://github.com/HotDro4illa/F5-TTS)