marianbasti commited on
Commit
0caff00
·
verified ·
1 Parent(s): abcad42

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +68 -5
README.md CHANGED
@@ -1,5 +1,68 @@
1
- ---
2
- license: other
3
- license_name: coqui-public-model-license
4
- license_link: LICENSE
5
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ license_name: coqui-public-model-license
4
+ license_link: https://coqui.ai/cpml
5
+ library_name: coqui
6
+ pipeline_tag: text-to-speech
7
+ datasets:
8
+ - ylacombe/google-argentinian-spanish
9
+ language:
10
+ - es
11
+ ---
12
+ # ⓍTTS 🇦🇷
13
+ ⓍTTS is a Voice generation model that lets you clone voices into different languages by using just a quick 6-second audio clip. There is no need for an excessive amount of training data that spans countless hours.
14
+
15
+ This is the same or similar model to what powers [Coqui Studio](https://coqui.ai/) and [Coqui API](https://docs.coqui.ai/docs).
16
+
17
+ ### Language
18
+ This model's Spanish language has been finetuned using [ylacombe's google argentinian spanish dataset](https://huggingface.co/datasets/ylacombe/google-argentinian-spanish) to archieve an argentinian accent.
19
+
20
+ ### Training Parameters
21
+ ```
22
+ batch_size=8,
23
+ grad_accum_steps=96,
24
+ batch_group_size=48,
25
+ eval_batch_size=8,
26
+ num_loader_workers=8,
27
+ eval_split_max_size=256,
28
+ optimizer="AdamW",
29
+ optimizer_wd_only_on_weights=True,
30
+ optimizer_params={"betas": [0.9, 0.96], "eps": 1e-8, "weight_decay": 1e-2},
31
+ lr=5e-06,
32
+ lr_scheduler="MultiStepLR",
33
+ lr_scheduler_params={"milestones": [50000 * 18, 150000 * 18, 300000 * 18], "gamma": 0.5, "last_epoch": -1},
34
+ ```
35
+
36
+ ### License
37
+ This model is licensed under [Coqui Public Model License](https://coqui.ai/cpml). There's a lot that goes into a license for generative models, and you can read more of [the origin story of CPML here](https://coqui.ai/blog/tts/cpml).
38
+
39
+ Using 🐸TTS Command line:
40
+
41
+ ```console
42
+ tts --model_name /path/to/xtts/ \
43
+ --text "Che boludo, vamos a tomar unos mates." \
44
+ --speaker_wav /path/to/target/speaker.wav \
45
+ --language_idx es \
46
+ --use_cuda true
47
+ ```
48
+
49
+ Using the model directly:
50
+
51
+ ```python
52
+ from TTS.tts.configs.xtts_config import XttsConfig
53
+ from TTS.tts.models.xtts import Xtts
54
+
55
+ config = XttsConfig()
56
+ config.load_json("/path/to/xtts/config.json")
57
+ model = Xtts.init_from_config(config)
58
+ model.load_checkpoint(config, checkpoint_dir="/path/to/xtts/", eval=True)
59
+ model.cuda()
60
+
61
+ outputs = model.synthesize(
62
+ "Che boludo, vamos a tomar unos mates.",
63
+ config,
64
+ speaker_wav="/data/TTS-public/_refclips/3.wav",
65
+ gpt_cond_len=3,
66
+ language="es",
67
+ )
68
+ ```