AlexK-PL committed on
Commit
ab5a545
1 Parent(s): 77a1fb2

Update about.md

Files changed (1)
  1. about.md +3 -5
about.md CHANGED
@@ -224,11 +224,9 @@ This version is tailored for the Catalan language, as it was trained only on Cat
224
 
225
  ## Adaptation to Catalan
226
 
227
- The original Matcha-TTS model excels in English, but to bring its capabilities to Catalan, a multi-step process was undertaken. Firstly, we fine-tuned the model from English to Catalan central (Matxa-base), which laid the groundwork for understanding the language's nuances. This first fine-tuning from English was done using two datasets:
228
-
229
- * [Our version of the openslr-slr69 dataset.](https://huggingface.co/datasets/projecte-aina/openslr-slr69-ca-trimmed-denoised)
230
-
231
- * [Our version of the Festcat dataset.](https://huggingface.co/datasets/projecte-aina/festcat_trimmed_denoised)
232
 
233
  Then we further fine-tuned the single accent Catalan Matxa-based model with the soon to be published LaFrescat dataset that has 8.5 hours of recordings for four dialectal variants:
234
 
 
224
 
225
  ## Adaptation to Catalan
226
 
227
+ The original Matcha-TTS model excels in English; to adapt it to Catalan, we carried out a multi-stage process.
228
+ First, we fine-tuned the English model to Central Catalan, producing Matxa-base, using a 100-hour subset of the CommonVoice v.16 Catalan dataset.
229
+ This small subset was selected automatically with UTMOS, a system that predicts the Mean Opinion Score (MOS), a metric usually assigned by human listeners according to their subjective perception of speech quality.
 
 
230
 
231
  Then we further fine-tuned the single accent Catalan Matxa-based model with the soon to be published LaFrescat dataset that has 8.5 hours of recordings for four dialectal variants:
232
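
The added lines describe picking a 100-hour subset by predicted MOS, but do not say how the selection works. A minimal sketch of one plausible approach follows, assuming each clip already has a duration and a predicted MOS computed elsewhere (for example by UTMOS). The `Clip` class, the `select_top_clips` function, and the greedy fill strategy are all hypothetical illustrations, not the authors' actual pipeline.

```python
from dataclasses import dataclass


@dataclass
class Clip:
    path: str             # hypothetical clip identifier
    duration_s: float     # clip length in seconds
    predicted_mos: float  # score from a MOS predictor, computed elsewhere


def select_top_clips(clips, budget_hours):
    """Greedily keep the highest-scoring clips until the hour budget is filled.

    Assumption: a simple greedy fill; the real selection may use a score
    threshold or per-speaker balancing instead.
    """
    budget_s = budget_hours * 3600.0
    selected, total_s = [], 0.0
    for clip in sorted(clips, key=lambda c: c.predicted_mos, reverse=True):
        if total_s + clip.duration_s > budget_s:
            continue  # this clip would overflow the budget; a shorter one may still fit
        selected.append(clip)
        total_s += clip.duration_s
    return selected, total_s


# Toy data: four clips, one-hour budget.
clips = [
    Clip("a.wav", 1800.0, 4.2),
    Clip("b.wav", 1800.0, 3.1),
    Clip("c.wav", 1800.0, 4.5),
    Clip("d.wav", 600.0, 2.0),
]
selected, total_s = select_top_clips(clips, budget_hours=1.0)
```

With a 100-hour budget, the same call would be `select_top_clips(clips, budget_hours=100.0)`. Sorting by score first means lower-quality clips are only considered once the better ones have been placed.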