Update about.md
Browse files
about.md
CHANGED
@@ -224,11 +224,9 @@ This version is tailored for the Catalan language, as it was trained only on Cat
|
|
224 |
|
225 |
## Adaptation to Catalan
|
226 |
|
227 |
-
The original Matcha-TTS model excels in English, but to
|
228 |
-
|
229 |
-
|
230 |
-
|
231 |
-
* [Our version of the Festcat dataset.](https://huggingface.co/datasets/projecte-aina/festcat_trimmed_denoised)
|
232 |
|
233 |
Then we further fine-tuned the single accent Catalan Matxa-based model with the soon to be published LaFrescat dataset that has 8.5 hours of recordings for four dialectal variants:
|
234 |
|
|
|
224 |
|
225 |
## Adaptation to Catalan
|
226 |
|
227 |
+
The original Matcha-TTS model excels in English, but to adapt it to Catalan, we have carried out a multi-stage process.
|
228 |
+
First, we fine-tuned the English to Central Catalan model by creating a Matxa-base, using a 100h subset of the CommonVoice v.16 Catalan database.
|
229 |
+
The selection of this small set of samples has been performed automatically by using the UTMOS system, a predictor of values of the metric Mean Opinion Score (MOS) a score that is usually set by humans according to their subjective perception of the quality of the speech.
|
|
|
|
|
230 |
|
231 |
Then we further fine-tuned the single accent Catalan Matxa-based model with the soon to be published LaFrescat dataset that has 8.5 hours of recordings for four dialectal variants:
|
232 |
|