Update README.md
README.md
CHANGED
@@ -15,7 +15,7 @@ datasets:
|
|
15 |
- projecte-aina/openslr-slr69-ca-trimmed-denoised
|
16 |
---
|
17 |
|
18 |
-
# Matcha
|
19 |
|
20 |
## Table of Contents
|
21 |
<details>
|
@@ -33,6 +33,11 @@ datasets:
|
|
33 |
|
34 |
## Model description
|
35 |
|
36 |
## Intended uses and limitations
|
37 |
|
38 |
## How to use
|
|
|
15 |
- projecte-aina/openslr-slr69-ca-trimmed-denoised
|
16 |
---
|
17 |
|
18 |
+
# Matcha-TTS Catalan Multispeaker
|
19 |
|
20 |
## Table of Contents
|
21 |
<details>
|
|
|
33 |
|
34 |
## Model description
|
35 |
|
36 |
+
Matcha-TTS is an encoder-decoder architecture designed for fast acoustic modelling in text-to-speech (TTS). The encoder is inspired by previous work (Grad-TTS and Glow-TTS),
|
37 |
+
modelling alignment with Monotonic Alignment Search (MAS). The decoder is essentially a U-Net inspired by Grad-TTS, built from Transformer blocks combined with 1D CNNs,
|
38 |
+
which substantially reduces memory consumption while increasing synthesis speed. Matcha-TTS is probabilistic and non-autoregressive, and is trained using optimal-transport
|
39 |
+
conditional flow matching (OT-CFM). This yields an ODE-based decoder capable of high output quality in fewer synthesis steps than models trained using score matching.
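
The OT-CFM idea described above can be illustrated with a toy sketch. It is not taken from the Matcha-TTS codebase: the function names, the `SIGMA_MIN` value, and the use of plain Python lists in place of mel-spectrogram tensors are all assumptions made for illustration. It shows the straight-line conditional path and its regression target, and why an Euler ODE solver can follow such a path in very few steps.

```python
SIGMA_MIN = 1e-4  # assumed illustrative value, not read from the model config


def ot_cfm_pair(x0, x1, t, sigma_min=SIGMA_MIN):
    """Return (x_t, u_t) for the OT conditional path between noise x0 and data x1.

    x_t is the point on the straight path at time t in [0, 1];
    u_t is the velocity a network would be trained to regress.
    """
    scale = 1.0 - (1.0 - sigma_min) * t
    x_t = [scale * a + t * b for a, b in zip(x0, x1)]
    u_t = [b - (1.0 - sigma_min) * a for a, b in zip(x0, x1)]
    return x_t, u_t


def conditional_field(x, t, x1, sigma_min=SIGMA_MIN):
    """Closed-form conditional vector field for a single known target x1.

    A trained decoder would approximate the marginal field instead;
    this hypothetical stand-in keeps the example self-contained.
    """
    denom = 1.0 - (1.0 - sigma_min) * t
    return [(b - (1.0 - sigma_min) * a) / denom for a, b in zip(x, x1)]


def euler_solve(field, x0, n_steps):
    """Integrate dx/dt = field(x, t) from t=0 to t=1 with fixed-step Euler."""
    x = list(x0)
    dt = 1.0 / n_steps
    for i in range(n_steps):
        t = i * dt
        v = field(x, t)
        x = [a + dt * b for a, b in zip(x, v)]
    return x


if __name__ == "__main__":
    noise = [0.5, -1.0, 2.0]   # stand-in for a Gaussian sample
    target = [1.0, 0.0, -1.0]  # stand-in for a mel-spectrogram frame
    for steps in (1, 2, 10):
        out = euler_solve(lambda x, t: conditional_field(x, t, target), noise, steps)
        print(steps, out)
```

Because the conditional path is a straight line, the velocity is constant along it, so in this toy setting even a single Euler step lands on the target (up to the small `SIGMA_MIN` residual). A real decoder learns the marginal field over many targets, so it typically needs more than one step.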
|
40 |
+
|
41 |
## Intended uses and limitations
|
42 |
|
43 |
## How to use
|