projecte-aina
/

matxa-tts-cat-multiaccent

acoustic modelling

Model card Files Files and versions Community

Baybars commited on Apr 18

Commit

77e448e

•

1 Parent(s): 04e5354

emoji added

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -12,7 +12,7 @@ pipeline_tag: text-to-speech
 license: cc-by-nc-4.0
 ---
-# Matxa-TTS (Matcha-TTS) Catalan Multiaccent
 ## Table of Contents
 <details>
@@ -30,7 +30,7 @@ license: cc-by-nc-4.0
 ## Model Description
-**Matxa-TTS** is based on **Matcha-TTS** that is an encoder-decoder architecture designed for fast acoustic modelling in TTS.
 The encoder part is based on a text encoder and a phoneme duration prediction that together predict averaged acoustic features.
 And the decoder has essentially a U-Net backbone inspired by [Grad-TTS](https://arxiv.org/pdf/2105.06337.pdf), which is based on the Transformer architecture.
 In the latter, by replacing 2D CNNs by 1D CNNs, a large reduction in memory consumption and fast synthesis is achieved.

 license: cc-by-nc-4.0
 ---
+# 🍵 Matxa-TTS (Matcha-TTS) Catalan Multiaccent
 ## Table of Contents
 <details>
 ## Model Description
+🍵 **Matxa-TTS** is based on **Matcha-TTS** that is an encoder-decoder architecture designed for fast acoustic modelling in TTS.
 The encoder part is based on a text encoder and a phoneme duration prediction that together predict averaged acoustic features.
 And the decoder has essentially a U-Net backbone inspired by [Grad-TTS](https://arxiv.org/pdf/2105.06337.pdf), which is based on the Transformer architecture.
 In the latter, by replacing 2D CNNs by 1D CNNs, a large reduction in memory consumption and fast synthesis is achieved.