Update README.md
Browse files
README.md
CHANGED
@@ -6,12 +6,18 @@ tags:
|
|
6 |
- jvs
|
7 |
- pyopenjtalk
|
8 |
- speech-to-text
|
|
|
9 |
---
|
10 |
|
11 |
# SpeechT5 (TTS task) for Japanese
|
12 |
-
SpeechT5 model fine-tuned for speech synthesis (text-to-speech) on [JVS]("https://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus").
|
|
|
|
|
|
|
|
|
13 |
Trained from [microsoft/speecht5_tts](https://huggingface.co/microsoft/speecht5_tts).
|
14 |
-
Modified tokenizer powered by [Open Jtalk](https://open-jtalk.sp.nitech.ac.jp/)
|
|
|
15 |
|
16 |
# Model description
|
17 |
See [original model card](https://huggingface.co/microsoft/speecht5_tts#model-description)
|
|
|
6 |
- jvs
|
7 |
- pyopenjtalk
|
8 |
- speech-to-text
|
9 |
+
pipeline_tag: text-to-speech
|
10 |
---
|
11 |
|
12 |
# SpeechT5 (TTS task) for Japanese
|
13 |
+
SpeechT5 model fine-tuned for Japanese speech synthesis (text-to-speech) on [JVS]("https://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus").
|
14 |
+
This model utilizes the JVS dataset which encompasses 100 speakers.
|
15 |
+
From this dataset, speaker embeddings were crafted, segregating them based on male and female voice types, and producing a unique speaker embedding vector.
|
16 |
+
This 16-dimensional speaker embedding vector is designed with an aim to provide a voice quality that is independent of any specific speaker.
|
17 |
+
|
18 |
Trained from [microsoft/speecht5_tts](https://huggingface.co/microsoft/speecht5_tts).
|
19 |
+
Modified tokenizer powered by [Open Jtalk](https://open-jtalk.sp.nitech.ac.jp/).
|
20 |
+
|
21 |
|
22 |
# Model description
|
23 |
See [original model card](https://huggingface.co/microsoft/speecht5_tts#model-description)
|