huseinzol05
commited on
Commit
·
7e0014c
1
Parent(s):
f1b59a6
Update README.md
Browse files
README.md
CHANGED
@@ -9,9 +9,9 @@ language:
|
|
9 |
Distil Whisper Large V3 on Malaysian dataset,
|
10 |
1. IMDA STT, https://huggingface.co/datasets/mesolitica/IMDA-STT
|
11 |
2. Pseudolabel Malaysian youtube videos, https://huggingface.co/datasets/mesolitica/pseudolabel-malaysian-youtube-whisper-large-v3
|
|
|
|
|
12 |
|
13 |
-
We follow exact distillation process from https://github.com/huggingface/distil-whisper with minor changes.
|
14 |
-
|
15 |
-
Script at https://github.com/mesolitica/malaya-speech/tree/malaysian-speech/session/distill-whisper
|
16 |
|
17 |
Wandb at https://wandb.ai/huseinzol05/distil-whisper?workspace=user-huseinzol05
|
|
|
9 |
Distil Whisper Large V3 on Malaysian dataset,
|
10 |
1. IMDA STT, https://huggingface.co/datasets/mesolitica/IMDA-STT
|
11 |
2. Pseudolabel Malaysian youtube videos, https://huggingface.co/datasets/mesolitica/pseudolabel-malaysian-youtube-whisper-large-v3
|
12 |
+
3. Malay Conversational Speech Corpus, https://huggingface.co/datasets/malaysia-ai/malay-conversational-speech-corpus
|
13 |
+
4. Haqkiem TTS Dataset, this is private, but you request access from https://www.linkedin.com/in/haqkiem-daim/
|
14 |
|
15 |
+
We follow exact distillation process from https://github.com/huggingface/distil-whisper with minor changes, script at https://github.com/mesolitica/malaya-speech/tree/malaysian-speech/session/distill-whisper
|
|
|
|
|
16 |
|
17 |
Wandb at https://wandb.ai/huseinzol05/distil-whisper?workspace=user-huseinzol05
|