huseinzol05 commited on
Commit
7e0014c
·
1 Parent(s): f1b59a6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -9,9 +9,9 @@ language:
9
  Distil Whisper Large V3 on Malaysian dataset,
10
  1. IMDA STT, https://huggingface.co/datasets/mesolitica/IMDA-STT
11
  2. Pseudolabel Malaysian youtube videos, https://huggingface.co/datasets/mesolitica/pseudolabel-malaysian-youtube-whisper-large-v3
 
 
12
 
13
- We follow exact distillation process from https://github.com/huggingface/distil-whisper with minor changes.
14
-
15
- Script at https://github.com/mesolitica/malaya-speech/tree/malaysian-speech/session/distill-whisper
16
 
17
  Wandb at https://wandb.ai/huseinzol05/distil-whisper?workspace=user-huseinzol05
 
9
  Distil Whisper Large V3 on Malaysian dataset,
10
  1. IMDA STT, https://huggingface.co/datasets/mesolitica/IMDA-STT
11
  2. Pseudolabel Malaysian youtube videos, https://huggingface.co/datasets/mesolitica/pseudolabel-malaysian-youtube-whisper-large-v3
12
+ 3. Malay Conversational Speech Corpus, https://huggingface.co/datasets/malaysia-ai/malay-conversational-speech-corpus
13
+ 4. Haqkiem TTS Dataset, this is private, but you request access from https://www.linkedin.com/in/haqkiem-daim/
14
 
15
+ We follow exact distillation process from https://github.com/huggingface/distil-whisper with minor changes, script at https://github.com/mesolitica/malaya-speech/tree/malaysian-speech/session/distill-whisper
 
 
16
 
17
  Wandb at https://wandb.ai/huseinzol05/distil-whisper?workspace=user-huseinzol05