---
language:
- ms
- en
---
# Malaysian Distil Whisper Large V3
Distil Whisper Large V3 trained on Malaysian datasets:

1. IMDA STT, https://huggingface.co/datasets/mesolitica/IMDA-STT
2. Pseudolabeled Malaysian YouTube videos, https://huggingface.co/datasets/mesolitica/pseudolabel-malaysian-youtube-whisper-large-v3

We follow the exact distillation process from https://github.com/huggingface/distil-whisper, with minor changes.
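
For readers unfamiliar with that process: the upstream Distil-Whisper recipe trains the student on a weighted sum of a KL-divergence term against the teacher's token distribution and a cross-entropy term against the teacher's pseudo-labels. The following is a minimal pure-Python sketch of that objective, not the training code used here. The function names, the toy logits, and the default weights `kl_weight` and `ce_weight` are illustrative assumptions, and the "minor changes" mentioned above are not reflected.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over one token's vocabulary logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def distillation_loss(student_logits, teacher_logits, pseudo_labels,
                      kl_weight=0.8, ce_weight=1.0):
    """Weighted KL(teacher || student) plus cross-entropy on pseudo-labels.

    Each *_logits argument is a list of per-token logit rows, and
    pseudo_labels holds one target token id per row. Both terms are
    averaged over tokens. The default weights are assumptions chosen
    for illustration only.
    """
    total_kl = 0.0
    total_ce = 0.0
    for s_row, t_row, label in zip(student_logits, teacher_logits, pseudo_labels):
        s_logp = log_softmax(s_row)
        t_logp = log_softmax(t_row)
        # KL(teacher || student) = sum_v p_t(v) * (log p_t(v) - log p_s(v))
        total_kl += sum(math.exp(tl) * (tl - sl) for tl, sl in zip(t_logp, s_logp))
        # Cross-entropy of the student against the pseudo-label token
        total_ce += -s_logp[label]
    n = len(pseudo_labels)
    return kl_weight * total_kl / n + ce_weight * total_ce / n


# Toy example: two decoding steps over a three-token vocabulary.
student = [[2.0, 0.5, -1.0], [0.1, 1.5, 0.3]]
teacher = [[2.5, 0.0, -1.5], [0.0, 2.0, 0.0]]
labels = [0, 1]
loss = distillation_loss(student, teacher, labels)
```

When the student's logits match the teacher's exactly, the KL term vanishes and only the pseudo-label cross-entropy remains.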

The training scripts are at https://github.com/mesolitica/malaya-speech/tree/malaysian-speech/session/distill-whisper

Training logs are on Weights & Biases at https://wandb.ai/huseinzol05/distil-whisper?workspace=user-huseinzol05