whisper-base-khmer-mlx-fp16

This model was converted to MLX format from openai-whisper-base, then fine-tined to Khmer language using two datasets:

It achieves the following word error rate (wer) on 2 popular datasets:

62.9% on test split of google/fleurs km-kh
55.3% on train split of openslr/openslr SLR42

NOTE MLX format is usable for M-chip series of Apple.

Use with mlx

pip install mlx-whisper

Write a python script, example.py, as the following

import mlx_whisper

result = mlx_whisper.transcribe(
    SPEECH_FILE_NAME,
    path_or_hf_repo="mlx-community/whisper-base-khmer-mlx-fp16",
    fp16=True
)
print(result['text'])

Then execute this script example.py to see the result.

You can also use command line in terminal

mlx_whisper --model mlx-community/whisper-base-khmer-mlx-fp16 --task transcribe SPEECH_FILE_NAME --fp16 True

Downloads last month: 44

Safetensors

Model size

71.8M params

Tensor type

I64

F16

Dataset used to train mlx-community/whisper-base-khmer-mlx-fp16

Evaluation results

test on test split of "km_kh" in google/fleurs
self-reported

62.9%
test on train split of "SLR42" in openslr/openslr
self-reported

55.3%

View on Papers With Code