zionia's picture
Add model card
2ca0402 verified
metadata
language: zu
license: apache-2.0
tags:
  - whisper
  - automatic-speech-recognition
  - south-african-languages
datasets:
  - zionia/isizulu-asr-train
  - zionia/isizulu-asr-gaussian-noise
base_model: openai/whisper-small

Whisper Small - Combined South African Languages

This model is a fine-tuned version of openai/whisper-small on multiple South African language datasets.

Training Data

This model was trained on 2 combined datasets:

  • zionia/isizulu-asr-train
  • zionia/isizulu-asr-gaussian-noise

Total training samples: 1050 Total test samples: 263

Training Details

  • Training epochs: 15
  • Learning rate: 1e-05
  • Batch size: 16
  • Best WER: 63.57%

Usage

from transformers import pipeline

pipe = pipeline("automatic-speech-recognition", model="zionia/whisper-small-isizulu-noisy")
result = pipe("path/to/audio.wav")