Pooya-Fallah's picture
Update README.md
5689893 verified
metadata
language:
  - en
license: apache-2.0
base_model: openai/whisper-tiny.en
tags:
  - generated_from_trainer
datasets:
  - mispeech/speechocean762
metrics:
  - wer
model-index:
  - name: Whisper Tiny En - speechocean762
    results:
      - task:
          name: Automatic Speech Recognition
          type: automatic-speech-recognition
        dataset:
          name: speechocean762
          type: mispeech/speechocean762
        metrics:
          - name: Wer
            type: wer
            value: 38.84869455803712

Whisper Tiny En - speechocean762

This model is a fine-tuned version of openai/whisper-tiny.en on the speechocean762 dataset. It achieves the following results on the evaluation set (Best results):

  • Loss: 0.6837
  • Wer: 28.3422

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 1e-05
  • train_batch_size: 128
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 250
  • training_steps: 500

Training results

Training Loss Epoch Step Validation Loss Wer
4.0683 0.2778 10 4.0618 38.3139
3.9216 0.5556 20 3.8951 37.4961
3.7275 0.8333 30 3.6337 45.8635
3.3621 1.1111 40 3.2960 36.4580
2.9818 1.3889 50 2.8749 40.7046
2.5404 1.6667 60 2.3590 44.1648
1.9537 1.9444 70 1.7972 49.4495
1.4184 2.2222 80 1.3603 66.3731
1.1875 2.5 90 1.1660 55.5206
1.1203 2.7778 100 1.0743 44.8569
1.024 3.0556 110 1.0085 44.2277
0.905 3.3333 120 0.9581 42.4347
0.8787 3.6111 130 0.9169 40.9877
0.8677 3.8889 140 0.8844 37.2130
0.7563 4.1667 150 0.8573 36.4895
0.7497 4.4444 160 0.8324 35.9862
0.7283 4.7222 170 0.8097 35.2941
0.7055 5.0 180 0.7907 30.6071
0.6259 5.2778 190 0.7770 30.9531
0.6115 5.5556 200 0.7601 30.3555
0.5998 5.8333 210 0.7457 29.8207
0.5752 6.1111 220 0.7368 29.9465
0.5031 6.3889 230 0.7284 29.7892
0.5079 6.6667 240 0.7140 29.0028
0.4969 6.9444 250 0.7006 29.3174
0.4285 7.2222 260 0.6951 32.5889
0.466 7.5 270 0.6886 31.6766
0.4101 7.7778 280 0.6837 28.3422
0.4021 8.0556 290 0.6755 31.4250
0.359 8.3333 300 0.6763 32.5260
0.3281 8.6111 310 0.6727 32.2114
0.3604 8.8889 320 0.6695 36.1120
0.3085 9.1667 330 0.6698 32.1799
0.3007 9.4444 340 0.6698 32.3372
0.3313 9.7222 350 0.6659 35.7974
0.2862 10.0 360 0.6638 32.0226
0.278 10.2778 370 0.6639 31.9912
0.2645 10.5556 380 0.6639 32.0856
0.2708 10.8333 390 0.6649 32.0541
0.257 11.1111 400 0.6620 32.1799
0.2455 11.3889 410 0.6621 31.8025
0.2506 11.6667 420 0.6636 38.9745
0.2545 11.9444 430 0.6635 38.9116
0.2266 12.2222 440 0.6644 31.8339
0.2072 12.5 450 0.6652 32.1799
0.2382 12.7778 460 0.6661 31.9597
0.219 13.0556 470 0.6653 38.7858
0.2256 13.3333 480 0.6649 38.9431
0.2178 13.6111 490 0.6652 38.9431
0.2229 13.8889 500 0.6654 38.8487

Framework versions

  • Transformers 4.41.1
  • Pytorch 2.3.0+cu121
  • Datasets 2.19.2.dev0
  • Tokenizers 0.19.1