GogetaBlueMUI commited on
Commit
19e4456
·
verified ·
1 Parent(s): 97573b8

End of training

Browse files
Files changed (2) hide show
  1. README.md +16 -17
  2. generation_config.json +1 -3
README.md CHANGED
@@ -7,7 +7,7 @@ base_model: openai/whisper-small
7
  tags:
8
  - generated_from_trainer
9
  datasets:
10
- - mozilla-foundation/common_voice_11_0
11
  metrics:
12
  - wer
13
  model-index:
@@ -17,15 +17,15 @@ model-index:
17
  name: Automatic Speech Recognition
18
  type: automatic-speech-recognition
19
  dataset:
20
- name: Common Voice 11.0
21
- type: mozilla-foundation/common_voice_11_0
22
  config: ur
23
  split: test
24
  args: 'config: ur, split: test'
25
  metrics:
26
  - name: Wer
27
  type: wer
28
- value: 35.41545725453771
29
  ---
30
 
31
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -33,10 +33,10 @@ should probably proofread and complete it, then remove this comment. -->
33
 
34
  # Whisper Small Ur - Muhammad Abdullah
35
 
36
- This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 11.0 dataset.
37
  It achieves the following results on the evaluation set:
38
- - Loss: 0.6090
39
- - Wer: 35.4155
40
 
41
  ## Model description
42
 
@@ -55,28 +55,27 @@ More information needed
55
  ### Training hyperparameters
56
 
57
  The following hyperparameters were used during training:
58
- - learning_rate: 1e-05
59
  - train_batch_size: 16
60
  - eval_batch_size: 8
61
  - seed: 42
62
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
63
  - lr_scheduler_type: linear
64
- - lr_scheduler_warmup_steps: 500
65
- - training_steps: 1200
66
  - mixed_precision_training: Native AMP
67
 
68
  ### Training results
69
 
70
- | Training Loss | Epoch | Step | Validation Loss | Wer |
71
- |:-------------:|:------:|:----:|:---------------:|:-------:|
72
- | 0.451 | 1.5444 | 400 | 0.6250 | 39.7229 |
73
- | 0.133 | 3.0888 | 800 | 0.5866 | 37.1028 |
74
- | 0.0726 | 4.6332 | 1200 | 0.6090 | 35.4155 |
75
 
76
 
77
  ### Framework versions
78
 
79
- - Transformers 4.47.0
80
  - Pytorch 2.5.1+cu121
81
- - Datasets 3.1.0
82
  - Tokenizers 0.21.0
 
7
  tags:
8
  - generated_from_trainer
9
  datasets:
10
+ - mozilla-foundation/common_voice_17_0
11
  metrics:
12
  - wer
13
  model-index:
 
17
  name: Automatic Speech Recognition
18
  type: automatic-speech-recognition
19
  dataset:
20
+ name: Common Voice 17.0
21
+ type: mozilla-foundation/common_voice_17_0
22
  config: ur
23
  split: test
24
  args: 'config: ur, split: test'
25
  metrics:
26
  - name: Wer
27
  type: wer
28
+ value: 92.16802838986547
29
  ---
30
 
31
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
33
 
34
  # Whisper Small Ur - Muhammad Abdullah
35
 
36
+ This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 17.0 dataset.
37
  It achieves the following results on the evaluation set:
38
+ - Loss: 0.6097
39
+ - Wer: 92.1680
40
 
41
  ## Model description
42
 
 
55
  ### Training hyperparameters
56
 
57
  The following hyperparameters were used during training:
58
+ - learning_rate: 5e-06
59
  - train_batch_size: 16
60
  - eval_batch_size: 8
61
  - seed: 42
62
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
63
  - lr_scheduler_type: linear
64
+ - lr_scheduler_warmup_steps: 100
65
+ - training_steps: 500
66
  - mixed_precision_training: Native AMP
67
 
68
  ### Training results
69
 
70
+ | Training Loss | Epoch | Step | Validation Loss | Wer |
71
+ |:-------------:|:------:|:----:|:---------------:|:--------:|
72
+ | 0.6086 | 0.7440 | 250 | 0.6541 | 102.7675 |
73
+ | 0.4791 | 1.4881 | 500 | 0.6097 | 92.1680 |
 
74
 
75
 
76
  ### Framework versions
77
 
78
+ - Transformers 4.47.1
79
  - Pytorch 2.5.1+cu121
80
+ - Datasets 3.2.0
81
  - Tokenizers 0.21.0
generation_config.json CHANGED
@@ -160,7 +160,6 @@
160
  "<|yo|>": 50325,
161
  "<|zh|>": 50260
162
  },
163
- "language": "ur",
164
  "max_initial_timestamp_index": 50,
165
  "max_length": 448,
166
  "no_timestamps_token_id": 50363,
@@ -255,10 +254,9 @@
255
  50361,
256
  50362
257
  ],
258
- "task": "transcribe",
259
  "task_to_id": {
260
  "transcribe": 50359,
261
  "translate": 50358
262
  },
263
- "transformers_version": "4.47.0"
264
  }
 
160
  "<|yo|>": 50325,
161
  "<|zh|>": 50260
162
  },
 
163
  "max_initial_timestamp_index": 50,
164
  "max_length": 448,
165
  "no_timestamps_token_id": 50363,
 
254
  50361,
255
  50362
256
  ],
 
257
  "task_to_id": {
258
  "transcribe": 50359,
259
  "translate": 50358
260
  },
261
+ "transformers_version": "4.47.1"
262
  }