steja
/

whisper-small-telugu-large-data

@@ -7,7 +7,6 @@ tags:
 - generated_from_trainer
 datasets:
 - openslr
-- google/fleurs
 metrics:
 - wer
 model-index:
@@ -19,12 +18,12 @@ model-index:
     dataset:
       name: google/fleurs
       type: openslr
-      config: "te_in"
       split: None
     metrics:
     - name: Wer
       type: wer
-      value: 123.97713870997006
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -34,8 +33,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the google/fleurs dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.6348
-- Wer: 123.9771
 ## Model description
@@ -55,23 +54,34 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
-- train_batch_size: 16
-- eval_batch_size: 64
 - seed: 42
 - gradient_accumulation_steps: 2
 - total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- training_steps: 50
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Wer      |
-|:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 2.2343        | 0.11  | 25   | 1.8167          | 116.8103 |
-| 1.9575        | 0.23  | 50   | 1.6348          | 123.9771 |
 ### Framework versions

 - generated_from_trainer
 datasets:
 - openslr
 metrics:
 - wer
 model-index:
     dataset:
       name: google/fleurs
       type: openslr
+      config: null
       split: None
     metrics:
     - name: Wer
       type: wer
+      value: 38.84604916991744
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the google/fleurs dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3310
+- Wer: 38.8460
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
+- train_batch_size: 4
+- eval_batch_size: 16
 - seed: 42
+- distributed_type: multi-GPU
+- num_devices: 4
 - gradient_accumulation_steps: 2
 - total_train_batch_size: 32
+- total_eval_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- training_steps: 5000
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Wer     |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|
+| 0.128         | 2.27  | 500  | 0.2015          | 45.1692 |
+| 0.0462        | 4.55  | 1000 | 0.1877          | 41.1050 |
+| 0.0184        | 6.82  | 1500 | 0.2241          | 40.5153 |
+| 0.0045        | 9.09  | 2000 | 0.2590          | 39.7260 |
+| 0.0019        | 11.36 | 2500 | 0.2824          | 39.0819 |
+| 0.0006        | 13.64 | 3000 | 0.3002          | 38.9096 |
+| 0.0002        | 15.91 | 3500 | 0.3141          | 38.5920 |
+| 0.0001        | 18.18 | 4000 | 0.3232          | 38.7463 |
+| 0.0001        | 20.45 | 4500 | 0.3289          | 38.8370 |
+| 0.0001        | 22.73 | 5000 | 0.3310          | 38.8460 |
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4d5e743cdb1716e4b7a87803d4cb8b08a4d8d10de76db10b9dacdc5c37614843
 size 967102601

 version https://git-lfs.github.com/spec/v1
+oid sha256:ba236c002f17aa00cc8f69b97ca0e3f5094376acee0731a60669249fb715b069
 size 967102601