cantillation committed
Commit: 9457afd
Parent(s): 0d17908

Model save
README.md CHANGED
@@ -1,45 +1,42 @@
  ---
- language:
- - he
  license: apache-2.0
- base_model: openai/whisper-medium
  tags:
- - hf-asr-leaderboard
  - generated_from_trainer
  metrics:
  - wer
  model-index:
- - name: he
  results: []
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->
 
- # he
 
- This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.8710
- - Wer: 98.7805
- - Avg Precision Exact: 0.0834
- - Avg Recall Exact: 0.0774
- - Avg F1 Exact: 0.0795
- - Avg Precision Letter Shift: 0.1227
- - Avg Recall Letter Shift: 0.1114
- - Avg F1 Letter Shift: 0.1157
- - Avg Precision Word Level: 0.1433
- - Avg Recall Word Level: 0.1299
- - Avg F1 Word Level: 0.1351
- - Avg Precision Word Shift: 0.2771
- - Avg Recall Word Shift: 0.2479
- - Avg F1 Word Shift: 0.2589
- - Precision Median Exact: 0.0714
- - Recall Median Exact: 0.0556
- - F1 Median Exact: 0.0645
- - Precision Max Exact: 0.3333
- - Recall Max Exact: 0.3636
- - F1 Max Exact: 0.3478
  - Precision Min Exact: 0.0
  - Recall Min Exact: 0.0
  - F1 Min Exact: 0.0
@@ -70,28 +67,27 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - learning_rate: 1e-05
- - train_batch_size: 1
  - eval_batch_size: 32
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - training_steps: 80
  - mixed_precision_training: Native AMP
 
  ### Training results
 
- | Training Loss | Epoch | Step | Validation Loss | Wer | Avg Precision Exact | Avg Recall Exact | Avg F1 Exact | Avg Precision Letter Shift | Avg Recall Letter Shift | Avg F1 Letter Shift | Avg Precision Word Level | Avg Recall Word Level | Avg F1 Word Level | Avg Precision Word Shift | Avg Recall Word Shift | Avg F1 Word Shift | Precision Median Exact | Recall Median Exact | F1 Median Exact | Precision Max Exact | Recall Max Exact | F1 Max Exact | Precision Min Exact | Recall Min Exact | F1 Min Exact | Precision Min Letter Shift | Recall Min Letter Shift | F1 Min Letter Shift | Precision Min Word Level | Recall Min Word Level | F1 Min Word Level | Precision Min Word Shift | Recall Min Word Shift | F1 Min Word Shift |
- |:-------------:|:-----:|:----:|:---------------:|:--------:|:-------------------:|:----------------:|:------------:|:--------------------------:|:-----------------------:|:-------------------:|:------------------------:|:---------------------:|:-----------------:|:------------------------:|:---------------------:|:-----------------:|:----------------------:|:-------------------:|:---------------:|:-------------------:|:----------------:|:------------:|:-------------------:|:----------------:|:------------:|:--------------------------:|:-----------------------:|:-------------------:|:------------------------:|:---------------------:|:-----------------:|:------------------------:|:---------------------:|:-----------------:|
- | No log | 0.0 | 20 | 2.9186 | 101.6260 | 0.0171 | 0.0289 | 0.0210 | 0.0322 | 0.0338 | 0.0293 | 0.0481 | 0.0869 | 0.0581 | 0.1638 | 0.2132 | 0.1777 | 0.0 | 0.0 | 0.0 | 0.1538 | 0.25 | 0.1667 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
- | 3.58 | 0.0 | 40 | 2.1540 | 108.1301 | 0.0579 | 0.0712 | 0.0631 | 0.1109 | 0.1375 | 0.1217 | 0.1321 | 0.1689 | 0.1470 | 0.2462 | 0.3116 | 0.2727 | 0.0625 | 0.0833 | 0.0741 | 0.2222 | 0.2222 | 0.2222 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
- | 1.8957 | 0.0 | 60 | 1.9215 | 102.0325 | 0.0804 | 0.0967 | 0.0861 | 0.1169 | 0.1421 | 0.1264 | 0.1370 | 0.1739 | 0.1512 | 0.2660 | 0.3394 | 0.2929 | 0.0833 | 0.0909 | 0.0833 | 0.25 | 0.4286 | 0.3158 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
- | 1.2925 | 0.0 | 80 | 1.8710 | 98.7805 | 0.0834 | 0.0774 | 0.0795 | 0.1227 | 0.1114 | 0.1157 | 0.1433 | 0.1299 | 0.1351 | 0.2771 | 0.2479 | 0.2589 | 0.0714 | 0.0556 | 0.0645 | 0.3333 | 0.3636 | 0.3478 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
 
 
  ### Framework versions
 
- - Transformers 4.39.0.dev0
- - Pytorch 2.2.1+cu121
- - Datasets 2.16.1
- - Tokenizers 0.15.0
 
  ---
  license: apache-2.0
+ base_model: openai/whisper-tiny
  tags:
  - generated_from_trainer
  metrics:
  - wer
  model-index:
+ - name: test
  results: []
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->
 
+ # test
 
+ This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on an unknown dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 8.6412
+ - Wer: 161.3707
+ - Avg Precision Exact: 0.0022
+ - Avg Recall Exact: 0.0010
+ - Avg F1 Exact: 0.0013
+ - Avg Precision Letter Shift: 0.0160
+ - Avg Recall Letter Shift: 0.0017
+ - Avg F1 Letter Shift: 0.0030
+ - Avg Precision Word Level: 0.0171
+ - Avg Recall Word Level: 0.0191
+ - Avg F1 Word Level: 0.0124
+ - Avg Precision Word Shift: 0.0892
+ - Avg Recall Word Shift: 0.0453
+ - Avg F1 Word Shift: 0.0484
+ - Precision Median Exact: 0.0
+ - Recall Median Exact: 0.0
+ - F1 Median Exact: 0.0
+ - Precision Max Exact: 0.0667
+ - Recall Max Exact: 0.0303
+ - F1 Max Exact: 0.0417
  - Precision Min Exact: 0.0
  - Recall Min Exact: 0.0
  - F1 Min Exact: 0.0
 
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
+ - learning_rate: 1e-06
+ - train_batch_size: 8
  - eval_batch_size: 32
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
+ - lr_scheduler_warmup_steps: 20
+ - training_steps: 5
  - mixed_precision_training: Native AMP
 
  ### Training results
 
+ | Training Loss | Epoch | Step | Validation Loss | Wer | Avg Precision Exact | Avg Recall Exact | Avg F1 Exact | Avg Precision Letter Shift | Avg Recall Letter Shift | Avg F1 Letter Shift | Avg Precision Word Level | Avg Recall Word Level | Avg F1 Word Level | Avg Precision Word Shift | Avg Recall Word Shift | Avg F1 Word Shift | Precision Median Exact | Recall Median Exact | F1 Median Exact | Precision Max Exact | Recall Max Exact | F1 Max Exact | Precision Min Exact | Recall Min Exact | F1 Min Exact | Precision Min Letter Shift | Recall Min Letter Shift | F1 Min Letter Shift | Precision Min Word Level | Recall Min Word Level | F1 Min Word Level | Precision Min Word Shift | Recall Min Word Shift | F1 Min Word Shift |
+ |:-------------:|:------:|:----:|:---------------:|:--------:|:-------------------:|:----------------:|:------------:|:--------------------------:|:-----------------------:|:-------------------:|:------------------------:|:---------------------:|:-----------------:|:------------------------:|:---------------------:|:-----------------:|:----------------------:|:-------------------:|:---------------:|:-------------------:|:----------------:|:------------:|:-------------------:|:----------------:|:------------:|:--------------------------:|:-----------------------:|:-------------------:|:------------------------:|:---------------------:|:-----------------:|:------------------------:|:---------------------:|:-----------------:|
+ | No log | 0.0040 | 1 | 8.6412 | 161.3707 | 0.0022 | 0.0010 | 0.0013 | 0.0160 | 0.0017 | 0.0030 | 0.0171 | 0.0191 | 0.0124 | 0.0892 | 0.0453 | 0.0484 | 0.0 | 0.0 | 0.0 | 0.0667 | 0.0303 | 0.0417 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
+ | No log | 0.0202 | 5 | 8.6412 | 161.3707 | 0.0022 | 0.0010 | 0.0013 | 0.0160 | 0.0017 | 0.0030 | 0.0171 | 0.0191 | 0.0124 | 0.0892 | 0.0453 | 0.0484 | 0.0 | 0.0 | 0.0 | 0.0667 | 0.0303 | 0.0417 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
 
 
  ### Framework versions
 
+ - Transformers 4.41.2
+ - Pytorch 2.2.1
+ - Datasets 2.20.0
+ - Tokenizers 0.19.1
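The new card's training-results table implies a rough dataset size: step 5 corresponds to epoch 0.0202, so one optimizer step advances about 0.004 epoch. A minimal sketch of that arithmetic, assuming epoch = step / steps_per_epoch and steps_per_epoch ≈ n_samples / train_batch_size (an estimate, not a figure stated anywhere in the card):

```python
# Back-of-envelope dataset size from the new training-results table.
# Assumption: epoch = step / steps_per_epoch; steps_per_epoch = n_samples / batch.
step, epoch = 5, 0.0202          # last row of the table above
train_batch_size = 8             # from the hyperparameters list
steps_per_epoch = step / epoch   # roughly 248 optimizer steps per epoch
approx_samples = steps_per_epoch * train_batch_size  # roughly 2000 samples
print(round(steps_per_epoch), round(approx_samples))
```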
config.json CHANGED
@@ -1,5 +1,5 @@
  {
- "_name_or_path": "openai/whisper-medium",
  "activation_dropout": 0.0,
  "activation_function": "gelu",
  "apply_spec_augment": false,
@@ -13,18 +13,18 @@
  ],
  "bos_token_id": 50257,
  "classifier_proj_size": 256,
- "d_model": 1024,
- "decoder_attention_heads": 16,
- "decoder_ffn_dim": 4096,
  "decoder_input_ids": null,
  "decoder_layerdrop": 0.0,
- "decoder_layers": 24,
  "decoder_start_token_id": 50258,
  "dropout": 0.0,
- "encoder_attention_heads": 16,
- "encoder_ffn_dim": 4096,
  "encoder_layerdrop": 0.0,
- "encoder_layers": 24,
  "eos_token_id": 50257,
  "forced_decoder_ids": null,
  "init_std": 0.02,
@@ -40,13 +40,13 @@
  "max_target_positions": 448,
  "median_filter_width": 7,
  "model_type": "whisper",
- "num_hidden_layers": 24,
  "num_mel_bins": 80,
  "pad_token_id": 50257,
  "scale_embedding": false,
  "suppress_tokens": [],
  "torch_dtype": "float32",
- "transformers_version": "4.39.0.dev0",
  "use_cache": false,
  "use_weighted_layer_sum": false,
  "vocab_size": 51896
 
  {
+ "_name_or_path": "openai/whisper-tiny",
  "activation_dropout": 0.0,
  "activation_function": "gelu",
  "apply_spec_augment": false,
 
  ],
  "bos_token_id": 50257,
  "classifier_proj_size": 256,
+ "d_model": 384,
+ "decoder_attention_heads": 6,
+ "decoder_ffn_dim": 1536,
  "decoder_input_ids": null,
  "decoder_layerdrop": 0.0,
+ "decoder_layers": 4,
  "decoder_start_token_id": 50258,
  "dropout": 0.0,
+ "encoder_attention_heads": 6,
+ "encoder_ffn_dim": 1536,
  "encoder_layerdrop": 0.0,
+ "encoder_layers": 4,
  "eos_token_id": 50257,
  "forced_decoder_ids": null,
  "init_std": 0.02,
 
  "max_target_positions": 448,
  "median_filter_width": 7,
  "model_type": "whisper",
+ "num_hidden_layers": 4,
  "num_mel_bins": 80,
  "pad_token_id": 50257,
  "scale_embedding": false,
  "suppress_tokens": [],
  "torch_dtype": "float32",
+ "transformers_version": "4.41.2",
  "use_cache": false,
  "use_weighted_layer_sum": false,
  "vocab_size": 51896
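The replaced dimensions follow Whisper's usual scaling at every model size: 64-dim attention heads and a 4x feed-forward expansion. A quick consistency check on the new (whisper-tiny) values from this config.json (the scaling rules themselves are general Whisper conventions, not stated in the diff):

```python
# Consistency check on the new whisper-tiny dimensions from config.json.
d_model = 384
attention_heads = 6            # both encoder_ and decoder_attention_heads
ffn_dim = 1536                 # both encoder_ and decoder_ffn_dim
head_dim = d_model // attention_heads
assert head_dim == 64          # Whisper uses 64-dim attention heads at every size
assert ffn_dim == 4 * d_model  # and a 4x feed-forward expansion
```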
generation_config.json CHANGED
@@ -1,28 +1,28 @@
  {
  "alignment_heads": [
  [
- 13,
- 15
  ],
  [
- 15,
- 4
  ],
  [
- 15,
- 15
  ],
  [
- 16,
- 1
  ],
  [
- 20,
- 0
  ],
  [
- 23,
- 4
  ]
  ],
  "begin_suppress_tokens": [
@@ -245,6 +245,6 @@
  "transcribe": 50359,
  "translate": 50358
  },
- "transformers_version": "4.39.0.dev0",
  "use_cache": false
  }
 
  {
  "alignment_heads": [
  [
+ 2,
+ 2
  ],
  [
+ 3,
+ 0
  ],
  [
+ 3,
+ 2
  ],
  [
+ 3,
+ 3
  ],
  [
+ 3,
+ 4
  ],
  [
+ 3,
+ 5
  ]
  ],
  "begin_suppress_tokens": [
 
  "transcribe": 50359,
  "translate": 50358
  },
+ "transformers_version": "4.41.2",
  "use_cache": false
  }
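The rewritten alignment_heads entries are [decoder_layer, head] index pairs used for word-level timestamp alignment, so they must be rewritten whenever the model geometry changes. A small sanity check that the new pairs fit whisper-tiny's decoder (4 layers x 6 heads, taken from the config.json diff above; the pair semantics are the standard Whisper convention):

```python
# alignment_heads entries are [decoder_layer, head] pairs; every index must
# fit whisper-tiny's 4 decoder layers x 6 attention heads (see config.json).
alignment_heads = [[2, 2], [3, 0], [3, 2], [3, 3], [3, 4], [3, 5]]
decoder_layers, decoder_heads = 4, 6
assert all(layer < decoder_layers and head < decoder_heads
           for layer, head in alignment_heads)
```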
logs/events.out.tfevents.1720616464.8ba778dc7a53.54433.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d5f34cb197ab8e52228a36a71ebd2abfcce77200b18e6f734f912035ed4fbc61
+ size 10373
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:18d4a3e02f396a7fa0830d23f758336a940190cfd2e4a334ae45ea2aff69ecdd
- size 3055671280
 
  version https://git-lfs.github.com/spec/v1
+ oid sha256:4dcc940ee6ceacb215635c54c91d2163fc7bff40dd2b61939b601e07ca78edee
+ size 151109288
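The checkpoint size change is consistent with the medium-to-tiny swap: at float32 (4 bytes per weight, per torch_dtype in config.json) the byte counts imply roughly 764M vs 38M parameters. A rough sketch of that arithmetic (approximate, since the safetensors header adds a small overhead):

```python
# Rough parameter counts implied by the checkpoint sizes (float32 = 4 bytes/weight).
old_bytes = 3_055_671_280   # previous model.safetensors (whisper-medium fine-tune)
new_bytes = 151_109_288     # new model.safetensors (whisper-tiny fine-tune)
old_params = old_bytes / 4  # roughly 764M parameters
new_params = new_bytes / 4  # roughly 38M parameters
print(f"~{old_params / 1e6:.0f}M -> ~{new_params / 1e6:.0f}M parameters, "
      f"{old_bytes / new_bytes:.1f}x smaller on disk")
```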
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:399aaa1aa7d71a5de7366c3f538b91151b293a6f4a7e471784e8f8374bd4edc4
- size 5048
 
  version https://git-lfs.github.com/spec/v1
+ oid sha256:6b4a71d5782b69d52484d7b28c3d46d9a266c9265c4bc84a8a01b1c68fc01b19
+ size 5240