longriverlong committed on
Commit
e4fea3e
1 Parent(s): b3664dc

End of training

Files changed (3)
  1. README.md +71 -0
  2. generation_config.json +148 -0
  3. model.safetensors +1 -1
README.md ADDED
@@ -0,0 +1,71 @@
+ ---
+ language:
+ - zh
+ license: apache-2.0
+ base_model: openai/whisper-tiny
+ tags:
+ - hf-asr-leaderboard
+ - generated_from_trainer
+ datasets:
+ - google/fleurs
+ model-index:
+ - name: Whisper Tiny Chinese - Tommy
+ results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # Whisper Tiny Chinese - Tommy
+
+ This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the google fleurs dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 2.0604
+ - Cer: 686.3631
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 1e-05
+ - train_batch_size: 16
+ - eval_batch_size: 8
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - lr_scheduler_warmup_steps: 100
+ - training_steps: 3500
+ - mixed_precision_training: Native AMP
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Cer |
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|
+ | 2.7861 | 2.18 | 500 | 2.9273 | 541.9488 |
+ | 2.2443 | 4.37 | 1000 | 2.5364 | 440.6712 |
+ | 1.9019 | 6.55 | 1500 | 2.3037 | 705.8989 |
+ | 1.6275 | 8.73 | 2000 | 2.1769 | 746.9195 |
+ | 1.4541 | 10.92 | 2500 | 2.1022 | 756.6663 |
+ | 1.3193 | 13.1 | 3000 | 2.0731 | 699.0618 |
+ | 1.2989 | 15.28 | 3500 | 2.0604 | 686.3631 |
+
+
+ ### Framework versions
+
+ - Transformers 4.37.0.dev0
+ - Pytorch 2.1.0+cu121
+ - Datasets 2.16.1
+ - Tokenizers 0.15.0
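
Review note on the `Cer` column: every reported value is above 100 (686.3631 at step 3500). Assuming the column is a character error rate expressed as a percentage, that is, character-level edit distance divided by reference length times 100, values above 100 are possible whenever the hypothesis is much longer than the reference, for example when the decoder repeats itself. The sketch below is a self-contained illustration of that arithmetic. `edit_distance` and `cer_percent` are hypothetical helpers written for this note, not the metric code the Trainer used.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Character-level Levenshtein distance between ref and hyp."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # delete a reference character
                            curr[j - 1] + 1,      # insert a hypothesis character
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def cer_percent(ref: str, hyp: str) -> float:
    """CER as a percentage; assumes a non-empty reference."""
    return 100.0 * edit_distance(ref, hyp) / len(ref)


reference = "你好"
repeated = "你好" * 8  # a 16-character hypothesis for a 2-character reference

print(cer_percent(reference, reference))  # 0.0
print(cer_percent(reference, repeated))   # 700.0
```

If the assumption about the metric holds, a CER in the 400 to 750 range suggests hypotheses several times longer than the references, which is worth checking against a few decoded samples before trusting the validation loss trend.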
generation_config.json ADDED
@@ -0,0 +1,148 @@
+ {
+   "alignment_heads": [
+     [
+       1,
+       0
+     ],
+     [
+       2,
+       0
+     ],
+     [
+       2,
+       5
+     ],
+     [
+       3,
+       0
+     ],
+     [
+       3,
+       1
+     ],
+     [
+       3,
+       2
+     ],
+     [
+       3,
+       3
+     ],
+     [
+       3,
+       4
+     ]
+   ],
+   "begin_suppress_tokens": [
+     220,
+     50256
+   ],
+   "bos_token_id": 50257,
+   "decoder_start_token_id": 50257,
+   "eos_token_id": 50256,
+   "forced_decoder_ids": [
+     [
+       1,
+       50362
+     ]
+   ],
+   "is_multilingual": false,
+   "max_initial_timestamp_index": 1,
+   "max_length": 448,
+   "no_timestamps_token_id": 50362,
+   "pad_token_id": 50256,
+   "return_timestamps": false,
+   "suppress_tokens": [
+     1,
+     2,
+     7,
+     8,
+     9,
+     10,
+     14,
+     25,
+     26,
+     27,
+     28,
+     29,
+     31,
+     58,
+     59,
+     60,
+     61,
+     62,
+     63,
+     90,
+     91,
+     92,
+     93,
+     357,
+     366,
+     438,
+     532,
+     685,
+     705,
+     796,
+     930,
+     1058,
+     1220,
+     1267,
+     1279,
+     1303,
+     1343,
+     1377,
+     1391,
+     1635,
+     1782,
+     1875,
+     2162,
+     2361,
+     2488,
+     3467,
+     4008,
+     4211,
+     4600,
+     4808,
+     5299,
+     5855,
+     6329,
+     7203,
+     9609,
+     9959,
+     10563,
+     10786,
+     11420,
+     11709,
+     11907,
+     13163,
+     13697,
+     13700,
+     14808,
+     15306,
+     16410,
+     16791,
+     17992,
+     19203,
+     19510,
+     20724,
+     22305,
+     22935,
+     27007,
+     30109,
+     30420,
+     33409,
+     34949,
+     40283,
+     40493,
+     40549,
+     47282,
+     49146,
+     50257,
+     50357,
+     50358,
+     50359,
+     50360,
+     50361
+   ],
+   "transformers_version": "4.37.0.dev0"
+ }
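
Review note on `generation_config.json`: the file sets `"is_multilingual": false` and forces only token 50362 (the `no_timestamps_token_id`) at decoder position 1, while the model card in this commit declares `language: zh`. Whether that combination is intended cannot be determined from the diff alone. The sketch below shows one way such a consistency check could be written. `CONFIG_EXCERPT` copies a subset of the values above, and `check_language_consistency` is a hypothetical helper, not part of Transformers.

```python
import json

# Subset of the values added in generation_config.json above.
CONFIG_EXCERPT = """
{
  "bos_token_id": 50257,
  "decoder_start_token_id": 50257,
  "eos_token_id": 50256,
  "forced_decoder_ids": [[1, 50362]],
  "is_multilingual": false,
  "no_timestamps_token_id": 50362,
  "pad_token_id": 50256
}
"""


def check_language_consistency(config: dict, card_languages: list) -> list:
    """Return human-readable warnings; an empty list means nothing was flagged."""
    warnings = []
    non_english = [lang for lang in card_languages if lang != "en"]
    if non_english and not config.get("is_multilingual", False):
        warnings.append(
            f"card declares {non_english} but is_multilingual is false"
        )
    # Collect the token ids that generation is forced to emit.
    forced = {token for _, token in config.get("forced_decoder_ids", [])}
    if forced == {config.get("no_timestamps_token_id")}:
        warnings.append("forced_decoder_ids forces only the no-timestamps token")
    return warnings


config = json.loads(CONFIG_EXCERPT)
for warning in check_language_consistency(config, ["zh"]):
    print("warning:", warning)
```

A warning here is only a prompt to compare the file against the base checkpoint's own generation config; it does not by itself show that the configuration is wrong.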
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:45fac6091723d5783ce84093da16dcf4de21ae8909d4dd5daed212db652f1c84
+ oid sha256:80ff052cd5a0895476b95f8c0001de0082c8441e2d3c78a47f31009222ecdfb0
  size 151060136
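
Review note on `model.safetensors`: the diff touches only the Git LFS pointer. The `oid` changes while `size` stays at 151060136 bytes, which is consistent with updated weights of the same shapes and dtype. A downloaded copy can be checked against the new pointer with a sketch like the one below. `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers built on the standard library, and the local path in the usage comment is an assumption.

```python
import hashlib
import os

# The new pointer contents from this commit.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:80ff052cd5a0895476b95f8c0001de0082c8441e2d3c78a47f31009222ecdfb0
size 151060136
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a pointer file into its version, hash algorithm, digest, and size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Check a local file's size and digest against a parsed pointer."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as handle:
        # Hash in chunks so a large weights file is never held in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER)
print(pointer["algo"], pointer["size"])
# Usage, assuming the weights were downloaded to ./model.safetensors:
# print(matches_pointer("model.safetensors", pointer))
```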