Update README.md
README.md
CHANGED
@@ -16,7 +16,10 @@ should probably proofread and complete it, then remove this comment. -->
 # whisper-small-se
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 11.0 dataset.
 ## Model description
-
+The model was initially trained on 680,000 hours of audio with corresponding transcripts collected from the internet; 65% of that audio was English, and 83% of it had English transcripts.
+
+The model was then fine-tuned for 4,000 iterations, the first 500 of which were warm-up, on Swedish data from [Common Voice 11.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_11_0), reaching a WER of 19.865.
+
 ## Intended uses & limitations
 More information needed
 ## Training and evaluation data
@@ -35,8 +38,15 @@ The following hyperparameters were used during training:
 - mixed_precision_training: Native AMP
 
 ### Training results
+![Training table](whisper_finetune.png)
+## Model Plot
+
+<details>
+<summary>View Training Plots</summary>
 
+![Metrics](whisper_metrics.png)
 
+</details>
 
 ### Framework versions
 
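The model description added in this change reports a WER of 19.865. As a rough illustration of what that metric measures, here is a minimal sketch of word error rate: the word-level edit distance between a reference transcript and a model transcript, divided by the number of reference words. The function name `word_error_rate`, the whitespace tokenization, and the Swedish example sentences are assumptions made for illustration only. The commit does not say which tool or text normalization produced the reported figure, and the sketch assumes that figure is a percentage.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return (substitutions + deletions + insertions) / number of reference words.

    Hypothetical helper for illustration; tokenization is plain whitespace
    splitting, with no casing or punctuation normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion (reference word missing)
                curr[j - 1] + 1,                  # insertion (extra hypothesis word)
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Made-up example: the hypothesis drops one of seven reference words.
reference = "jag heter anna och bor i stockholm"
hypothesis = "jag heter anna och bor stockholm"
wer = word_error_rate(reference, hypothesis)
print(f"WER: {100 * wer:.3f}")  # prints "WER: 14.286"
```

Scores from evaluation libraries can differ from this sketch because they usually normalize casing and punctuation before counting errors.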