vasilis
/

wav2vec2-large-xlsr-53-finnish

@@ -25,10 +25,10 @@ model-index:
     metrics:
        - name: Test WER
          type: wer
-         value: 47.117220
        - name: Test CER
          type: cer
-         value: 7.880525
 ---
 # Wav2Vec2-Large-XLSR-53-finnish
@@ -88,8 +88,8 @@ import re
 test_dataset = load_dataset("common_voice", "fi", split="test") #TODO: replace {lang_id} in your language code here. Make sure the code is one of the *ISO codes* of [this](https://huggingface.co/languages) site.
 wer = load_metric("wer")
-processor = Wav2Vec2Processor.from_pretrained("vasilis/wav2vec2-large-xlsr-53-finnish") #TODO: replace {model_id} with your model id. The model id consists of {your_username}/{your_modelname}, *e.g.* `elgeish/wav2vec2-large-xlsr-53-arabic`
-model = Wav2Vec2ForCTC.from_pretrained("vasilis/wav2vec2-large-xlsr-53-finnish") #TODO: replace {model_id} with your model id. The model id consists of {your_username}/{your_modelname}, *e.g.* `elgeish/wav2vec2-large-xlsr-53-arabic`
 model.to("cuda")
 chars_to_ignore_regex = "[\,\?\.\!\-\;\:\"\“\%\‘\”\�\']"  # TODO: adapt this list to include all special characters you removed from the data
@@ -134,15 +134,12 @@ print("CER: {:2f}".format(100 * wer.compute(predictions=[" ".join(list(entry)) f
 ```
-**Test Result**:  47.117220 %
 ## Training
 The Common Voice train dataset was used for training. Also all of `CSS10 Finnish` was used using the normalized transcripts.
-The model hasn't converged yet.

     metrics:
        - name: Test WER
          type: wer
+         value: 38.335242
        - name: Test CER
          type: cer
+         value: 6.552408
 ---
 # Wav2Vec2-Large-XLSR-53-finnish
 test_dataset = load_dataset("common_voice", "fi", split="test") #TODO: replace {lang_id} in your language code here. Make sure the code is one of the *ISO codes* of [this](https://huggingface.co/languages) site.
 wer = load_metric("wer")
+processor = Wav2Vec2Processor.from_pretrained("vasilis/wav2vec2-large-xlsr-53-finnish")
+model = Wav2Vec2ForCTC.from_pretrained("vasilis/wav2vec2-large-xlsr-53-finnish")
 model.to("cuda")
 chars_to_ignore_regex = "[\,\?\.\!\-\;\:\"\“\%\‘\”\�\']"  # TODO: adapt this list to include all special characters you removed from the data
 ```
+**Test Result**:  38.335242 %
 ## Training
 The Common Voice train dataset was used for training. Also all of `CSS10 Finnish` was used using the normalized transcripts.
+After 20000 steps the models was finetuned using the common voice train and validation sets for 2000 steps more.

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "facebook/wav2vec2-large-xlsr-53",
   "activation_dropout": 0.0,
   "apply_spec_augment": true,
   "architectures": [

 {
+  "_name_or_path": "/speech-data-1/dev/hugging_face_finetuning_week/fi_demo/checkpoints/2020_27_3_v4/checkpoint-15200",
   "activation_dropout": 0.0,
   "apply_spec_augment": true,
   "architectures": [

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:dcc8846b1f384bd0511e6a21a9993e4c38c796eef0f34468bfc31198c084f11f
 size 1262056855

 version https://git-lfs.github.com/spec/v1
+oid sha256:10069dd469767be123bf30757be12a8d249b99394d8475bd44cb4c671d367131
 size 1262056855