lucio/xls-r-kyrgiz-cv8

Automatic Speech Recognition

Generated from Trainer

hf-asr-leaderboard

mozilla-foundation/common_voice_8_0

robust-speech-event


lucio committed on Feb 6, 2022

Commit fcbf49b • 1 Parent(s): 7c4a099

fix model card

Files changed (1)

README.md +3 -3

@@ -34,7 +34,7 @@ should probably proofread and complete it, then remove this comment. -->
 # XLS-R-300M Kyrgiz CV8
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the MOZILLA-FOUNDATION/COMMON_VOICE_8_0 - KY dataset.
-It achieves the following results on the evaluation set:
 - Loss: 0.5497
 - Wer: 0.2945
 - Cer: 0.0791
@@ -55,11 +55,11 @@ The model is not reliable enough to use as a substitute for live captions for ac
 ## Training and evaluation data
-The combination of `train` and `dev` of common voice official splits were used as training data. The half of the official `test` split was used as validation data as well as for final evaluation.
 ## Training procedure
-The featurization layers of the XLS-R model are frozen while tuning a final CTC/LM layer on the Uyghur CV8 example sentences. A ramped learning rate is used with an initial warmup phase of 500 steps, a max of 0.0001, and cooling back towards 0 for the remainder of the 8100 steps (300 epochs).
 ### Training hyperparameters

 # XLS-R-300M Kyrgiz CV8
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the MOZILLA-FOUNDATION/COMMON_VOICE_8_0 - KY dataset.
+It achieves the following results on the validation set:
 - Loss: 0.5497
 - Wer: 0.2945
 - Cer: 0.0791
 ## Training and evaluation data
+The combination of `train`, `dev` and `other` of common voice official splits were used as training data. The half of the official `test` split was used as validation data, and the full `test` set was used for final evaluation.
 ## Training procedure
+The featurization layers of the XLS-R model are frozen while tuning a final CTC/LM layer on the Kyrgiz CV8 example sentences. A ramped learning rate is used with an initial warmup phase of 500 steps, a max of 0.0001, and cooling back towards 0 for the remainder of the 8100 steps (300 epochs).
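The ramped learning rate described in the training procedure can be sketched as a small function. This is a minimal illustration, not the training code from this repository: the function name `ramped_lr` is hypothetical, and the linear shape of both the warmup and the cooldown is an assumption, since the model card only states the 500 warmup steps, the 0.0001 maximum, and the 8100 total steps.

```python
def ramped_lr(step, peak_lr=1e-4, warmup_steps=500, total_steps=8100):
    """Learning rate at a given optimizer step.

    Values for peak_lr, warmup_steps and total_steps come from the model card.
    The linear ramp up and linear ramp down are assumptions.
    """
    if step < warmup_steps:
        # Warmup: rise linearly from 0 to peak_lr over the first 500 steps.
        return peak_lr * step / warmup_steps
    # Cooldown: fall linearly from peak_lr back to 0 at the final step.
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / (total_steps - warmup_steps)


if __name__ == "__main__":
    # 8100 steps over 300 epochs is 27 optimizer steps per epoch.
    for step in (0, 250, 500, 4300, 8100):
        print(step, ramped_lr(step))
```

Under these assumptions the rate peaks at step 500 and is back at half the peak by step 4300, the midpoint of the cooldown.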
 ### Training hyperparameters