Improved results using a 8s + 2s chunking strategy

Files changed (4) hide show

README.md CHANGED Viewed

@@ -38,10 +38,10 @@ model-index:
     metrics:
        - name: Test WER
          type: wer
-         value: 22.58
        - name: Test CER
          type: cer
-         value: 11.26
 ---
 # XLS-R-based CTC model with 5-gram language model from Common Voice

     metrics:
        - name: Test WER
          type: wer
+         value: 20.79
        - name: Test CER
          type: cer
+         value: 10.72
 ---
 # XLS-R-based CTC model with 5-gram language model from Common Voice

eval.sh CHANGED Viewed

	@@ -1,2 +1,2 @@
1	python ./eval.py --model_id FremyCompany/xls-r-nl-v1-cv8-lm --dataset mozilla-foundation/common_voice_8_0 --config nl --split test --log_outputs
2	- python ./eval.py --model_id FremyCompany/xls-r-nl-v1-cv8-lm --dataset speech-recognition-community-v2/dev_data --config nl --split validation --chunk_length_s 5.0 --stride_length_s 1.0


1	python ./eval.py --model_id FremyCompany/xls-r-nl-v1-cv8-lm --dataset mozilla-foundation/common_voice_8_0 --config nl --split test --log_outputs
2	+ python ./eval.py --model_id FremyCompany/xls-r-nl-v1-cv8-lm --dataset speech-recognition-community-v2/dev_data --config nl --split validation --chunk_length_s 8.0 --stride_length_s 2.0

log_speech-recognition-community-v2_dev_data_nl_validation_predictions.txt CHANGED Viewed

The diff for this file is too large to render. See raw diff

speech-recognition-community-v2_dev_data_nl_validation_eval_results.txt CHANGED Viewed