FremyCompany
commited on
Commit
•
93b5096
1
Parent(s):
f6ca04e
Improved results using a 8s + 2s chunking strategy
Browse files
README.md
CHANGED
@@ -38,10 +38,10 @@ model-index:
|
|
38 |
metrics:
|
39 |
- name: Test WER
|
40 |
type: wer
|
41 |
-
value:
|
42 |
- name: Test CER
|
43 |
type: cer
|
44 |
-
value:
|
45 |
---
|
46 |
|
47 |
# XLS-R-based CTC model with 5-gram language model from Common Voice
|
|
|
38 |
metrics:
|
39 |
- name: Test WER
|
40 |
type: wer
|
41 |
+
value: 20.79
|
42 |
- name: Test CER
|
43 |
type: cer
|
44 |
+
value: 10.72
|
45 |
---
|
46 |
|
47 |
# XLS-R-based CTC model with 5-gram language model from Common Voice
|
eval.sh
CHANGED
@@ -1,2 +1,2 @@
|
|
1 |
python ./eval.py --model_id FremyCompany/xls-r-nl-v1-cv8-lm --dataset mozilla-foundation/common_voice_8_0 --config nl --split test --log_outputs
|
2 |
-
python ./eval.py --model_id FremyCompany/xls-r-nl-v1-cv8-lm --dataset speech-recognition-community-v2/dev_data --config nl --split validation --chunk_length_s
|
|
|
1 |
python ./eval.py --model_id FremyCompany/xls-r-nl-v1-cv8-lm --dataset mozilla-foundation/common_voice_8_0 --config nl --split test --log_outputs
|
2 |
+
python ./eval.py --model_id FremyCompany/xls-r-nl-v1-cv8-lm --dataset speech-recognition-community-v2/dev_data --config nl --split validation --chunk_length_s 8.0 --stride_length_s 2.0
|
log_speech-recognition-community-v2_dev_data_nl_validation_predictions.txt
CHANGED
The diff for this file is too large to render.
See raw diff
|
|
speech-recognition-community-v2_dev_data_nl_validation_eval_results.txt
CHANGED
@@ -1,2 +1,2 @@
|
|
1 |
-
WER: 0.
|
2 |
-
CER: 0.
|
|
|
1 |
+
WER: 0.20791270350703156
|
2 |
+
CER: 0.10724088973024136
|