update LM eval results
Browse files
README.md
CHANGED
@@ -20,10 +20,16 @@ model-index:
|
|
20 |
type: mozilla-foundation/common_voice_8_0
|
21 |
args: ky
|
22 |
metrics:
|
23 |
-
- name: Test WER
|
|
|
|
|
|
|
|
|
|
|
|
|
24 |
type: wer
|
25 |
value: 31.28
|
26 |
-
- name: Test CER
|
27 |
type: cer
|
28 |
value: 7.66
|
29 |
---
|
@@ -45,6 +51,8 @@ For a description of the model architecture, see [facebook/wav2vec2-xls-r-300m](
|
|
45 |
|
46 |
The model vocabulary consists of the cyrillic alphabet with punctuation removed.
|
47 |
|
|
|
|
|
48 |
## Intended uses & limitations
|
49 |
|
50 |
This model is expected to be of some utility for low-fidelity use cases such as:
|
|
|
20 |
type: mozilla-foundation/common_voice_8_0
|
21 |
args: ky
|
22 |
metrics:
|
23 |
+
- name: Test WER (with LM)
|
24 |
+
type: wer
|
25 |
+
value: 19.01
|
26 |
+
- name: Test CER (with LM)
|
27 |
+
type: cer
|
28 |
+
value: 5.38
|
29 |
+
- name: Test WER (no LM)
|
30 |
type: wer
|
31 |
value: 31.28
|
32 |
+
- name: Test CER (no LM)
|
33 |
type: cer
|
34 |
value: 7.66
|
35 |
---
|
|
|
51 |
|
52 |
The model vocabulary consists of the cyrillic alphabet with punctuation removed.
|
53 |
|
54 |
+
The kenlm language model is built using the text of the train and invalidated corpus splits.
|
55 |
+
|
56 |
## Intended uses & limitations
|
57 |
|
58 |
This model is expected to be of some utility for low-fidelity use cases such as:
|
mozilla-foundation_common_voice_8_0_ky_test_eval_results.txt
CHANGED
@@ -1,2 +1,2 @@
|
|
1 |
-
WER: 0.
|
2 |
-
CER: 0.
|
|
|
1 |
+
WER: 0.19011371973587674
|
2 |
+
CER: 0.05388927913480272
|