radiogroup-crits commited on
Commit
df8b5a6
1 Parent(s): 303c84a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +86 -0
README.md CHANGED
@@ -1,3 +1,89 @@
1
  ---
 
 
2
  license: apache-2.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - it
4
  license: apache-2.0
5
+ datasets:
6
+ - mozilla-foundation/common_voice_8_0
7
+ metrics:
8
+ - wer
9
+ - cer
10
+ tags:
11
+ - audio
12
+ - automatic-speech-recognition
13
+ - hf-asr-leaderboard
14
+ - it
15
+ - mozilla-foundation/common_voice_8_0
16
+ - speech
17
+ - wav2vec2
18
+ model-index:
19
+ - name: XLS-R Wav2Vec2 Ita by radiogroup crits
20
+ results:
21
+ - task:
22
+ name: Speech Recognition
23
+ type: automatic-speech-recognition
24
+ dataset:
25
+ name: Common Voice 8.0 italian
26
+ type: mozilla-foundation/common_voice_8_0
27
+ args: it
28
+ metrics:
29
+ - name: Test WER
30
+ type: wer
31
+ value: 7.27
32
+ - name: Test CER
33
+ type: cer
34
+ value: 1.9
35
+ - name: Test WER (+LM)
36
+ type: wer
37
+ value: 5.82
38
+ - name: Test CER (+LM)
39
+ type: cer
40
+ value: 1.54
41
  ---
42
+ # XLS-R-1B-ITA-LMWIKI500
43
+
44
+ ## Fine-tuned XLS-R 1B model for speech recognition in Italian
45
+
46
+ Fine-tuned [facebook/wav2vec2-xls-r-1b](https://huggingface.co/facebook/wav2vec2-xls-r-1b) on Italian using the train and validation splits of [Common Voice 8.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_8_0), [Multilingual TEDx](http://www.openslr.org/100), [Multilingual LibriSpeech](https://www.openslr.org/94/), and [Voxpopuli](https://github.com/facebookresearch/voxpopuli).
47
+
48
+ When using this model, make sure that your speech input is sampled at 16kHz.
49
+
50
+
51
+ ## Language model information
52
+
53
+ Our language model was generated using a dataset of Italian wikipedia articles.
54
+
55
+
56
+ ## Download CommonVoice8.0 dataset for italian language
57
+ ```python
58
+ from datasets import load_dataset
59
+
60
+ dataset = load_dataset("mozilla-foundation/common_voice_8_0", "it", use_auth_token=True)
61
+ ```
62
+
63
+ ## Evaluation Commands
64
+
65
+ To evaluate on `mozilla-foundation/common_voice_8_0` with split `test`:
66
+
67
+ ```bash
68
+ python eval.py --model_id radiogroup-crits/wav2vec2-xls-r-1b-cv8ita-new-lmwiki500 --dataset mozilla-foundation/common_voice_8_0 --config it --split test --log_outputs --greedy
69
+
70
+ mv log_mozilla-foundation_common_voice_8_0_it_test_predictions.txt log_mozilla-foundation_common_voice_8_0_it_test_predictions_greedy.txt
71
+
72
+ mv mozilla-foundation_common_voice_8_0_it_test_eval_results.txt mozilla-foundation_common_voice_8_0_it_test_eval_results_greedy.txt
73
+
74
+ python eval.py --model_id radiogroup-crits/wav2vec2-xls-r-1b-cv8ita-new-lmwiki500 --dataset mozilla-foundation/common_voice_8_0 --config it --split test --log_outputs
75
+ ```
76
+
77
+ ## Citation
78
+ If you want to cite this model you can use this:
79
+
80
+ ```bibtex
81
+ @misc{crits2023wav2vec2-xls-r-1b-cv8ita-new-lmwiki500,
82
+ title={XLS-R Wav2Vec2 Ita by radiogroup crits},
83
+ author={Teraoni Prioletti Raffaele, Casagranda Paolo and Russo Francesco},
84
+ publisher={Hugging Face},
85
+ journal={Hugging Face Hub},
86
+ howpublished={\url{https://huggingface.co/radiogroup-crits/wav2vec2-xls-r-1b-cv8ita-new-lmwiki500}},
87
+ year={2023}
88
+ }
89
+ ```