Cnam-LMSSC/wav2vec2-french-phonemizer


zinc75 committed on Nov 8, 2023

Commit 45209c5 • 1 Parent(s): 14a7c4b

Update README.md

Files changed (1)

README.md +13 -11


@@ -9,10 +9,11 @@ tags:
 - audio
 - automatic-speech-recognition
 - speech
 model-index:
-- name: Wav2Vec2-base French finetuned for phonemes by LMSSC
   results:
-  - task:
       name: Speech Recognition
       type: automatic-speech-recognition
     dataset:
@@ -20,17 +21,18 @@ model-index:
       type: common_voice
       args: fr
     metrics:
-       - name: Test PER
-         type: per
-         value: 5.52
-       - name: Val PER
-         type: per
-         value: 4.31
 ---
 # Fine-tuned French Voxpopuli v2 wav2vec2-base model for speech-to-phoneme task in French
 Fine-tuned [facebook/wav2vec2-base-fr-voxpopuli-v2](https://huggingface.co/facebook/wav2vec2-base-fr-voxpopuli-v2) on French using the train and validation splits of [Common Voice v13](https://huggingface.co/datasets/mozilla-foundation/common_voice_13_0).
-When using this model, make sure that your speech input is sampled at 16kHz.

 - audio
 - automatic-speech-recognition
 - speech
+- phonemize
 model-index:
+- name: Wav2Vec2-base French finetuned for phonemes by LMSSC
   results:
+  - task:
       name: Speech Recognition
       type: automatic-speech-recognition
     dataset:
       type: common_voice
       args: fr
     metrics:
+    - name: Test PER on Common Voice FR 13.0
+      type: per
+      value: 5.52
+    - name: Test PER on Multilingual Librispeech -fr
+      type: per
+      value: 4.36
+    - name: Val PER on Common Voice FR 13.0
+      type: per
+      value: 4.31
 ---
 # Fine-tuned French Voxpopuli v2 wav2vec2-base model for speech-to-phoneme task in French
 Fine-tuned [facebook/wav2vec2-base-fr-voxpopuli-v2](https://huggingface.co/facebook/wav2vec2-base-fr-voxpopuli-v2) on French using the train and validation splits of [Common Voice v13](https://huggingface.co/datasets/mozilla-foundation/common_voice_13_0).
+When using this model, make sure that your speech input is sampled at 16kHz.
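
The 16 kHz requirement added in this commit can be checked before running inference. The following is a minimal sketch, not part of the model card: it assumes the input is an uncompressed WAV file and uses only Python's standard `wave` module. The helper names `wav_sample_rate` and `needs_resampling` are hypothetical, and the resampling itself is left to whatever audio library is in use.

```python
import wave

# Sample rate stated in the model card (16 kHz).
EXPECTED_RATE_HZ = 16_000


def wav_sample_rate(path):
    """Return the sample rate in Hz recorded in a WAV file header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def needs_resampling(path, expected_rate=EXPECTED_RATE_HZ):
    """Return True when the file's sample rate differs from the expected rate."""
    return wav_sample_rate(path) != expected_rate
```

A file for which `needs_resampling` returns `True` should be resampled to 16 kHz before being passed to the model.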