license: mit
language: fr
datasets:
  - common_voice
metrics:
  - per
tags:
  - audio
  - automatic-speech-recognition
  - speech
  - phonemize
model-index:
  - name: Wav2Vec2-base French finetuned for phonemes by LMSSC
    results:
      - task:
          name: Speech Recognition
          type: automatic-speech-recognition
        dataset:
          name: Common Voice v13
          type: mozilla-foundation/common_voice_13_0
          args: fr
        metrics:
          - name: Test PER on Common Voice FR 13.0 | Trained
            type: per
            value: 5.52
          - name: Test PER on Multilingual Librispeech FR | Trained
            type: per
            value: 4.36
          - name: Val PER on Common Voice FR 13.0 | Trained
            type: per
            value: 4.31

Fine-tuned French VoxPopuli v2 wav2vec2-base model for the French speech-to-phoneme task

This model is facebook/wav2vec2-base-fr-voxpopuli-v2 fine-tuned on French, using the train and validation splits of Common Voice v13. When using this model, make sure that your speech input is sampled at 16 kHz.
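
The 16 kHz requirement can be checked, and if needed met, before audio reaches the model. The following is a minimal sketch using only the Python standard library; the names `TARGET_RATE`, `read_wav_mono16`, and `resample_linear` are hypothetical helpers introduced for illustration, not part of this model or of any library. It assumes 16-bit mono PCM WAV input and uses naive linear interpolation with no anti-aliasing filter, so a dedicated audio library's resampler is likely a better choice for real use.

```python
import io
import struct
import wave

TARGET_RATE = 16_000  # sampling rate the model card asks for (16 kHz)


def read_wav_mono16(source):
    """Read a 16-bit mono PCM WAV (path or file object); return (samples, rate)."""
    with wave.open(source, "rb") as wav:
        if wav.getnchannels() != 1 or wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit mono PCM")
        rate = wav.getframerate()
        raw = wav.readframes(wav.getnframes())
    # WAV PCM data is little-endian signed 16-bit.
    samples = list(struct.unpack(f"<{len(raw) // 2}h", raw))
    return samples, rate


def resample_linear(samples, src_rate, dst_rate=TARGET_RATE):
    """Naive linear-interpolation resampler (assumption: no anti-aliasing)."""
    if src_rate == dst_rate or not samples:
        return list(samples)
    n_out = max(1, round(len(samples) * dst_rate / src_rate))
    out = []
    for i in range(n_out):
        pos = i * src_rate / dst_rate  # position in the source signal
        left = int(pos)
        if left >= len(samples) - 1:
            # Past the last interpolation interval: hold the final sample.
            out.append(float(samples[-1]))
            continue
        frac = pos - left
        out.append(samples[left] * (1 - frac) + samples[left + 1] * frac)
    return out
```

A typical use would be `samples, rate = read_wav_mono16("clip.wav")` followed by `resample_linear(samples, rate)` whenever `rate != TARGET_RATE`; the file name here is only a placeholder.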