README.md · mitchelldehaven/whisper-large-v2-ru at dd398db0b56f6a7b2c6c863474210149f7e1e58c

metadata

model-index:
  - name: whisper-large-v2-ru
    results:
      - task:
          type: automatic-speech-recognition
          name: Automatic Speech Recognition
        dataset:
          name: mozilla-foundation/common_voice_11_0
          type: mozilla-foundation/common_voice_11_0
          config: ru
          split: test
        metrics:
          - type: wer
            value: 7.73
            name: WER
tags:
  - whisper-event

Whisper model finetuned using audio data from Open STT Russian Dataset (https://github.com/snakers4/open_stt).

Due to differences in tokenization of source data (in our data normalization process, we replace punctucation with "" rather than Whisper's " "), there is a slight degredation on CommonVoice.