README.md · primeline/whisper-large-v3-german at 42cb2774ba5f4648ddbcb41f8bb7bcf6d22162c7

metadata

license: apache-2.0
language:
  - de
library_name: transformers
pipeline_tag: automatic-speech-recognition
model-index:
  - name: whisper-large-v3-german by Florian Zimmermeister @primeLine
    results:
      - task:
          type: automatic-speech-recognition
          name: Speech Recognition
        dataset:
          name: Common Voice de
          type: common_voice_15
          args: de
        metrics:
          - type: wer
            value: 3.002 %
            name: Test WER
          - type: cer
            value: 0.81 %
            name: Test CER

Summary

This model map provides information about a model based on Whisper Large v3 that has been fine-tuned for speech recognition in German. Whisper is a powerful speech recognition platform developed by OpenAI. This model has been specially optimized for processing and recognizing German speech.

Applications

This model can be used in various application areas, including

Transcription of spoken German language
Voice commands and voice control
Automatic subtitling for German videos
Voice-based search queries in German
Dictation functions in word processing programs

Training data

The training data for this model includes a large amount of spoken German from various sources. The data was carefully selected and processed to optimize recognition performance.

Training process

The training of the model was performed with the following hyperparameters

Batch size: 1024
Epochs: 2
Learning rate: 1e-5
Data augmentation: No

Model author: Florian Zimmermeister