---
license: apache-2.0
language:
- de
library_name: transformers
pipeline_tag: automatic-speech-recognition
model-index:
- name: whisper-large-v3-german by Florian Zimmermeister @primeLine
  results:
  - task:
      type: automatic-speech-recognition
      name: Speech Recognition
    dataset:
      name: Common Voice de
      type: common_voice_15
      args: de
    metrics:
    - type: wer
      value: 3.002 %
      name: Test WER
    - type: cer
      value: 0.81 %
      name: Test CER
---
### Summary
This model card describes a model based on Whisper Large v3 that has been fine-tuned for German speech recognition. Whisper is a speech recognition model developed by OpenAI. This fine-tuned version has been optimized specifically for processing and recognizing German speech.
### Applications
This model can be used in various application areas, including:
- Transcription of spoken German language
- Voice commands and voice control
- Automatic subtitling for German videos
- Voice-based search queries in German
- Dictation functions in word processing programs
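
For the transcription use case, a minimal sketch with the `transformers` ASR pipeline might look as follows. The repository id in `MODEL_ID` is an assumption (this card does not state it), and the helper names `build_transcriber` and `transcribe` are hypothetical; adjust them to your setup.

```python
# Assumption: the Hub repository id is not stated in this card; replace as needed.
MODEL_ID = "primeline/whisper-large-v3-german"


def build_transcriber(model_id: str = MODEL_ID):
    """Create an ASR pipeline (downloads the model weights on first use)."""
    # Imported lazily so that defining this helper does not require transformers.
    from transformers import pipeline

    return pipeline(
        "automatic-speech-recognition",
        model=model_id,
        chunk_length_s=30,  # split long recordings into 30-second windows
    )


def transcribe(audio_path: str, transcriber=None) -> str:
    """Return the German transcript of a single audio file."""
    if transcriber is None:
        transcriber = build_transcriber()
    result = transcriber(
        audio_path,
        # Pin Whisper to German transcription instead of language auto-detection.
        generate_kwargs={"language": "german", "task": "transcribe"},
    )
    return result["text"].strip()
```

Calling `transcribe("recording.wav")` (a hypothetical file name) would load the model once and return the recognized text. For repeated calls, build the pipeline a single time with `build_transcriber()` and pass it in via `transcriber`.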
### Training data
The training data for this model includes a large amount of spoken German from various sources. The data was carefully selected and processed to optimize recognition performance.
### Training process
The model was trained with the following hyperparameters:
- Batch size: 1024
- Epochs: 2
- Learning rate: 1e-5
- Data augmentation: No
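
A batch size of 1024 is usually reached through gradient accumulation rather than in a single forward pass. The sketch below records the hyperparameters listed above and derives the accumulation steps. It assumes that 1024 is the effective (global) batch size; the per-device batch size and device count are hypothetical values chosen for illustration, not details from the actual training run.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainingSetup:
    # Values taken from the hyperparameter list in this card.
    effective_batch_size: int = 1024  # assumed to be the global batch size
    epochs: int = 2
    learning_rate: float = 1e-5
    data_augmentation: bool = False
    # Hypothetical hardware layout, not stated in the card.
    per_device_batch_size: int = 16
    num_devices: int = 8

    def gradient_accumulation_steps(self) -> int:
        """Number of micro-batches to accumulate before each optimizer step."""
        samples_per_step = self.per_device_batch_size * self.num_devices
        if self.effective_batch_size % samples_per_step != 0:
            raise ValueError(
                "effective batch size must be a multiple of "
                "per_device_batch_size * num_devices"
            )
        return self.effective_batch_size // samples_per_step


setup = TrainingSetup()
print(setup.gradient_accumulation_steps())
```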
### Model author
[Florian Zimmermeister](https://huggingface.co/flozi00)