Update README.md
Browse files
README.md
CHANGED
|
@@ -23,9 +23,7 @@ model-index:
|
|
| 23 |
name: Automatic Speech Recognition
|
| 24 |
type: automatic-speech-recognition
|
| 25 |
dataset:
|
| 26 |
-
name: IWSLT-2023, FLEURS, BiteSize, SpokenWords, Tatoeba, and Wikimedia
|
| 27 |
-
well as a copy of the dataset with noise reduction and normalization (for
|
| 28 |
-
both train and test)
|
| 29 |
type: ymoslem/IWSLT2023-GA-EN
|
| 30 |
metrics:
|
| 31 |
- name: Bleu
|
|
@@ -42,6 +40,7 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 42 |
# Whisper Small GA-EN Speech Translation
|
| 43 |
|
| 44 |
This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the IWSLT-2023, FLEURS, BiteSize, SpokenWords, Tatoeba, and Wikimedia as well as a copy of the dataset with noise reduction and normalization (for both train and test) dataset.
|
|
|
|
| 45 |
It achieves the following results on the evaluation set:
|
| 46 |
- Loss: 1.3339
|
| 47 |
- Bleu: 30.66
|
|
|
|
| 23 |
name: Automatic Speech Recognition
|
| 24 |
type: automatic-speech-recognition
|
| 25 |
dataset:
|
| 26 |
+
name: IWSLT-2023, FLEURS, BiteSize, SpokenWords, Tatoeba, and Wikimedia, normalized
|
|
|
|
|
|
|
| 27 |
type: ymoslem/IWSLT2023-GA-EN
|
| 28 |
metrics:
|
| 29 |
- name: Bleu
|
|
|
|
| 40 |
# Whisper Small GA-EN Speech Translation
|
| 41 |
|
| 42 |
This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the IWSLT-2023, FLEURS, BiteSize, SpokenWords, Tatoeba, and Wikimedia as well as a copy of the dataset with noise reduction and normalization (for both train and test) dataset.
|
| 43 |
+
The datasets were processed with noise reduction and normalization (both the train and test splits).
|
| 44 |
It achieves the following results on the evaluation set:
|
| 45 |
- Loss: 1.3339
|
| 46 |
- Bleu: 30.66
|