ymoslem/whisper-small-ga2en-v4

Automatic Speech Recognition

Generated from Trainer

ymoslem committed on Apr 18, 2024

Commit 439a972 · verified · 1 Parent(s): a98035b

Update README.md

Files changed (1)

README.md +2 -3

README.md CHANGED

@@ -23,9 +23,7 @@ model-index:
       name: Automatic Speech Recognition
       type: automatic-speech-recognition
     dataset:
-      name: IWSLT-2023, FLEURS, BiteSize, SpokenWords, Tatoeba, and Wikimedia     as
-        well as a copy of the dataset with noise reduction and normalization (for
-        both train and test)
+      name: IWSLT-2023, FLEURS, BiteSize, SpokenWords, Tatoeba, and Wikimedia, normalized
       type: ymoslem/IWSLT2023-GA-EN
     metrics:
     - name: Bleu
@@ -42,6 +40,7 @@ should probably proofread and complete it, then remove this comment. -->
 # Whisper Small GA-EN Speech Translation
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the IWSLT-2023, FLEURS, BiteSize, SpokenWords, Tatoeba, and Wikimedia     as well as a copy of the dataset with noise reduction and normalization (for both train and test) dataset.
+The datasets were processed with noise reduction and normalization (both the train and test splits).
 It achieves the following results on the evaluation set:
 - Loss: 1.3339
 - Bleu: 30.66