Fixes to documentation
TRAINING.md (+3 -1)
@@ -20,7 +20,7 @@ The model improves in the WER evaluation metric when it is evaluated against the Common Voice

 **2. Model degrades according to human evaluation**

-When doing human
+When doing human evaluation, the results for the fine-tuned Catalan model were disappointing. The fine-tuned models clearly perform worse than the original OpenAI models, as reported by all of the users (half a dozen) who tested them.

 Our hypothesis is that the evaluation on Common Voice gives better results because the model is overfitted and has lost generalization capabilities.
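For reference, WER counts word-level substitutions, deletions and insertions against a ground-truth transcript, which is why a model can score well on in-domain Common Voice sentences while degrading on everyday speech. Below is a minimal sketch of the metric, assuming the `jiwer` Python package; the sentences are invented examples, not data from this evaluation.

```python
# Minimal sketch of the WER metric discussed above, using the jiwer package.
# The sentences are invented examples, not data from this evaluation.
from jiwer import wer

reference = "bon dia a tothom"   # ground-truth transcript (4 words)
hypothesis = "bon dia tothom"    # model output with one word dropped

# WER = (substitutions + deletions + insertions) / words in the reference
print(wer(reference, hypothesis))  # 1 deletion / 4 words = 0.25
```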
@@ -50,6 +50,8 @@ Summary as of March 2023:

 **b**. The HuggingFace Whisper implementation performs poorly. This can be really misleading when doing evaluations, since HuggingFace is the stack used for fine-tuning.

+**c**. We have only been able to use the models reliably with the Whisper.cpp and CTranslate2 inference clients.
+
 In our experiments

 | Whisper Client | WER |
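To make the client comparison concrete, here is a rough sketch of scoring the same audio clip with two of the clients named above: the HuggingFace `transformers` pipeline and the CTranslate2-based `faster-whisper` package (Whisper.cpp is a C++ CLI tool, so it is omitted here). The model identifiers, file names and reference transcript are placeholders, not the actual checkpoints or evaluation data.

```python
# Rough sketch: transcribe the same clip with two clients and WER-score both.
# Model names, file paths and the reference transcript are placeholders.
from faster_whisper import WhisperModel          # CTranslate2-based client
from transformers import pipeline                # HuggingFace client
from jiwer import wer

reference = "transcripcio de referencia"         # placeholder ground truth

# CTranslate2 / faster-whisper (expects a model converted to CTranslate2 format)
ct2_model = WhisperModel("path/to/whisper-ct2")
segments, _info = ct2_model.transcribe("sample.wav", language="ca")
ct2_text = " ".join(segment.text.strip() for segment in segments)

# HuggingFace transformers pipeline
hf_asr = pipeline("automatic-speech-recognition", model="openai/whisper-small")
hf_text = hf_asr("sample.wav")["text"]

print("CTranslate2 WER:", wer(reference, ct2_text))
print("HuggingFace WER:", wer(reference, hf_text))
```

Running both clients over the same evaluation set and averaging the per-clip WER is one way to fill in a table like the one above.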