mitchelldehaven commited on
Commit
899e249
1 Parent(s): 1503b28

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -1
README.md CHANGED
@@ -16,4 +16,8 @@ model-index:
16
  name: WER
17
  tags:
18
  - whisper-event
19
- ---
 
 
 
 
 
16
  name: WER
17
  tags:
18
  - whisper-event
19
+ ---
20
+
21
+ Whisper model finetuned using audio data from CommonVoice Ukrainian v10 train and dev set with additional data via semi-supervised data.
22
+
23
+ There is a differences in tokenization of source data (in our data normalization process, we replace punctucation with `""` rather than Whisper's `" "`). This mismatch leads to a slight degradation on CommonVoice.