speechbrain
/

asr-wav2vec2-commonvoice-en

Automatic Speech Recognition

Model card Files Files and versions Community

Mirco commited on Nov 30, 2021

Commit

21436a9

·

1 Parent(s): db6b0d5

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -40,6 +40,8 @@ the train transcriptions (train.tsv) of CommonVoice (EN).
 - Acoustic model (wav2vec2.0 + CTC/Attention). A pretrained wav2vec 2.0 model ([wav2vec2-lv60-large](https://huggingface.co/facebook/wav2vec2-large-lv60)) is combined with two DNN layers and finetuned on CommonVoice En.
 The obtained final acoustic representation is given to the CTC and attention decoders.
 ## Install SpeechBrain

 - Acoustic model (wav2vec2.0 + CTC/Attention). A pretrained wav2vec 2.0 model ([wav2vec2-lv60-large](https://huggingface.co/facebook/wav2vec2-large-lv60)) is combined with two DNN layers and finetuned on CommonVoice En.
 The obtained final acoustic representation is given to the CTC and attention decoders.
+The system is trained with recordings sampled at 16kHz (single channel).
+The code will automatically normalize your audio (i.e., resampling + mono channel selection) when calling *transcribe_file* if needed.
 ## Install SpeechBrain