Update README.md
Browse files
README.md
CHANGED
@@ -40,6 +40,8 @@ the train transcriptions (train.tsv) of CommonVoice (EN).
|
|
40 |
- Acoustic model (wav2vec2.0 + CTC/Attention). A pretrained wav2vec 2.0 model ([wav2vec2-lv60-large](https://huggingface.co/facebook/wav2vec2-large-lv60)) is combined with two DNN layers and finetuned on CommonVoice En.
|
41 |
The obtained final acoustic representation is given to the CTC and attention decoders.
|
42 |
|
|
|
|
|
43 |
|
44 |
## Install SpeechBrain
|
45 |
|
|
|
40 |
- Acoustic model (wav2vec2.0 + CTC/Attention). A pretrained wav2vec 2.0 model ([wav2vec2-lv60-large](https://huggingface.co/facebook/wav2vec2-large-lv60)) is combined with two DNN layers and finetuned on CommonVoice En.
|
41 |
The obtained final acoustic representation is given to the CTC and attention decoders.
|
42 |
|
43 |
+
The system is trained with recordings sampled at 16kHz (single channel).
|
44 |
+
The code will automatically normalize your audio (i.e., resampling + mono channel selection) when calling *transcribe_file* if needed.
|
45 |
|
46 |
## Install SpeechBrain
|
47 |
|