torch>=1.7.0 transformers>=4.0.0 gradio>=3.1.0 espnet==0.10.0 espnet_model_zoo numpy PyYAML soundfile sentencepiece