pypinyin einops omegaconf==2.0.6 encodec vocos scipy transformers torch k_diffusion tensorboard txtsplit