metadata
license: apache-2.0
datasets:
- alvanlii/cantonese-youtube
base_model:
- TencentGameMate/chinese-hubert-base
library_name: fairseq
cantonese-hubert-base-l9-k200
This is a fine-tuned Hubert model based on TencentGameMate/chinese-hubert-base for generate speech discete units, The K-means model is trained on 9k+ hours Cantonese speech data, with 200 clusters and representations from 9th layer of the model.