---
license: apache-2.0
datasets:
- alvanlii/cantonese-youtube
base_model:
- TencentGameMate/chinese-hubert-base
library_name: fairseq
---

# cantonese-hubert-base-l9-k200

This is a HuBERT model fine-tuned from [TencentGameMate/chinese-hubert-base](https://huggingface.co/TencentGameMate/chinese-hubert-base) for generating discrete speech units. The K-means model is trained on [9k+ hours of Cantonese speech data](https://huggingface.co/datasets/alvanlii/cantonese-youtube), with 200 clusters, using representations from the 9th layer of the model.
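
To illustrate what "discrete units" means here, the sketch below shows the quantization step only: each frame-level feature vector is mapped to the index of its nearest K-means centroid, giving one unit ID in the range 0 to 199 per frame. It is a minimal illustration, not the code used to build this model. The helper names (`squared_distance`, `assign_units`) are hypothetical, and random vectors stand in for both the layer-9 features and the trained centroids. The toy feature size of 8 is an assumption to keep the example fast; HuBERT base features are 768-dimensional.

```python
import random

NUM_CLUSTERS = 200  # from the model card: K-means with 200 clusters
FEATURE_DIM = 8     # toy size for this demo; HuBERT base features are 768-dimensional


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def assign_units(features, centroids):
    """Map each frame-level feature vector to the index of its nearest centroid."""
    units = []
    for frame in features:
        nearest = min(
            range(len(centroids)),
            key=lambda k: squared_distance(frame, centroids[k]),
        )
        units.append(nearest)
    return units


# Placeholder data (assumption): random vectors stand in for the layer-9
# features of one utterance and for the trained K-means centroids.
rng = random.Random(0)
centroids = [
    [rng.gauss(0.0, 1.0) for _ in range(FEATURE_DIM)] for _ in range(NUM_CLUSTERS)
]
features = [[rng.gauss(0.0, 1.0) for _ in range(FEATURE_DIM)] for _ in range(50)]

units = assign_units(features, centroids)
print(units[:10])  # first ten unit IDs, each between 0 and 199
```

In real use, the features would come from the 9th transformer layer of this checkpoint and the centroids from the released K-means model. A vectorized implementation (for example with NumPy or PyTorch) would be the practical choice at full feature size.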