metadata

license: apache-2.0
datasets:
  - alvanlii/cantonese-youtube
base_model:
  - TencentGameMate/chinese-hubert-base
library_name: fairseq

cantonese-hubert-base-l9-k200

This is a fine-tuned Hubert model based on TencentGameMate/chinese-hubert-base for generate speech discete units, The K-means model is trained on 9k+ hours Cantonese speech data, with 200 clusters and representations from 9th layer of the model.