hon9kon9ize
/

cantonese-hubert-base-l9-k200

Model card Files Files and versions Community

cantonese-hubert-base-l9-k200 / README.md

indiejoseph's picture

Update README.md

7731ac3 verified about 1 month ago

|

history blame contribute delete

542 Bytes

	---
	license: apache-2.0
	datasets:
	- alvanlii/cantonese-youtube
	base_model:
	- TencentGameMate/chinese-hubert-base
	library_name: fairseq
	---

	# cantonese-hubert-base-l9-k200

	This is a fine-tuned Hubert model based on [TencentGameMate/chinese-hubert-base](https://huggingface.co/TencentGameMate/chinese-hubert-base) for generate speech discete units, The K-means model is trained on [9k+ hours Cantonese speech data](https://huggingface.co/datasets/alvanlii/cantonese-youtube), with 200 clusters and representations from 9th layer of the model.