張悦楷 GPT-SoVITS
本模型係 GPT-SoVITS v2ProPlus 用咗全部張悦楷講古語音數據集 CanCLID/zoengjyutgaai,即總共 188.67 個鐘數據微調出嚟嘅。語音合成效果請見laubonghaudoi/zoengjyutgaai_tts。
模型文件
模型用嘅係 v2ProPlus 版,詳情請見 GPT‐SoVITS‐features (各版本特性)
SoVITS
sovits/e1_e50_s5950.pth
- Epoch: 50
- Steps: 5950
GPT
gpt/dpo1-e200.ckpt
- 用咗 DPO
- Epoch: 200
- top_3_acc_epoch 大概 0.8038
- total_loss_epoch 大概 3214
gpt/dpo1-e600.ckpt
- 用咗 DPO
- Epoch: 600
- top_3_acc_epoch 大概 0.8619
- total_loss_epoch 大概 4671 (唔知點解比上面仲高)
gpt/dpo1-e1000.ckpt
- 用咗 DPO
- Epoch: 1000
- top_3_acc_epoch 大概 0.8975
- total_loss_epoch 大概 1774
使用
from huggingface_hub import hf_hub_download
# Download GPT model
gpt_model = hf_hub_download(
repo_id="laubonghaudoi/zoengjyutgaai_tts",
filename="gpt/dpo1-e1000.ckpt"
)
# Download SoVITS model
sovits_model = hf_hub_download(
repo_id="laubonghaudoi/zoengjyutgaai_tts",
filename="sovits/e1_e50_s5950.pth"
)
Model tree for laubonghaudoi/zoengjyutgaai_tts
Base model
lj1995/GPT-SoVITS