So-Vits-Svc Base Model V1

The base model to generate new voices with so-vits-svc voice lab.

The dataset was comprised of 278 english speaking people. 4 datasets where used:

Genshin Voice: Only speakers with more than 30min of audio
VCTK
Vocalset
Private scraped dataset

The model was trained for around 4 days and 16 hours on a single rtx 3090 (61 epochs / 430k steps)

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model's library.

Zeger56644
/

voice-lab-v1

So-Vits-Svc Base Model V1

Datasets used to train Zeger56644/voice-lab-v1