	---
	license: mit
	language:
	- ja
	metrics:
	- accuracy
	pipeline_tag: audio-to-audio
	tags:
	- rvc
	---
	# <center> RVC Genshin Impact Japanese Voice Model<br />
	![model-cover.png](https://huggingface.co/ArkanDash/rvc-genshin-impact/resolve/main/model-cover.png)

	# About Retrieval-based Voice Conversion (RVC)
	Learn more about Retrieval-based Voice Conversion at the link below:<br />
	[RVC WebUI](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)

	# How to use?
	Download the pre-zipped model and add it to your RVC project.
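
As a rough illustration of that step, here is a minimal Python sketch that unpacks a downloaded model zip into a folder inside an RVC project and reports the weight and index files it found. The function name `install_model`, the `weights/<zip name>/` target folder, and the `.pth` / `.index` file extensions are assumptions made for this example; check where your RVC version actually expects model files.

```python
import zipfile
from pathlib import Path


def install_model(zip_path, project_dir):
    """Extract a downloaded model zip into the RVC project folder.

    Returns two sorted lists: the .pth weight files and the .index files
    found after extraction.
    """
    zip_path = Path(zip_path)
    # Assumed layout (not confirmed by this README): one sub-folder per
    # model under "weights", named after the zip file.
    target = Path(project_dir) / "weights" / zip_path.stem
    target.mkdir(parents=True, exist_ok=True)

    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(target)

    weights = sorted(target.rglob("*.pth"))
    indexes = sorted(target.rglob("*.index"))
    return weights, indexes
```

For example, `install_model("model.zip", "path/to/rvc-project")` (both paths are placeholders) would extract into `path/to/rvc-project/weights/model/`.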

	Model test: [Google Colab](https://colab.research.google.com/drive/110kiMZTdP6Ri1lY9-NbQf17GVPPhHyeT?usp=sharing) / [RVC Models New](https://huggingface.co/spaces/ArkanDash/rvc-models-new) (basically the same, but hosted on Spaces)


	## <center> INFO <br />
	Model created by ArkanDash. <br />
	The voices used in these models belong to HoYoverse.

	The voice data used to train these models was ripped from the game (version 3.7). <br />
	Total Models: 33 Models (19 V1 Models & 14 V2 Models)

	Plans: <br />
	- Kuki Shinobu V2 RVC
	- Nahida V2 RVC

	Replaced:
	- The Raiden Shogun model has been replaced with one trained on a newer dataset because the older model had poor voice quality. The old model has been deleted.

	### V1 Model <br />
	This was trained on the original RVC.<br />
	Pitch extraction: Harvest.<br />
	These models were trained for 100 epochs with a batch size of 10 and a 40k sample rate (some models used a 48k sample rate).<br />
	Every V1 model was trained on roughly 30 minutes of character voice.

	A few models are exceptions and were trained for more epochs because less voice data was available for the character:<br />
	- Klee: 150 epochs
	- Fischl: 150 epochs

	### (New) V2 Model <br />
	This was trained on the Mangio-Fork RVC.<br />
	Pitch extraction: Crepe.<br />
	These models were trained for 100 epochs with a batch size of 8 and a 48k sample rate (some models used a 40k sample rate).<br />
	Every V2 model was trained on roughly 60 minutes of character voice.

	Other requests:<br />
	- Greater Lord Rukkhadevata: 750 epochs, batch size 16, 48k sample rate (10-minute dataset)
	- Charlotte: 400 epochs, batch size 16, 48k sample rate (18-minute dataset)

	Note:
	- For Faruzan, the index file is somehow smaller than the others. Training printed this log line: <br />
	`Converged (lack of improvement in inertia) at step 1152/48215` <br />
	I might retrain Faruzan soon.