Nekoray-XL-V0.7

Model Card

NekoRay v0.7 is an SDXL checkpoint finetuned from SDXL 1.0 on 1.5M quality-tagged images drawn from a selection of image sites. It aims to be the next Waifu-diffusion, giving the SD community more freedom in generation.

The project is a WIP, and further checkpoints with enhanced augmentations and more images are currently being developed.

Two models trained on the same dataset/hardware/hyperparameters are currently available:

fp16mixed: 2 epochs, half-precision
32full: 1.7 epochs (still training), full-precision

Usage

We recommend using exactly one of the resolutions listed below, since the original SDXL performs poorly outside of them. For prompts, danbooru-style captions are preferred.

We use the same resolutions and aspect ratios (height / width) as the original SDXL:

Height	Width	Aspect Ratio
512	2048	0.25
512	1984	0.26
512	1920	0.27
512	1856	0.28
576	1792	0.32
576	1728	0.33
576	1664	0.35
640	1600	0.4
640	1536	0.42
704	1472	0.48
704	1408	0.5
704	1344	0.52
768	1344	0.57
768	1280	0.6
832	1216	0.68
832	1152	0.72
896	1152	0.78
896	1088	0.82
960	1088	0.88
960	1024	0.94
1024	1024	1.0
1024	960	...
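
To make the recommendation above concrete, here is a minimal sketch of snapping an arbitrary target size to the closest listed resolution. The function name `nearest_bucket` and the log-ratio distance are illustrative assumptions, not part of the released model, and the list only contains the rows shown above (the table is truncated, so the remaining taller-than-wide sizes are missing).

```python
import math

# (height, width) pairs copied from the rows listed above. The table is
# truncated in this card, so the remaining portrait sizes are NOT included.
BUCKETS = [
    (512, 2048), (512, 1984), (512, 1920), (512, 1856),
    (576, 1792), (576, 1728), (576, 1664),
    (640, 1600), (640, 1536),
    (704, 1472), (704, 1408), (704, 1344),
    (768, 1344), (768, 1280),
    (832, 1216), (832, 1152),
    (896, 1152), (896, 1088),
    (960, 1088), (960, 1024),
    (1024, 1024), (1024, 960),
]


def nearest_bucket(height, width):
    """Return the listed (height, width) whose aspect ratio (height / width)
    is closest to the requested one. Distance is measured on the log of the
    ratio, which is an assumption made for this sketch."""
    target = math.log(height / width)
    return min(BUCKETS, key=lambda hw: abs(math.log(hw[0] / hw[1]) - target))
```

For example, `nearest_bucket(1080, 1920)` selects the 768 x 1344 row, which can then be passed as the height and width of the generation request.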

For prompts, the following keywords are appended to better separate genres:

'sensitive-rated', 'questionable-rated', 'explicit-rated'
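
As a rough illustration of a danbooru-style prompt with one of these keywords appended, the sketch below joins tags with commas and puts the rating keyword last. The helper name `build_prompt`, the comma separator, and the keyword position are assumptions made for this example; the card only states that the keywords are appended.

```python
# Rating keywords as listed above.
RATING_KEYWORDS = ("sensitive-rated", "questionable-rated", "explicit-rated")


def build_prompt(tags, rating=None):
    """Join danbooru-style tags with commas and append an optional rating
    keyword at the end (separator and position are assumptions)."""
    if rating is not None and rating not in RATING_KEYWORDS:
        raise ValueError(f"unknown rating keyword: {rating!r}")
    # Drop empty entries and surrounding whitespace before joining.
    parts = [t.strip() for t in tags if t.strip()]
    if rating is not None:
        parts.append(rating)
    return ", ".join(parts)
```

Calling `build_prompt(["1girl", "solo", "outdoors"], "sensitive-rated")` yields a single comma-separated string ending in the rating keyword.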

The models are intended to be used as pretrained checkpoints, and further finetuning is strongly recommended for downstream use. For more information, see the 'Finetuning' section below.

Finetuning

Finetuning on SDXL inherently provides better clarity and reduced 'noisiness' at higher resolutions compared to SD 1.4. We have seen good results from further finetuning on various anime-related subject matter, including but not limited to:

style finetunes (tested on nijijourney images & pvc/figure datasets)
character finetunes (tested on 8 Blue Archive characters)
concept finetunes (tested on nsfw gestures)

Actual samples of downstream finetunes will be posted once we have the finetuners' consent.

License

The model (which is still very much a WIP) is intended to be used as a foundation for various downstream finetunes. The license is under discussion, but it will generally follow the OpenRAIL-M terms.