CodeChris
/

EleetModel

Model card Files Files and versions Community

File size: 5,463 Bytes

---
license: creativeml-openrail-m
---
# Eleet Model: an anime style stable diffusion model

> This model is also available on: [Eleet Model - Civitai](https://civitai.com/models/77807).

Eleet model is a block-weighted merged stable diffusion model aiming at generating good quality 2D anime style images that satisfy my personal taste and, hopefully, your taste.

ELEET is an Internet slang for the word 'elite', which dates back to 80s/90s. Its modified spelling in a digital form is 31337, which has been the most common Eta Noise Seed Delta (ENSD) value in stable diffusion probably since the NAI era. The model is named in honor of this tradition.

- [Suggested settings](#suggested-settings)
- [Samples](#samples)
- [Merge ideas](#merge-ideas)
- [License](#license)


## Suggested settings

For users who know little about stable diffusion settings, I recommend:

* **Prompt**: Start with `masterpiece, best quality, aesthetic`. Personally I also like to add photography phrases such as `cinematic lighting, professional shadow`, etc.
* **Negative prompt**:
  
  > (worst quality, low quality:1.4), lowres, bad anatomy, (blurry), (text, logo, watermark, signature, username)

* **Sampler**: DPM++ 2M Karras
* **CFG Scale**: 6\~9
* **Steps**: 16\~30
* Highres Fix (optional): Latent sampler; 0.6~0.7 Denoising strength; 16 Highres steps.
* **Clip skip**: 2
* No external VAE needed

## Samples

Here are 4 samples of the latest Eleet model version.

*Sample 1* (Highres Fix from 512x800):

![Seed: 1396780322](samples/v2-01.jpg)

```
masterpiece, best quality, aesthetic, 1girl, solo, black eyes, green hair, low-tied long hair, school uniform, sitting, wariza, thighhighs, black thighhighs, from above, looking at viewer, (cityscape), cinematic lighting, professional shadow
```

Other common settings through all samples:

```
Negative prompt: (worst quality, low quality:1.4), lowres, bad anatomy, (child, loli), (blurry), (text, logo, watermark, signature, username)

Steps: 20, Sampler: DPM++ 2M Karras, CFG scale: 7, Denoising strength: 0.6, Clip skip: 2, Hires upscale: 1.25, Hires steps: 16, Hires upscaler: Latent
```

*Sample 2* (Highres Fix from 512x800):

![Seed: 3293204921](samples/v2-02.jpg)

```
masterpiece, best quality, aesthetic, 1girl, solo, red eyes, one eye closed, brown hair, long hair, grin, arms up, crop top, denim shorts, midriff, navel, athletic body, earings, (cowboy shot), looking at viewer, (waterfall), sunny, cinematic lighting, professional shadow
```

*Sample 3* (Highres Fix from 832x512):

![Seed: 3444941526](samples/v2-03.jpg)

```
masterpiece, best quality, aesthetic, no humans, scenery, sky, cloud, outdoors, mountain, water, sunset, cloudy sky, lake, river, landscape, reflection, tree, nature, mountainous horizon, blue sky, evening, professional shadow, award winner photo
```

*Sample 4* (Highres Fix from 800x512):

![Seed: 371634422](samples/v2-04.jpg)

```
masterpiece, best quality, aesthetic, forest, grass, blue sky, cloud, outdoors, no humans, nature, scenery, railroad tracks, sunlight, sunbeam, lens flare, cinematic lighting, professional shadow
```

Additionally, here are samples of the previous model versions. Click to expand:

<details>
<summary>Eleet v1.0 samples</summary>

*Sample 1*: Scenery. txt2img+highres, (640x384) x1.5.

![](samples/v1-01.png)

```
masterpiece, best quality, aesthetic, highres RAW photo, landscape photography, wide shot, from below, scenery, sunrise, blue sky, clouds, lake, reflection, sun, trees, floating leaves, ripples, foreground interest, depth of field, cinematic lighting, asymmetric composition, professional shadows, sharp focus, lens flare
```

*Sample 2*: Anime girl. txt2img, 576x832.

![](samples/v1-02.png)

```
masterpiece, best quality, aesthetic, 1girl, :d, blue eyes, fox ears, gold hair, twin braids, crop top, off-shoulder jacket, blue pleated skirt, thighhighs, garter belt, skindentation, breasts, bare shoulders, midriff, (cowboy shot), looking at viewer, waterfall, mountains
```

*Sample 3*: Scenery. txt2img, 832x576.

![](samples/v1-03.png)

```
masterpiece, best quality, aesthetic, highres RAW photo, cool color tone, wide shot, scenery, snow, blue sky, cityscape, skyline, buildings, rooftop, roads, cinematic lighting, professional shadows, sharp focus
```

*Sample 4*: Anime girl. txt2img+highres, (384x640) x1.5.

![](samples/v1-04.png)

```
masterpiece, best quality, 1girl, solo, angry, brown eyes, blue hair, bangs, sidelocks, half updo, clenched hand, parted lips, [bodysuit|armored dress], long sleeves, gloves, armor, looking to the side, clenched teeth, (cowboy shot), night, full moon, forest, underexposure, professional lighting
```
</details>

## Merge ideas

The weights for merging Eleet model were optimized through an automatic procedure with scoring, but I didn't necessarily pick the best-scored one as the final version. Instead, I will evaluate a number of high-scored candidates and score their outputs manually by myself.

I conducted the evaluation mostly on anime girls topics (no doubt) but also considered the model performance on scenery images. I will consider:

* Prompt response
* Image aesthetic quality over 3 scenarios:
  1. txt2img under `--lowvram` mode on a low-end GPU
  2. txt2img under normal or `--medvram` mode on a better GPU
  3. txt2img + highres fix on the previous GPU
* Image flaws (color shift, illogical drawing, etc.) over the above scenarios


## License

CreativeML OpenRAIL-M.