
Usage

Enter a prompt and click Generate. Roll the 🎲 for a random prompt.

Prompting

Positive and negative prompts are embedded by Compel for weighting. See syntax features to learn more and read Civitai's guide on prompting for best practices.
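
A minimal sketch of how weighted prompts flow through Compel into a diffusers pipeline. The checkpoint, prompt text, and weights are illustrative; the app's own wiring may differ:

```python
import torch
from compel import Compel
from diffusers import StableDiffusionPipeline

# Any SD 1.5 checkpoint works here; the repo id is illustrative.
pipe = StableDiffusionPipeline.from_pretrained(
    "Lykon/dreamshaper-8", torch_dtype=torch.float16
).to("cuda")

compel = Compel(tokenizer=pipe.tokenizer, text_encoder=pipe.text_encoder)

# "+"/"-" nudge token weights up or down; numeric weights also work.
prompt_embeds = compel("a corgi++ astronaut, (detailed fur)1.2, studio lighting")
negative_embeds = compel("blurry, low quality")
[prompt_embeds, negative_embeds] = compel.pad_conditioning_tensors_to_same_length(
    [prompt_embeds, negative_embeds]
)

image = pipe(
    prompt_embeds=prompt_embeds,
    negative_prompt_embeds=negative_embeds,
    num_inference_steps=30,
).images[0]
```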

Arrays

Arrays allow you to generate different images from a single prompt. For example, [[cat,corgi]] will expand into 2 separate prompts. Make sure Images is set accordingly (e.g., 2). Only works for the positive prompt. Inspired by Fooocus.
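
A small sketch of the expansion logic described above; the helper below is hypothetical and the app's actual parser may differ:

```python
import re

# Expand a single [[a,b,...]] group into one prompt per option.
def expand_arrays(prompt: str) -> list[str]:
    match = re.search(r"\[\[(.+?)\]\]", prompt)
    if not match:
        return [prompt]
    options = [o.strip() for o in match.group(1).split(",")]
    return [prompt[:match.start()] + o + prompt[match.end():] for o in options]

print(expand_arrays("a photo of a [[cat,corgi]] in the snow"))
# ['a photo of a cat in the snow', 'a photo of a corgi in the snow']
```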

Embeddings

Select one or more negative textual inversion embeddings. Fast Negative and Bad Dream can be used standalone or together; Unrealistic Dream should be combined with one of the others.
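
A sketch of how negative embeddings are wired into a diffusers pipeline, reusing the `pipe` from the Prompting sketch. The file paths and trigger tokens are placeholders for wherever the embedding files live:

```python
# Load the textual inversion files and give each a trigger token.
pipe.load_textual_inversion("./embeddings/FastNegativeV2.pt", token="FastNegativeV2")
pipe.load_textual_inversion("./embeddings/BadDream.pt", token="BadDream")

image = pipe(
    prompt="portrait photo of a woman, natural light",
    negative_prompt="FastNegativeV2, BadDream",  # trigger tokens go in the negative prompt
    num_inference_steps=30,
).images[0]
```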

Styles

Styles are prompt templates from twri's sdxl_prompt_styler Comfy node. Start with a subject like "cat", pick a style, and iterate from there.
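
Conceptually, a style is just a pair of templates with a `{prompt}` placeholder for the subject. The template text below is illustrative, not the exact string from sdxl_prompt_styler:

```python
style = {
    "name": "cinematic",
    "prompt": "cinematic still of {prompt}, shallow depth of field, film grain, highly detailed",
    "negative_prompt": "cartoon, painting, illustration, low quality",
}

subject = "cat"
positive = style["prompt"].format(prompt=subject)
negative = style["negative_prompt"]
print(positive)
# cinematic still of cat, shallow depth of field, film grain, highly detailed
```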

Scale

Rescale up to 4x using Real-ESRGAN from ai-forever.
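
A sketch of 4x upscaling with the `RealESRGAN` package from ai-forever's repo; the weight and image paths are assumptions:

```python
import torch
from PIL import Image
from RealESRGAN import RealESRGAN

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = RealESRGAN(device, scale=4)
model.load_weights("weights/RealESRGAN_x4.pth", download=True)

image = Image.open("input.png").convert("RGB")
sr_image = model.predict(image)  # returns a PIL image 4x the input size
sr_image.save("output_4x.png")
```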

Models

Each model checkpoint has a different aesthetic.

Image-to-Image

The 🖼️ Image tab enables the image-to-image and IP-Adapter pipelines. Either use the image input or select a generation from the gallery. To disable, simply clear the image input (the x overlay button).

Denoising strength is essentially how much the generation will differ from the input image. A value of 0 will be identical to the original, while 1 will be a completely new image. You may also want to increase the number of inference steps. Only applies to the image-to-image input.
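
A hedged img2img sketch (checkpoint, prompt, and values are illustrative). In diffusers, roughly `strength * num_inference_steps` denoising steps actually run on top of the input image, which is why raising the step count helps at higher strengths:

```python
import torch
from diffusers import StableDiffusionImg2ImgPipeline
from diffusers.utils import load_image

pipe = StableDiffusionImg2ImgPipeline.from_pretrained(
    "Lykon/dreamshaper-8", torch_dtype=torch.float16  # any SD 1.5 checkpoint
).to("cuda")

init_image = load_image("input.png").resize((512, 512))

# strength=0.6 keeps the overall composition of the input image.
image = pipe(
    prompt="a watercolor painting of a lighthouse at dusk",
    image=init_image,
    strength=0.6,
    num_inference_steps=40,
).images[0]
```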

IP-Adapter

In an image-to-image pipeline, the input image is used as the initial latent. With IP-Adapter, the input image is processed by a separate image encoder and the encoded features are used as conditioning along with the text prompt.

For capturing faces, enable IP-Adapter Face to use the full-face model. You should use an input image that is mostly a face and it should be high quality. You can generate fake portraits with Realistic Vision to experiment. Note that you'll never get true identity preservation without an advanced pipeline like InstantID, which combines many techniques.
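
A sketch of IP-Adapter conditioning with diffusers, reusing the `pipe` from the Prompting sketch; the scale, prompt, and input image are illustrative:

```python
from diffusers.utils import load_image

# Load the SD 1.5 IP-Adapter weights; swap in "ip-adapter-full-face_sd15.bin"
# for the full-face variant described above.
pipe.load_ip_adapter("h94/IP-Adapter", subfolder="models", weight_name="ip-adapter_sd15.bin")
pipe.set_ip_adapter_scale(0.6)  # how strongly the image conditioning is applied

face = load_image("portrait.png")
image = pipe(
    prompt="professional headshot, studio lighting",
    ip_adapter_image=face,
    num_inference_steps=30,
).images[0]
```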

Advanced

DeepCache

DeepCache caches lower UNet layers and reuses them every Interval steps, trading quality for speed (see the sketch after the list):

  • 1: no caching (default)
  • 2: more quality
  • 3: balanced
  • 4: more speed
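
A minimal sketch using the DeepCache package, reusing the `pipe` from the Prompting sketch and assuming the UI's Interval corresponds to `cache_interval`:

```python
from DeepCache import DeepCacheSDHelper

helper = DeepCacheSDHelper(pipe=pipe)
helper.set_params(cache_interval=3, cache_branch_id=0)  # branch 0 is the shallowest
helper.enable()

image = pipe("a cozy cabin in a snowy forest", num_inference_steps=30).images[0]
helper.disable()  # turn caching back off
```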

FreeU

FreeU re-weights the contributions from the UNet's skip connections and backbone feature maps. Can sometimes improve image quality.
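
diffusers exposes this directly on the pipeline via `enable_freeu`. The values below are the ones commonly suggested for SD 1.5; whether the app uses the same ones is an assumption:

```python
# Re-weight backbone (b1, b2) and skip-connection (s1, s2) contributions.
pipe.enable_freeu(s1=0.9, s2=0.2, b1=1.2, b2=1.4)
# ... generate as usual ...
pipe.disable_freeu()
```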

Clip Skip

When enabled, the last CLIP layer is skipped. Can sometimes improve image quality.
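
Assuming the underlying pipeline is diffusers, the equivalent is the `clip_skip` call argument; the prompt below is illustrative:

```python
# clip_skip=1 skips the final CLIP text-encoder layer, matching the behaviour
# described above.
image = pipe(
    "an anime girl with silver hair, detailed eyes",
    clip_skip=1,
    num_inference_steps=30,
).images[0]
```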

Tiny VAE

Enable madebyollin/taesd for near-instant latent decoding with a minor loss in detail. Useful for development.
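
A sketch of swapping in the tiny VAE on the `pipe` from the Prompting sketch:

```python
import torch
from diffusers import AutoencoderTiny

# Replace the full VAE with TAESD: near-instant decoding, slightly less detail.
pipe.vae = AutoencoderTiny.from_pretrained(
    "madebyollin/taesd", torch_dtype=torch.float16
).to("cuda")
```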