metadata

license: creativeml-openrail-m
pipeline_tag: text-to-image
library_name: diffusers
tags:
  - art
  - text-to-image
  - stable-diffusion
  - lora
  - diffusers
widget:
  - text: >-
      A refined, noblewoman standing gracefully in a lush garden. She has an
      elaborate updo adorned with delicate pearls and flowers, and she wears a
      flowing, pastel-colored gown with intricate lace details. Her serene
      expression is highlighted by soft, natural lighting. The background
      features blooming roses and a classical stone fountain, creating a sense
      of timeless elegance and tranquility.
    output:
      url: images/image_3.png
  - text: >-
      A bearded inventor with wild, curly hair, standing confidently in his
      steampunk workshop. He wears brass goggles on his forehead, a leather
      apron over a white shirt with rolled-up sleeves, and fingerless gloves.
      His hands are slightly stained with grease, and he holds a small,
      intricate mechanical device. The background is cluttered with gears,
      blueprints, and tools, illuminated by the warm glow of oil lamps, giving
      the scene a creative, industrious atmosphere.
    output:
      url: images/image_4.png
  - text: >-
      A cyberpunk hacker with neon blue hair and cybernetic implants on their
      face, seated in front of multiple holographic screens in a dark, high-tech
      room. They wear a sleek, black leather jacket with glowing circuit
      patterns, and their fingers are covered in cybernetic gloves that
      interface directly with the floating data. The background is a chaotic mix
      of digital code, flashing lights, and wires, emphasizing the high-tech,
      futuristic setting.
    output:
      url: images/image_5.png
base_model: stabilityai/stable-diffusion-xl-base-1.0

cutton_doll_lora-xl

Need more performance?

Use it with a LCM Lora!

Use 8 steps and guidance scale of 1.5 1.2 Lora strength for the Pixel Art XL works better

from diffusers import DiffusionPipeline, LCMScheduler
import torch

model_id = "stabilityai/stable-diffusion-xl-base-1.0"
lcm_lora_id = "latent-consistency/lcm-lora-sdxl"
pipe = DiffusionPipeline.from_pretrained(model_id, variant="fp16")
pipe.scheduler = LCMScheduler.from_config(pipe.scheduler.config)

pipe.load_lora_weights(lcm_lora_id, adapter_name="lora")
pipe.load_lora_weights("./cutton_doll_lora-xl.safetensors", adapter_name="doll_sdxl")

pipe.set_adapters(["lora", "doll_sdxl"], adapter_weights=[1.0, 1.2])
pipe.to(device="cuda", dtype=torch.float16)

prompt = "a chibi doll, cute"
negative_prompt = "3d render, realistic"

num_images = 9

for i in range(num_images):
    img = pipe(
        prompt=prompt,
        negative_prompt=negative_prompt,
        num_inference_steps=8,
        guidance_scale=1.5,
    ).images[0]
    
    img.save(f"lcm_lora_{i}.png")

Tips:

Don't use refiner

Works great with only 1 text encoder

No style prompt required

No trigger keyword require

Works great with isometric and non-isometric

Works with 0.9 and 1.0

Download model

Weights for this model are available in Safetensors format. Download them in the Files & versions tab.

Changelog

v1: Initial release