AlekseyCalvin's picture
Update README.md
ad25321 verified
metadata
license: apache-2.0
tags:
  - Flux
  - FluxPipeline
  - text-to-image
  - flux schnell
  - image-generation
  - flux-diffusers
  - diffusers
  - photo
  - realism
pipeline_tag: text-to-image
library_name: diffusers
emoji: 🔜
language:
  - en
base_model: black-forest-labs/FLUX.1-schnell
instance_prompt: HST autochrome photo
widget:
  - text: >-
      HST style autochrome photo of a young woman playing poker against a
      blue-feathered dinosaur sitting across from her, moderately wrinkled
      blemished lined skin texture with pores
    output:
      url: Hstv2r.png
  - text: >-
      (w/ our Mayakovsky LoRA) HST photo of Mayakovsky sleeping, seeing a dream
      wherein rice shoots bud on lush green fields, text \MAYAKOVSKY SAW A
      DREAM\
    output:
      url: Hstv2Mayak.webp
  - text: >-
      HST style photo of a young woman playing a Telecaster electric guitar and
      singing the blues
    output:
      url: Hstv2guitar.webp
  - text: >-
      hst style photo of an aging dark-haired woman playing guitar in an old
      Soviet apartment
    output:
      url: Hstv2r3.png
  - text: >-
      hst style photo of a young dark-haired woman embracing a red-feathered
      dinosaur
    output:
      url: HistoricI.png
  - text: >-
      hst style autochrome vintage color photo of gigantic Rosa Luxemburg
      walking over iced-over planet Earth
    output:
      url: Hst2legs.png

Historic Color Soon® V.2

The second FLUX-based & open-licensed full-model checkpoint in our HSToric Color series.

Trained on HD scans of early color photos (circa 1900s-1910s) by Sergey Prokudin-Gorsky, who traveled and photographed widely in those years whilst perfecting implementations of a pioneering 3-color-composite photography technique.

This model is aimed at being useful for:

  • Quality generation at a low step-count (2 to 8, for most scenarios), with 4-step inference at around 768x768 routinely producing photorealistic outputs at a quality plausibly preferrable to that of Flux v.1 Dev.
  • Producing realistic images reminiscent of color film analog photography, exhibiting parallels to a broad spectrum of iconic instrumentalities and visual paradigms, from Autochrome-to-Kodachrome-to-Fujifilm-and-beyond.
  • Producing visuals with a vaguely "historical" or "lived-in" aesthetic character, striking chromaticity and luminosity dynamics, as well as textural/anatomical/skin details more reliably lifelike than other models at a comparable step-count/resource expenditure.
  • Extending realism options under an unrevokable commercial license.
Prompt
HST style autochrome photo of a young woman playing poker against a blue-feathered dinosaur sitting across from her, moderately wrinkled blemished lined skin texture with pores
Prompt
(w/ our Mayakovsky LoRA) HST photo of Mayakovsky sleeping, seeing a dream wherein rice shoots bud on lush green fields, text \MAYAKOVSKY SAW A DREAM\
Prompt
HST style photo of a young woman playing a Telecaster electric guitar and singing the blues
Prompt
hst style photo of an aging dark-haired woman playing guitar in an old Soviet apartment
Prompt
hst style photo of a young dark-haired woman embracing a red-feathered dinosaur
Prompt
hst style autochrome vintage color photo of gigantic Rosa Luxemburg walking over iced-over planet Earth

Testing Space:

You may try out the V2 checkpoint at one of our LoRA gallery spaces, along with many of our trained LoRAs!

Bit of Model History + TOOL SHARES:

Historic Color Soon® V.1 was fine-tuned by us from HumbleMikey's Pixelwave Schnell V.1 model which, in its turn, is a generalized base checkpoint trained from FLUX.1-schnell by Black Forest Labs, consolidating (in comparison w/vanilla-base-Schnell) further inference speed improvements (more reliable results at 2-3 steps), whilst raising the overall quality and consistency standards across most aesthetic categories and at every step.
This version, Historic Color Soon® V.2 was created through merging into V.1 a handful of LoRAs trained by us on the (fairly narrow) available range of realistic Flux checkpoint models that are exclusively Schnell-derived, so as to stay within the fairly open Apache 2.0 licensing domain (which was among our reasons to do all this in the first place).

Historic Color Soon® V.1 is available here in both Safetensors (fp8) & Diffusers formats.

To fine-tune Flux, try the dedicated Flux Training Notebook by Ostris.

Ostris' training adapter for Schnell is found here: ostris/FLUX.1-schnell-training-adapter.

To merge Flux* models and LoRAs, use the 'flux_merge_lora.py' script from the sd3-branch & /networks (subfolder) of Kohya-ss's sd-scripts git.

Bit of Actual History:

Prokudin-Gorsky's color photography technique would involve three photo-exposures, either simultaneous or sequential, using specialized color-spectrum filters (basically R.B.G.: red, blue, and green), rendering a subject/shot onto glass plates covered with light-emulsive mixture.
The photographer's focus on refining the developer and filter quality, in tandem with his incessant and wide-ranging experimentation, and his artful optimizations of glass plates (generally unwieldly, esp. for color, and by the 1910's already becoming outmoded for B&W on-location shoots, though elsewise extra reliable) ultimately led him to produce a color photography oeuvre of much greater fidelity and vividness than achieved by most of his contemporaries.

At the same time, the peculiarities of the photographer's method, coupled with his exceptionally hands-on execution thereof, would manifest in a range of idyosyncratic color, light, and motion artifacts common across the resulting prints.

Seldom marring the image as a whole, and less grave than the weaknesses of some cp-emerging autochrome techniques, the warm color hazes & flares framing many of Prokudin-Gorsky's prints constitute a kind of ephemeral signature.
Alongside some of the more subtle chromatic, textural, and (in some measure) figural characteristics of his work, these auras have reliably imprinted themselves into this and other LoRAs and Models within our gallery of fine-tunes for Flux and StableDiffusion3.5, fine-tuned exclusively on non-synthetic (human-made and pre-curated) open-access data from iconic, influential, and/or otherwise compelling historical sources.

We urge you to explore the works of Prokudin-Gorsky for yourself, at the wonderfully organized online archive at this link, featuring many hundreds of high quality downloadable scans of composite color photo prints from the photographer's original glass plate negatives, available at this site alongside relatively recent restorations of a substantial portion of the images. The original glass-plate negatives are currently held at and administrated by the Library of Congress in Washington, DC, USA.

Diffusers:

To use Historic Color SOON® V.2 with the 🧨 diffusers python library, first install or upgrade diffusers:

pip install -U diffusers

Then you can use FluxPipeline to run the model:

import torch
from diffusers import FluxPipeline
pipe = FluxPipeline.from_pretrained("AlekseyCalvin/HistoricColorSoonr_v2_FluxSchnell_Diffusers", torch_dtype=torch.bfloat16)
pipe.to("cuda")
pipe.enable_model_cpu_offload() #save some VRAM by offloading the model to CPU. Remove this if you have enough GPU power
prompt = "HST style autochrome film photograph portrait of 1910 woman playing poker against a purple feathered dinosaur, the green-eyed woman has moderately blemished skin with visible lines and pores, she smiles, film grain, Kodachrome"
image = pipe(
    prompt,
    guidance_scale=1.2,
    num_inference_steps=4,
    max_sequence_length=256,
    generator=torch.Generator("cpu").manual_seed(0)
).images[0]
image.save("hstcolor1.png")

To learn more check out the diffusers documentation.

Lastly, if you're into literature broadly and old modernist poetry specifically, check out our verse translations at SILVER AGE POETS!