Diffusers
Safetensors
fhai50032's picture
Create README.md
02e4787 verified
metadata
license: apache-2.0
datasets:
  - fhai50032/ControlNet-Poster
base_model:
  - black-forest-labs/FLUX.1-dev

Flux-ControlNet: Text-to-Image Diffusion Model with Caption Alignment

This repository hosts Flux-ControlNet, a customized ControlNet-based diffusion model designed for generating text-embedded images.


Key Features

  • Flux-ControlNet: Enhanced ControlNet architecture for better control over text-to-image generation.
  • Optimized Diffusion: Uses Hugging Face Diffusers and Accelerate for streamlined performance.

How It Works

  1. Input: Provide text prompts and conditioning image.
  2. Processing:
    • Flux-ControlNet processes the text and applies diffusion to synthesize aligned images.
  3. Output: High-quality, text-embedded images.

Training Parameters for Flux-ControlNet

General Parameters:
  Model Architecture: Flux-based ControlNet Model
  Image Resolution: 512x512
  Batch Size: 4
  Epochs: 50
  Optimizer: AdamW
  Learning Rate: 1e-5 (with cosine schedular)
  Weight Decay: 0.01
  Gradient Clipping: 1.0

Inference Code

Soon to be added