|
--- |
|
license: apache-2.0 |
|
datasets: |
|
- fhai50032/ControlNet-Poster |
|
base_model: |
|
- black-forest-labs/FLUX.1-dev |
|
--- |
|
|
|
|
|
# Flux-ControlNet: Text-to-Image Diffusion Model with Caption Alignment |
|
|
|
This repository hosts **Flux-ControlNet**, a customized ControlNet-based diffusion model designed for generating text-embedded images. |
|
|
|
--- |
|
|
|
## Key Features |
|
- **Flux-ControlNet**: Enhanced ControlNet architecture for better control over text-to-image generation. |
|
- **Optimized Diffusion**: Uses Hugging Face Diffusers and Accelerate for streamlined performance. |
|
|
|
--- |
|
|
|
## How It Works |
|
1. **Input**: Provide text prompts and conditioning image. |
|
2. **Processing**: |
|
- Flux-ControlNet processes the text and applies diffusion to synthesize aligned images. |
|
3. **Output**: High-quality, text-embedded images. |
|
|
|
--- |
|
|
|
# Training Parameters for Flux-ControlNet |
|
``` |
|
General Parameters: |
|
Model Architecture: Flux-based ControlNet Model |
|
Image Resolution: 512x512 |
|
Batch Size: 4 |
|
Epochs: 50 |
|
Optimizer: AdamW |
|
Learning Rate: 1e-5 (with cosine schedular) |
|
Weight Decay: 0.01 |
|
Gradient Clipping: 1.0 |
|
``` |
|
|
|
# Inference Code |
|
Soon to be added |
|
|