---
license: apache-2.0
datasets:
- fhai50032/ControlNet-Poster
base_model:
- black-forest-labs/FLUX.1-dev
---
# Flux-ControlNet: Text-to-Image Diffusion Model with Caption Alignment
This repository hosts **Flux-ControlNet**, a custom ControlNet for the FLUX.1-dev diffusion model, designed to generate images that contain embedded text.

---
## Key Features
- **Flux-ControlNet**: A ControlNet adapted to FLUX.1-dev that steers text-to-image generation with a conditioning image.
- **Optimized Diffusion**: Built with Hugging Face Diffusers and Accelerate.
---
## How It Works
1. **Input**: Provide a text prompt and a conditioning image.
2. **Processing**: Flux-ControlNet processes the prompt together with the conditioning image and applies diffusion to synthesize an image aligned with both.
3. **Output**: A high-quality image with the text embedded in it.
---
## Training Parameters for Flux-ControlNet
```
General Parameters:
Model Architecture: Flux-based ControlNet Model
Image Resolution: 512x512
Batch Size: 4
Epochs: 50
Optimizer: AdamW
Learning Rate: 1e-5 (with cosine scheduler)
Weight Decay: 0.01
Gradient Clipping: 1.0
```
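To make the learning-rate setting concrete, here is a minimal sketch of a cosine schedule starting from the listed rate of 1e-5. It is not the training script used for this model. It assumes no warmup, a decay to zero, no gradient accumulation, and a single device; the function names and the dataset size of 1000 samples are hypothetical and chosen only for illustration.

```python
import math

BASE_LR = 1e-5    # learning rate from the table above
BATCH_SIZE = 4    # batch size from the table above
EPOCHS = 50       # epochs from the table above


def total_training_steps(num_samples: int,
                         batch_size: int = BATCH_SIZE,
                         epochs: int = EPOCHS) -> int:
    """Number of optimizer steps, assuming one step per batch."""
    steps_per_epoch = math.ceil(num_samples / batch_size)
    return steps_per_epoch * epochs


def cosine_lr(step: int,
              total_steps: int,
              base_lr: float = BASE_LR,
              min_lr: float = 0.0) -> float:
    """Cosine-annealed learning rate at `step`, without warmup (an assumption)."""
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    # Clamp the step so the rate stays at min_lr after the schedule ends.
    progress = min(max(step, 0), total_steps) / total_steps
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    # Hypothetical dataset size; the real size is not stated in this card.
    total = total_training_steps(num_samples=1000)
    for step in (0, total // 2, total):
        print(step, cosine_lr(step, total))
```

Under these assumptions the rate starts at 1e-5, passes through half that value at the midpoint, and reaches zero at the final step. A real run would more likely use a scheduler from PyTorch or Diffusers, possibly with warmup.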
## Inference Code
Inference code will be added soon.