Diffusers
Safetensors
fhai50032's picture
Create README.md
02e4787 verified
---
license: apache-2.0
datasets:
- fhai50032/ControlNet-Poster
base_model:
- black-forest-labs/FLUX.1-dev
---
# Flux-ControlNet: Text-to-Image Diffusion Model with Caption Alignment
This repository hosts **Flux-ControlNet**, a customized ControlNet-based diffusion model designed for generating text-embedded images.
---
## Key Features
- **Flux-ControlNet**: Enhanced ControlNet architecture for better control over text-to-image generation.
- **Optimized Diffusion**: Uses Hugging Face Diffusers and Accelerate for streamlined performance.
---
## How It Works
1. **Input**: Provide text prompts and conditioning image.
2. **Processing**:
- Flux-ControlNet processes the text and applies diffusion to synthesize aligned images.
3. **Output**: High-quality, text-embedded images.
---
# Training Parameters for Flux-ControlNet
```
General Parameters:
Model Architecture: Flux-based ControlNet Model
Image Resolution: 512x512
Batch Size: 4
Epochs: 50
Optimizer: AdamW
Learning Rate: 1e-5 (with cosine schedular)
Weight Decay: 0.01
Gradient Clipping: 1.0
```
# Inference Code
Soon to be added