Diffusers
Safetensors
File size: 1,126 Bytes
02e4787
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
---
license: apache-2.0
datasets:
- fhai50032/ControlNet-Poster
base_model:
- black-forest-labs/FLUX.1-dev
---


# Flux-ControlNet: Text-to-Image Diffusion Model with Caption Alignment

This repository hosts **Flux-ControlNet**, a customized ControlNet-based diffusion model designed for generating text-embedded images. 

---

## Key Features
- **Flux-ControlNet**: Enhanced ControlNet architecture for better control over text-to-image generation.
- **Optimized Diffusion**: Uses Hugging Face Diffusers and Accelerate for streamlined performance.

---

## How It Works
1. **Input**: Provide text prompts and conditioning image.
2. **Processing**: 
   - Flux-ControlNet processes the text and applies diffusion to synthesize aligned images.
3. **Output**: High-quality, text-embedded images.

---

# Training Parameters for Flux-ControlNet
```
General Parameters:
  Model Architecture: Flux-based ControlNet Model
  Image Resolution: 512x512
  Batch Size: 4
  Epochs: 50
  Optimizer: AdamW
  Learning Rate: 1e-5 (with cosine schedular)
  Weight Decay: 0.01
  Gradient Clipping: 1.0
```

# Inference Code
    Soon to be added