Edit model card

FLUX.1-schnell-GGUF

Original Model

black-forest-labs/FLUX.1-schnell

Run with sd-api-server

Quantized GGUF Models

Name Quant method Bits Size Use case
ae-f16.gguf f16 16 168 MB
ae.safetensors f32 32 335 MB
clip_l-Q8_0.gguf Q8_0 8 131 MB
clip_l.safetensors f16 16 246 MB
flux1-schnell-Q4_0.gguf Q4_0 4 6.69 GB
flux1-schnell-Q4_1.gguf Q4_1 4 7.43 GB
flux1-schnell-Q5_0.gguf Q5_0 5 8.18 GB
flux1-schnell-Q5_1.gguf Q5_1 5 8.92 GB
flux1-schnell-Q8_0.gguf Q8_0 8 12.6 GB
flux1-schnell.safetensors f16 16 23.8 GB
t5xxl-Q2_K.gguf Q2_K 2 1.61 GB
t5xxl-Q3_K.gguf Q3_K 3 2.10 GB
t5xxl-Q4_0.gguf Q4_0 4 2.75 GB
t5xxl-Q4_K.gguf Q4_K 4 2.75 GB
t5xxl-Q5_0.gguf Q5_0 5 3.36 GB
t5xxl-Q5_1.gguf Q5_1 5 3.67 GB
t5xxl-Q8_0.gguf Q8_0 8 5.20 GB
t5xxl_fp16.safetensors f16 16 9.79 GB

Quantized with stable-diffusion.cpp master-64d231f.

Downloads last month
928
GGUF
Model size
4.89B params
Architecture
undefined

2-bit

3-bit

4-bit

5-bit

8-bit

16-bit

Inference Examples
Unable to determine this model's library. Check the docs .

Model tree for second-state/FLUX.1-schnell-GGUF

Quantized
(13)
this model

Collection including second-state/FLUX.1-schnell-GGUF