Models: 133

Active filters: Quantization

nunchaku-ai/nunchaku-z-image-turbo

Text-to-Image • Updated 17 days ago • 37.9k • 150

nunchaku-ai/nunchaku-qwen-image-edit-2509

Text-to-Image • Updated Nov 16, 2025 • 55.7k • 273

nunchaku-ai/nunchaku

Text-to-Image • Updated 17 days ago • 34

nunchaku-ai/nunchaku-qwen-image

Text-to-Image • Updated Nov 16, 2025 • 57.7k • 247

nunchaku-ai/nunchaku-qwen-image-edit

Text-to-Image • Updated Nov 16, 2025 • 10.5k • 108

mit-han-lab/nunchaku-flux.1-kontext-dev

Image-to-Image • Updated Jul 21, 2025 • 10.4k • 167

nunchaku-ai/nunchaku-flux.1-krea-dev

Text-to-Image • Updated Nov 16, 2025 • 6.36k • 116

nunchaku-ai/nunchaku-sdxl-turbo

Text-to-Image • Updated Nov 16, 2025 • 869 • 11

spooknik/CyberRealistic-Flux-SVDQ

Text-to-Image • Updated Oct 23, 2025 • 211 • 10

thephimart/tinyllama-4x1.1b-moe.Q5_K_M.gguf

3B • Updated Jan 24, 2024 • 9 • 2

Irathernotsay/qwen2-1.5B-medical_qa-Finetune

Text Generation • 2B • Updated Jul 17, 2024 • 12

Riyuechang/Breeze-7B-PTT-Chat-v2_AWQ

Text Generation • 7B • Updated Sep 18, 2024 • 4

VPTQ-community/Meta-Llama-3.1-70B-Instruct-v16-k65536-32768-woft

7B • Updated Feb 25, 2025 • 4

VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-65536-woft

2B • Updated Mar 20, 2025 • 8

VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-4096-woft

2B • Updated Mar 20, 2025 • 6

VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-256-woft

2B • Updated Mar 20, 2025 • 25 • 1

VPTQ-community/Qwen2.5-72B-Instruct-v16-k65536-65536-woft

8B • Updated Feb 25, 2025 • 4 • 4

VPTQ-community/Meta-Llama-3.1-70B-Instruct-v16-k65536-65536-woft

8B • Updated Feb 25, 2025 • 15

VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-256-woft

9B • Updated Feb 25, 2025 • 7 • 1

VPTQ-community/Qwen2.5-7B-Instruct-v8-k65536-256-woft

2B • Updated Mar 20, 2025 • 2

VPTQ-community/Qwen2.5-72B-Instruct-v16-k65536-32768-woft

8B • Updated Feb 25, 2025 • 3 • 3

VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k32768-0-woft

6B • Updated Feb 25, 2025 • 3 • 1

VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-65536-woft

11B • Updated Feb 25, 2025 • 1 • 2

VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k16384-0-woft

6B • Updated Feb 25, 2025 • 1 • 2

VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-0-woft

7B • Updated Feb 25, 2025 • 6 • 2

VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-4-woft-duplicated

8B • Updated Feb 25, 2025 • 2 • 1

VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-1024-woft

26B • Updated Feb 25, 2025 • 2 • 1

VPTQ-community/Meta-Llama-3.1-405B-Instruct-v8-k4096-0-woft

23B • Updated Feb 25, 2025 • 6 • 1

VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-64-woft

22B • Updated Feb 25, 2025 • 24 • 3

VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k32768-32768-woft

29B • Updated Feb 26, 2025 • 1 • 1