Active filters: 4-bit
TheBloke/Wizard-Vicuna-30B-Uncensored-GPTQ
Text Generation • 4B • Updated • 555 • 589
Jalea96/DeepSeek-OCR-bnb-4bit-NF4
Image-Text-to-Text • 3B • Updated • 5.3k • 13
mlx-community/Kimi-K2-Thinking
Text Generation • 1T • Updated • 3.4k • 9
mlx-community/Llama-3.3-70B-Instruct-4bit
Text Generation • 11B • Updated • 1.35k • 32
MaziyarPanahi/AngelSlayer-12B-Unslop-Mell-RPMax-DARKNESS-v2-GGUF
Text Generation • 12B • Updated • 528 • 5
Qwen/Qwen3-8B-AWQ
Text Generation • 2B • Updated • 157k • 29
QuantTrio/GLM-4.5V-AWQ
Image-Text-to-Text • 17B • Updated • 1.61k • 18
mlx-community/gpt-oss-20b-MXFP4-Q4
Text Generation • 21B • Updated • 950 • 10
mlx-community/gpt-oss-20b-MXFP4-Q8
Text Generation • 21B • Updated • 802k • 18
mlx-community/Kimi-K2-Instruct-0905-mlx-DQ3_K_M
Text Generation • 1T • Updated • 1.94k • 9
mlx-community/GLM-4.6-4bit
Text Generation • 353B • Updated • 2.72k • 13
QuantTrio/MiniMax-M2-AWQ
Text Generation • 229B • Updated • 10.4k • 6
mlx-community/Kimi-K2-Thinking-4bit
Text Generation • 1T • Updated • 2.17k • 7
TheBloke/deepseek-coder-6.7B-instruct-AWQ
Text Generation • 1B • Updated • 217k • 19
TheBloke/CodeLlama-70B-Python-AWQ
Text Generation • 10B • Updated • 30 • 6
MaziyarPanahi/BioMistral-7B-GGUF
Text Generation • 7B • Updated • 1.02k • 53
Qwen/Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4
Text Generation • 2B • Updated • 711 • 47
MaziyarPanahi/zephyr-orpo-141b-A35b-v0.1-AWQ
Text Generation • 19B • Updated • 2 • 3
solidrust/KatyTestHistorical-SultrySilicon-7B-V2-AWQ
Text Generation • 1B • Updated • 2 • 1
MaziyarPanahi/Mistral-7B-Instruct-v0.3-GGUF
Text Generation • 7B • Updated • 101k • 128
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4
Text Generation • 2B • Updated • 232k • 78
hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4
Text Generation • 11B • Updated • 315k • 106
hugging-quants/Meta-Llama-3.1-405B-Instruct-GPTQ-INT4
Text Generation • 59B • Updated • 13 • 16
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
Text Generation • 5B • Updated • 380k • 86
Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int4
Image-Text-to-Text • 1B • Updated • 601 • 28
Qwen/Qwen2.5-72B-Instruct-GPTQ-Int4
Text Generation • 12B • Updated • 27.8k • 41
Qwen/Qwen2.5-3B-Instruct-AWQ
Text Generation • 1.0B • Updated • 4.84k • 14
mlx-community/Llama-3.2-3B-Instruct-4bit
Text Generation • 0.5B • Updated • 13k • 32
unsloth/Llama-3.2-1B-Instruct-bnb-4bit
Text Generation • 0.8B • Updated • 14.3k • 20
John6666/llama-joycaption-alpha-two-hf-llava-nf4
Image-to-Text • 3B • Updated • 19 • 18