-
-
-
-
-
-
Inference Providers
Active filters:
vLLM
Text Generation
•
358B
•
Updated
•
18.5k
•
25
QuantTrio/GLM-4.7-Flash-AWQ
Text Generation
•
31B
•
Updated
•
132k
•
5
QuantTrio/Qwen3-235B-A22B-Instruct-2507-GPTQ-Int4-Int8Mix
Text Generation
•
248B
•
Updated
•
451
•
3
QuantTrio/Qwen3-235B-A22B-Thinking-2507-GPTQ-Int4-Int8Mix
Text Generation
•
253B
•
Updated
•
20
•
3
QuantTrio/Qwen3-VL-235B-A22B-Instruct-AWQ
Text Generation
•
236B
•
Updated
•
6.84k
•
13
QuantTrio/Qwen3-VL-235B-A22B-Thinking-AWQ
Text Generation
•
236B
•
Updated
•
1.21k
•
7
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ
Text Generation
•
31B
•
Updated
•
475k
•
37
QuantTrio/DeepSeek-V3.2-AWQ
Text Generation
•
685B
•
Updated
•
54.5k
•
11
Image-Text-to-Text
•
138B
•
Updated
•
476
•
1
model-scope/glm-4-9b-chat-GPTQ-Int4
Text Generation
•
9B
•
Updated
•
103
•
6
model-scope/glm-4-9b-chat-GPTQ-Int8
Text Generation
•
9B
•
Updated
•
4
•
2
tclf90/qwen2.5-72b-instruct-gptq-int4
Text Generation
•
73B
•
Updated
•
24
•
2
tclf90/qwen2.5-72b-instruct-gptq-int3
Text Generation
•
69B
•
Updated
•
28
prithivMLmods/Nu2-Lupi-Qwen-14B
Text Generation
•
15B
•
Updated
•
1
•
2
mradermacher/Nu2-Lupi-Qwen-14B-GGUF
15B
•
Updated
•
135
•
1
mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF
15B
•
Updated
•
5.33k
•
1
JunHowie/Qwen3-0.6B-GPTQ-Int4
Text Generation
•
0.6B
•
Updated
•
340
•
1
JunHowie/Qwen3-0.6B-GPTQ-Int8
Text Generation
•
0.6B
•
Updated
•
29
JunHowie/Qwen3-1.7B-GPTQ-Int4
Text Generation
•
2B
•
Updated
•
1.47k
•
1
JunHowie/Qwen3-1.7B-GPTQ-Int8
Text Generation
•
2B
•
Updated
•
14
JunHowie/Qwen3-32B-GPTQ-Int4
Text Generation
•
33B
•
Updated
•
9.7k
•
4
JunHowie/Qwen3-32B-GPTQ-Int8
Text Generation
•
33B
•
Updated
•
302
•
3
JunHowie/Qwen3-30B-A3B-GPTQ-Int4
Text Generation
•
5B
•
Updated
•
9
•
1
JunHowie/Qwen3-14B-GPTQ-Int8
Text Generation
•
15B
•
Updated
•
116
•
1
JunHowie/Qwen3-14B-GPTQ-Int4
Text Generation
•
15B
•
Updated
•
965
•
4
JunHowie/Qwen3-8B-GPTQ-Int8
Text Generation
•
8B
•
Updated
•
87
JunHowie/Qwen3-8B-GPTQ-Int4
Text Generation
•
8B
•
Updated
•
1.63k
•
4
JunHowie/Qwen3-4B-GPTQ-Int4
Text Generation
•
4B
•
Updated
•
804
•
1
JunHowie/Qwen3-4B-GPTQ-Int8
Text Generation
•
4B
•
Updated
•
23
JunHowie/Qwen3-30B-A3B-GPTQ-Int8
Text Generation
•
8B
•
Updated
•
169