Active filters: vLLM
QuantTrio/Step3-VL-10B-AWQ • Image-Text-to-Text • 10B • Updated • 3.79k • 4
QuantTrio/GLM-4.7-Flash-AWQ • Text Generation • 31B • Updated • 35.5k • 2
JunHowie/Qwen3-32B-GPTQ-Int4 • Text Generation • 33B • Updated • 799 • 4
QuantTrio/Qwen3-Coder-30B-A3B-Instruct-GPTQ-Int8 • Text Generation • 31B • Updated • 3.56k • 7
QuantTrio/Qwen3-VL-235B-A22B-Instruct-AWQ • Text Generation • 236B • Updated • 2.38k • 12
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ • Text Generation • 31B • Updated • 592k • 36
QuantTrio/Qwen3-VL-30B-A3B-Thinking-AWQ • Text Generation • 31B • Updated • 7.58k • 10
QuantTrio/Qwen3-VL-32B-Thinking-AWQ • Image-Text-to-Text • 33B • Updated • 1.93k • 4
Text Generation • 358B • Updated • 26k • 22
model-scope/glm-4-9b-chat-GPTQ-Int4 • Text Generation • 9B • Updated • 63 • 6
model-scope/glm-4-9b-chat-GPTQ-Int8 • Text Generation • 9B • Updated • 16 • 2
tclf90/qwen2.5-72b-instruct-gptq-int4 • Text Generation • 73B • Updated • 49 • 2
tclf90/qwen2.5-72b-instruct-gptq-int3 • Text Generation • 69B • Updated • 51
prithivMLmods/Nu2-Lupi-Qwen-14B • Text Generation • 15B • Updated • 1 • 2
mradermacher/Nu2-Lupi-Qwen-14B-GGUF • 15B • Updated • 86 • 1
mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF • 15B • Updated • 5.27k • 1
JunHowie/Qwen3-0.6B-GPTQ-Int4 • Text Generation • 0.6B • Updated • 369 • 1
JunHowie/Qwen3-0.6B-GPTQ-Int8 • Text Generation • 0.6B • Updated • 15
JunHowie/Qwen3-1.7B-GPTQ-Int4 • Text Generation • 2B • Updated • 1.17k • 1
JunHowie/Qwen3-1.7B-GPTQ-Int8 • Text Generation • 2B • Updated • 13
JunHowie/Qwen3-32B-GPTQ-Int8 • Text Generation • 33B • Updated • 334 • 3
JunHowie/Qwen3-30B-A3B-GPTQ-Int4 • Text Generation • 5B • Updated • 57 • 1
JunHowie/Qwen3-14B-GPTQ-Int8 • Text Generation • 15B • Updated • 56 • 1
JunHowie/Qwen3-14B-GPTQ-Int4 • Text Generation • 15B • Updated • 530 • 4
JunHowie/Qwen3-8B-GPTQ-Int8 • Text Generation • 8B • Updated • 50
JunHowie/Qwen3-8B-GPTQ-Int4 • Text Generation • 8B • Updated • 2.61k • 4
JunHowie/Qwen3-4B-GPTQ-Int4 • Text Generation • 4B • Updated • 693 • 1
JunHowie/Qwen3-4B-GPTQ-Int8 • Text Generation • 4B • Updated • 6
JunHowie/Qwen3-30B-A3B-GPTQ-Int8 • Text Generation • 8B • Updated • 591
QuantTrio/Qwen3-235B-A22B-GPTQ-Int8 • Text Generation • 235B • Updated • 6