Active filters: 4-bit
Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int4 • Image-Text-to-Text • 1B • Updated • 2.57k • 27
MaziyarPanahi/Yi-Coder-9B-Chat-GGUF • Text Generation • 9B • Updated • 75.7k • 6
Qwen/Qwen2.5-14B-Instruct-GPTQ-Int4 • Text Generation • 3B • Updated • 21.4k • 24
Qwen/Qwen2.5-7B-Instruct-AWQ • Text Generation • 2B • Updated • 550k • 30
unsloth/Qwen2.5-3B-Instruct-bnb-4bit • Text Generation • 2B • Updated • 7.22k • 10
unsloth/Qwen2.5-Coder-7B-bnb-4bit • Text Generation • 4B • Updated • 20.8k • 9
unsloth/Llama-3.2-3B-bnb-4bit • Text Generation • 2B • Updated • 13.6k • 20
unsloth/Llama-3.2-3B-Instruct-bnb-4bit • Text Generation • 2B • Updated • 34.3k • 29
AMead10/Llama-3.2-3B-Instruct-AWQ • Text Generation • 1B • Updated • 556 • 3
shuyuej/Llama-3.2-1B-GPTQ • 0.4B • Updated • 203 • 1
Qwen/Qwen2.5-Coder-32B-Instruct-GPTQ-Int4 • Text Generation • 6B • Updated • 37.6k • 21
mlx-community/Qwen2.5-Coder-32B-Instruct-4bit • Text Generation • 5B • Updated • 147 • 10
unsloth/Qwen2.5-Coder-0.5B-Instruct-bnb-4bit • Text Generation • 0.3B • Updated • 1.78k • 4
mlx-community/Llama-3.3-70B-Instruct-4bit • Text Generation • 11B • Updated • 1.02k • 30
unsloth/Llama-3.3-70B-Instruct-bnb-4bit • Text Generation • 37B • Updated • 22.7k • 51
sandbox-ai/Llama-3.1-Tango-70b-bnb_4b • Text Generation • 37B • Updated • 4
Satwik11/Microsoft-phi-4-Instruct-AutoRound-GPTQ-4bit • 3B • Updated • 48 • 2
Qwen/Qwen2.5-VL-7B-Instruct-AWQ • Image-Text-to-Text • 3B • Updated • 250k • 92
nicoboss/DeepSeek-R1-Distill-Llama-70B-Uncensored-v2-Unbiased-Reasoner-Lora
mlx-community/OLMoE-1B-7B-0125-Instruct-4bit • Text Generation • 1B • Updated • 29 • 2
mlx-community/OLMoE-1B-7B-0125-4bit • Text Generation • 1B • Updated • 4 • 1
mlx-community/Stockmark-2-100B-Instruct-beta-4bit • Text Generation • 15B • Updated • 19 • 2
unsloth/gemma-3-27b-it-bnb-4bit • Image-Text-to-Text • 15B • Updated • 7.73k • 18
mlx-community/DeepSeek-V3-0324-4bit • Text Generation • 105B • Updated • 780 • 38
unsloth/Qwen3-4B-unsloth-bnb-4bit • Text Generation • 3B • Updated • 36.3k • 14
unsloth/Qwen3-0.6B-unsloth-bnb-4bit • Text Generation • 0.4B • Updated • 67.8k • 18
unsloth/Qwen3-8B-bnb-4bit • 5B • Updated • 632k • 3
MaziyarPanahi/Qwen3-0.6B-GGUF • Text Generation • 0.8B • Updated • 74.2k • 7
Qwen/Qwen3-8B-AWQ • Text Generation • 2B • Updated • 228k • 27
Qwen/Qwen3-30B-A3B-GPTQ-Int4 • Text Generation • 5B • Updated • 72.9k • 36