Edit Models filters

Apps

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

38,867

Full-text search

Active filters: 4-bit

Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int4

Image-Text-to-Text • 1B • Updated Sep 21, 2024 • 2.57k • 27

MaziyarPanahi/Yi-Coder-9B-Chat-GGUF

Text Generation • 9B • Updated Sep 4, 2024 • 75.7k • 6

Qwen/Qwen2.5-14B-Instruct-GPTQ-Int4

Text Generation • 3B • Updated Oct 9, 2024 • 21.4k • 24

Qwen/Qwen2.5-7B-Instruct-AWQ

Text Generation • 2B • Updated Oct 9, 2024 • 550k • 30

unsloth/Qwen2.5-3B-Instruct-bnb-4bit

Text Generation • 2B • Updated Feb 6 • 7.22k • 10

unsloth/Qwen2.5-Coder-7B-bnb-4bit

Text Generation • 4B • Updated Nov 12, 2024 • 20.8k • 9

unsloth/Llama-3.2-3B-bnb-4bit

Text Generation • 2B • Updated Jun 2 • 13.6k • 20

unsloth/Llama-3.2-3B-Instruct-bnb-4bit

Text Generation • 2B • Updated Jun 2 • 34.3k • 29

AMead10/Llama-3.2-3B-Instruct-AWQ

Text Generation • 1B • Updated Sep 25, 2024 • 556 • 3

shuyuej/Llama-3.2-1B-GPTQ

0.4B • Updated Sep 25, 2024 • 203 • 1

Qwen/Qwen2.5-Coder-32B-Instruct-GPTQ-Int4

Text Generation • 6B • Updated Nov 18, 2024 • 37.6k • 21

mlx-community/Qwen2.5-Coder-32B-Instruct-4bit

Text Generation • 5B • Updated Nov 11, 2024 • 147 • 10

unsloth/Qwen2.5-Coder-0.5B-Instruct-bnb-4bit

Text Generation • 0.3B • Updated Nov 12, 2024 • 1.78k • 4

mlx-community/Llama-3.3-70B-Instruct-4bit

Text Generation • 11B • Updated Dec 6, 2024 • 1.02k • 30

unsloth/Llama-3.3-70B-Instruct-bnb-4bit

Text Generation • 37B • Updated Jan 7 • 22.7k • 51

sandbox-ai/Llama-3.1-Tango-70b-bnb_4b

Text Generation • 37B • Updated Jan 3 • 4

Satwik11/Microsoft-phi-4-Instruct-AutoRound-GPTQ-4bit

3B • Updated Jan 10 • 48 • 2

Qwen/Qwen2.5-VL-7B-Instruct-AWQ

Image-Text-to-Text • 3B • Updated Apr 6 • 250k • 92

nicoboss/DeepSeek-R1-Distill-Llama-70B-Uncensored-v2-Unbiased-Reasoner-Lora

Updated Feb 16 • 1 • 2

mlx-community/OLMoE-1B-7B-0125-Instruct-4bit

Text Generation • 1B • Updated Mar 4 • 29 • 2

mlx-community/OLMoE-1B-7B-0125-4bit

Text Generation • 1B • Updated Mar 5 • 4 • 1

mlx-community/Stockmark-2-100B-Instruct-beta-4bit

Text Generation • 15B • Updated Mar 8 • 19 • 2

unsloth/gemma-3-27b-it-bnb-4bit

Image-Text-to-Text • 15B • Updated May 12 • 7.73k • 18

mlx-community/DeepSeek-V3-0324-4bit

Text Generation • 105B • Updated Aug 20 • 780 • 38

unsloth/Qwen3-4B-unsloth-bnb-4bit

Text Generation • 3B • Updated May 13 • 36.3k • 14

unsloth/Qwen3-0.6B-unsloth-bnb-4bit

Text Generation • 0.4B • Updated Jun 23 • 67.8k • 18

unsloth/Qwen3-8B-bnb-4bit

5B • Updated May 13 • 632k • 3

MaziyarPanahi/Qwen3-0.6B-GGUF

Text Generation • 0.8B • Updated Apr 28 • 74.2k • 7

Qwen/Qwen3-8B-AWQ

Text Generation • 2B • Updated May 21 • 228k • 27

Qwen/Qwen3-30B-A3B-GPTQ-Int4

Text Generation • 5B • Updated May 21 • 72.9k • 36