Models: 705

Active filters: llama.cpp

pravdin/meta-I-Hermes-3-dare_linear-gguf

Text Generation • 3B • Updated Jun 28 • 20

muranAI/Mistral-Small-3.1-24B-Instruct-2503-GGUF

Text Generation • 24B • Updated Jun 28 • 140 • 1

Vikhrmodels/QVikhr-3-4B-Instruction-GGUF

4B • Updated Jun 30 • 927 • 4

agentlans/Qwen3-4B-multilingual-sft-GGUF

Text Generation • 4B • Updated Jun 29 • 58

aman2024/NuExtract-2-2B-GGUF

2B • Updated Jun 29 • 68

pravdin/Qwen2.5-1.5B-DeepSeek-R1-dare_linear-gguf

Text Generation • 2B • Updated Jul 1 • 14

ReallyFloppyPenguin/OCRFlux-3B-GGUF

3B • Updated Jul 2 • 51 • 1

Darkhn-Quants/L3.3-70B-Animus-V5-Pro-GGUF

71B • Updated Jul 14 • 288 • 1

Darkhn-Quants/M3.2-24B-Animus-V5-Pro-GGUF

24B • Updated Jul 12 • 26 • 1

codebasic/Qwen3-0.6B-GGUF

0.8B • Updated Jul 6 • 23

dzur658/Polaris-4B-Preview-IQ-GGUF

4B • Updated Jul 12 • 926

Darkhn-Quants/M3.2-24B-Animus-V5.1-Pro-GGUF

24B • Updated Jul 22 • 179 • 1

ReallyFloppyPenguin/Llama-3.1-Centaur-70B-GGUF

ReallyFloppyPenguin/Dhanishtha-2.0-preview-GGUF

ReallyFloppyPenguin/DeepSeek-R1-Distill-Qwen-1.5B-GGUF

2B • Updated Jul 8 • 189

ReallyFloppyPenguin/DeepSWE-Preview-GGUF

33B • Updated Jul 8 • 81

JonathanMiddleton/Qwen3-Embedding-8B-GGUF

8B • Updated Jul 14 • 769 • 4

Makatia/mistral-7b-instruct-v0.2.Q8_0-Q8_0.gguf

7B • Updated Jul 13 • 8

Arivukkarasu/TinyLlama-1.1B-Chat-GGUF

1B • Updated Jul 15 • 26

Arivukkarasu/Mistral-7B-Instruct-v0.3-GGUF

7B • Updated Jul 15 • 17

PJEDeveloper/Mistral_Nemo_Instruct_2407-F16.gguf-Q4_K_M

12B • Updated Jul 23 • 17

theprint/Zeth-Gemma3-4B-GGUF

Text Generation • 5B • Updated Aug 16 • 45

HackNetAyush/smollm2-135M-instruct-gguf-q8

Text Generation • 0.1B • Updated Jul 20 • 102 • 2

sugiv/cardvaultplus-500m-gguf

Image-to-Text • 0.4B • Updated Jul 22 • 73 • 2

PJEDeveloper/mistralai_Mistral-7B-Instruct-v0.3-F16.gguf-Q5_K_M

7B • Updated Jul 23 • 60

Bastion-AI/SmolLM3-3B-GGUF

3B • Updated Jul 23 • 192 • 1

PJEDeveloper/mistralai_Mistral-7B-Instruct-v0.2-Q5_K_M

7B • Updated Jul 23 • 17

Darkhn-Quants/M3.2-24B-Animus-V6-Exp-GGUF

24B • Updated Jul 24 • 69

Darkhn-Quants/L3.3-70B-Animus-V6-Exp-GGUF

71B • Updated Aug 2 • 124

klusai/tf2-12b-gguf

Text Generation • 12B • Updated Aug 20 • 16