Edit Models filters

Apps

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

705

Full-text search

Active filters: llama.cpp

Darkhn-Quants/L3.3-70B-Animus-V6.1-Exp-GGUF

71B • Updated Jul 26 • 124

theprint/TiTan-Llama-3.2-1B-GGUF

Text Generation • 1B • Updated Jul 25 • 52

theprint/TiTan-Gemma3-4B-GGUF

Text Generation • 5B • Updated Jul 30 • 44

dynomite567/Ministral-8B-Instruct-2410-Q4_K_M-GGUF

8B • Updated Jul 26 • 6

dynomite567/Mistral-7B-Instruct-v0.3-Q4_K_M-GGUF

7B • Updated Jul 26 • 9

Darkhn-Quants/M3.2-24B-Animus-V7.1-GGUF

24B • Updated Jul 29 • 933 • 2

Darkhn-Quants/M3.2-24B-Animus-V7.0-GGUF

24B • Updated Aug 2 • 234

theprint/Pythonified-Llama-3.2-3B-Instruct-GGUF

Text Generation • 3B • Updated Jul 26 • 49

theprint/ReasonableMath-Llama-3.2-3B-Instruct-GGUF

Text Generation • 3B • Updated Aug 16 • 17

Darkhn-Quants/L3.3-70B-Animus-V7.0-GGUF

71B • Updated Aug 21 • 229 • 1

theprint/Empathetic-Llama-3.2-3B-Instruct-GGUF

Text Generation • 3B • Updated Jul 30 • 90

CrucibleLab-TG/M3.2-24B-Loki-V1.0-GGUF

24B • Updated Aug 1 • 140

Darkhn-Quants/M3.2-36B-Animus-V8.0-GGUF

35B • Updated Sep 11 • 73

CrucibleLab-TG/M3.2-24B-Loki-V1.1-1.2-1.3-GGUF

24B • Updated Aug 3 • 41

Darkhn-Quants/M3.2-36B-Animus-V8.1-GGUF

35B • Updated Aug 5 • 852

CrucibleLab-TG/M3.2-24B-Loki-V1.3-GGUF

24B • Updated Aug 23 • 1.14k • 11

CrucibleLab-TG/M3.2-24B-Loki-V1.2-GGUF

24B • Updated Aug 4 • 221 • 1

daskalos-apps/phi4-cybersec-Q4_K_M

4B • Updated Aug 6 • 119

prithivMLmods/Qwen3-4B-Thinking-2507-GGUF

Text Generation • 4B • Updated Aug 6 • 216 • 1

prithivMLmods/Qwen3-4B-Instruct-2507-GGUF

Text Generation • 4B • Updated Aug 6 • 327 • 1

TheMindExpansionNetwork/M1NDB0T-GPT-OSS-20B_GGUF

Text Generation • 21B • Updated Aug 8 • 96

kturki/qwen2.5-7B_internal_audit

Question Answering • 8B • Updated Aug 8 • 56

giladgd/gpt-oss-20b-GGUF

Text Generation • 21B • Updated 11 days ago • 1.15k • 2

giladgd/gpt-oss-120b-GGUF

Text Generation • 117B • Updated 11 days ago • 1.06k • 1

Choyrens/ChoyrensAI-Telekom-Agent-v6-gguf

8B • Updated Aug 9 • 15

giladgd/Qwen3-4B-Thinking-2507-GGUF

Text Generation • 4B • Updated Aug 9 • 271

HuggingBelto/gpt-oss-20b

Text Generation • 21B • Updated Aug 10 • 1

simraann/ExplainIt-Phi-GGUF

Text Generation • 3B • Updated Sep 22 • 25

HillPhelmuth/gpt-oss-20B-chess-analysis-GGUF

21B • Updated Aug 11 • 7

ganchito/dante-7b.gguf

Text Generation • 8B • Updated Aug 11 • 10