Edit Models filters

Apps

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

762

Full-text search

Active filters: llama.cpp

codebasic/Qwen3-8B-GGUF

8B • Updated Aug 12 • 38

theprint/TiTan-Gemma3-1B-GGUF

Text Generation • 1B • Updated Aug 12 • 36

nhay103/gpt-oss-20b-GGUF

Text Generation • 21B • Updated Aug 13 • 33

theprint/TiTan-Qwen2.5-0.5B-GGUF

Text Generation • 0.5B • Updated Aug 14 • 55 • 3

sharadsnaik/medgemma-4b-it-medical-gguf

Image-Text-to-Text • 4B • Updated Aug 16 • 13 • 1

theprint/TiTan-Gemma3-0.27B-GGUF

Text Generation • 0.4B • Updated Aug 16 • 46 • 1

LiquidAI/LFM2-VL-1.6B-GGUF

Image-Text-to-Text • 1B • Updated Aug 18 • 9.84k • 50

LiquidAI/LFM2-VL-450M-GGUF

Image-Text-to-Text • 0.4B • Updated Aug 18 • 2.13k • 33

theprint/Genuine-7B-Instruct-GGUF

Text Generation • 8B • Updated Aug 18 • 21

JonathanMiddleton/Qwen3-Reranker-0.6B

Text Ranking • 0.6B • Updated Aug 19 • 142 • 1

Durlabh/gemma-270m-q4-k-m-gguf

Text Generation • 0.3B • Updated Aug 20 • 58

Zoont/InternVL3-2B-4-Bit-GGUF-with-mmproj

Image-Text-to-Text • 2B • Updated Aug 20 • 123

Lutifya/gpt-oss-20b-gguf

21B • Updated Aug 21 • 29

Lutifya/gpt-oss-20b-q5_0

Text Generation • 21B • Updated Aug 23 • 5

pankajligade/tinyllama-irac-gguf

Text Generation • 1B • Updated Aug 24 • 4

theprint/Genuine-Zeth-4B-GGUF

Text Generation • 5B • Updated Aug 24 • 19

Arivukkarasu/Mistral-7B-v0.3-GGUF

7B • Updated Aug 26 • 24

remiai3/gpt_oss_20b_GGUF_project_guide

Text Generation • Updated Aug 29

satviksrivas7/gemma-3-270m-it-gguf

Text Generation • 0.3B • Updated Aug 28 • 4

weathermanj/NVIDIA-Nemotron-Nano-9B-v2-gguf

Text Generation • 9B • Updated Aug 29 • 296 • 1

sultanali338/qwen1.5-1.8b-gguf

Text Generation • 2B • Updated Aug 30 • 20

mirkulix/OrbitUltra-Perfect-Q2K-GGUF

Text Generation • 8B • Updated Sep 1 • 6

gpsworld/gpsworld

Text Generation • 3B • Updated Sep 2

Krish356/qwen3-coder-tailwind-css-v4-merged-80percent-Q8_0-GGUF

31B • Updated Sep 3 • 24

Loni415/Augmentoolkit-DataSpecialist-v0.1-GGUF

Text Generation • 7B • Updated Sep 4 • 1

swasti1234/physicsFineTuneGGUF

4B • Updated 26 days ago • 218 • 1

bobchenyx/gpt-oss-120b-GGUF

Text Generation • 117B • Updated Sep 5 • 99

bobchenyx/gpt-oss-20b-GGUF

Text Generation • 21B • Updated Sep 5 • 28

SutanRifkyt/komodo7b-sunda-lemess-gguf

Text Generation • 7B • Updated Sep 6 • 10

Darkhn-Quants/L3.3-70B-Animus-V11.0-GGUF

71B • Updated 14 days ago • 92