Edit Models filters

Apps

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

636

Full-text search

Active filters: llama.cpp

nvjob/Mistral-24B-crack-ru

24B • Updated Feb 28 • 32 • 1

nvjob/crack-ru-v2-gguf

12B • Updated Feb 24 • 57 • 2

Alienanthony/Events-of-the-day_Calander-planner

2B • Updated Feb 18 • 17 • 6

Novaciano/ANAL.DESTRUCTION-3.2-1B-Uncensored_V2-GGUF

Text Generation • 1B • Updated Feb 18 • 17 • 1

bulieme/llama3.2_1b_2025_uncensored_v2-Q3_K_M-GGUF

Text Generation • 1B • Updated Feb 22 • 15

Azzedde/llama3.1-8b-reasoning-grpo-gguf

Text Generation • 8B • Updated Mar 3 • 23

nvjob/Girlfriend-1.5B-RU-8q-gguf

2B • Updated Mar 9 • 124 • 3

nvjob/nvjob-1.5b-ru-8q-gguf

2B • Updated Mar 11 • 1

nvjob/nvjob-3b-ru-8q-gguf

3B • Updated Mar 11 • 9

nvjob/nvjob-3b-ru-4q-gguf

3B • Updated Mar 11 • 14 • 1

Mungert/gemma-3-4b-it-gguf

Image-Text-to-Text • 4B • Updated Sep 24 • 414 • 13

Mungert/gemma-3-12b-it-gguf

Image-Text-to-Text • 12B • Updated Sep 24 • 533 • 11

dmitry7kol/nomic-v2-tuned-Q8-GGUF

nvjob/drugban-mini-gguf

2B • Updated Mar 16 • 13

mradermacher/llama3.2_1b_2025_uncensored_v2-GGUF

1B • Updated Jul 11 • 130

mradermacher/llama3.2_1b_2025_uncensored_v2-i1-GGUF

1B • Updated Jul 11 • 275 • 1

klei1/bleta-logjike-27b-gguf

27B • Updated Mar 23 • 33

AdithyaSrivastava01/RickLLM

8B • Updated Mar 23 • 2

salimlko/XML_Qwen2_5_Coder_3B_bnb_4bit_Model_V2

Text Generation • 3B • Updated Mar 26 • 11

DARELab/qwen-coder-2.5-32B-SFT-fc4eosc

33B • Updated Apr 4 • 21

Mungert/gemma-3-4b-it-qat-q4_0-GGUF

Image-Text-to-Text • 4B • Updated Sep 24 • 111 • 2

erax-ai/EraX-Translator-V1.0

Translation • 4B • Updated May 6 • 36 • 27

Deaquay/T-Rex-mini-IQ4_XS-GGUF

8B • Updated Apr 9 • 5

matrixportalx/Gemmasutra-Small-4B-v1-GGUF

Text Generation • 4B • Updated Apr 9 • 400

Deaquay/T-Rex-mini-I1-GGUF

8B • Updated Apr 9 • 20

NoirZangetsu/gemma-finetune-flutter-gguf-2

4B • Updated Apr 10 • 4

BirdieByte1024/doctor-dental-implant-LoRA-Qwen2.5-7B-Instruct-FullModel

8B • Updated Apr 12 • 30

mradermacher/EraX-Translator-V1.0-GGUF

Translation • 4B • Updated Jul 31 • 248 • 1

erax-ai/EraX-Translator-V1.0-GGUF

Translation • 4B • Updated May 6 • 305 • 9

mradermacher/EraX-Translator-V1.0-i1-GGUF

Translation • 4B • Updated Jul 11 • 209 • 1