Edit Models filters

Apps

Inference Providers

HF Inference API

Misc

arxiv: 2504.03624

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

23

Full-text search

Active filters: 2504.03624

nvidia/NVIDIA-Nemotron-Nano-12B-v2

Text Generation • 12B • Updated 14 days ago • 78.2k • • 113

nvidia/NVIDIA-Nemotron-Nano-9B-v2

Text Generation • 9B • Updated 14 days ago • 230k • 417

nvidia/NVIDIA-Nemotron-Nano-9B-v2-NVFP4

Text Generation • 6B • Updated 14 days ago • 1.55k • 4

nvidia/Nemotron-H-8B-Base-8K

Text Generation • 8B • Updated Aug 21 • 17.2k • 52

nvidia/Nemotron-H-47B-Base-8K

Text Generation • 47B • Updated Aug 21 • 1.76k • 21

nvidia/Nemotron-H-56B-Base-8K

Text Generation • 56B • Updated Aug 21 • 6.41k • 32

nvidia/Nemotron-H-47B-Reasoning-128K

Text Generation • 47B • Updated Jul 11 • 966 • 18

nvidia/Nemotron-H-47B-Reasoning-128K-FP8

Text Generation • 47B • Updated Aug 21 • 127 • 5

nvidia/Nemotron-H-8B-Reasoning-128K

Text Generation • 8B • Updated Jul 11 • 1.34k • 23

nvidia/Nemotron-H-8B-Reasoning-128K-FP8

Text Generation • 8B • Updated Aug 21 • 70 • 12

dominguesm/NVIDIA-Nemotron-Nano-9B-v2-GGUF

Text Generation • 9B • Updated Aug 30 • 512 • 1

gabriellarson/NVIDIA-Nemotron-Nano-12B-v2-GGUF

Text Generation • 12B • Updated Aug 30 • 269

QuantFactory/NVIDIA-Nemotron-Nano-9B-v2-GGUF

Text Generation • Updated Aug 30 • 2.44k • 4

QuantFactory/NVIDIA-Nemotron-Nano-12B-v2-GGUF

Text Generation • Updated Aug 30 • 146 • 2

GGUF-A-Lot/NVIDIA-Nemotron-Nano-9B-v2-GGUF

9B • Updated Sep 1 • 129

GGUF-A-Lot/NVIDIA-Nemotron-Nano-12B-v2-GGUF

12B • Updated Sep 1 • 12

cpatonn/NVIDIA-Nemotron-Nano-9B-v2-AWQ-4bit

Text Generation • 2B • Updated Aug 31 • 1.91k • 2

cpatonn/NVIDIA-Nemotron-Nano-12B-v2-AWQ-4bit

Text Generation • 3B • Updated Sep 13 • 2.87k • 2

cpatonn/NVIDIA-Nemotron-Nano-12B-v2-AWQ-8bit

Text Generation • 4B • Updated Sep 13 • 65 • 1

cpatonn/NVIDIA-Nemotron-Nano-9B-v2-AWQ-8bit

Text Generation • 3B • Updated Aug 31 • 222

Mungert/NVIDIA-Nemotron-Nano-12B-v2-GGUF

Text Generation • 12B • Updated Sep 24 • 1.21k • 2

unsloth/NVIDIA-Nemotron-Nano-9B-v2

Text Generation • 9B • Updated Sep 10 • 100

nvidia/NVIDIA-Nemotron-Nano-9B-v2-FP8

Text Generation • 9B • Updated 14 days ago • 7.11k • 3