Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

Full-text search

Active filters: text-generation-inference

EssentialAI/rnj-1-instruct

Text Generation • 8B • Updated about 12 hours ago • 451k • • 260

FutureMa/Qwen3-8B-Drama-Thinking

Text Generation • 308k • Updated 2 days ago • 1.29k • 75

Nanbeige/Nanbeige4-3B-Thinking-2511

Text Generation • 4B • Updated 4 days ago • 1.84k • 82

nvidia/Nemotron-Orchestrator-8B

Text Generation • 8B • Updated 14 days ago • 38.7k • 451

ByteDance/Dolphin-v2

Image-Text-to-Text • 4B • Updated 4 days ago • 303 • 67

meta-llama/Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Sep 25, 2024 • 8.78M • • 5.14k

open-thoughts/OpenThinker-Agent-v1

Text Generation • 8B • Updated 10 days ago • 1.03k • 84

EssentialAI/rnj-1

Text Generation • 8B • Updated about 12 hours ago • 140k • 82

Qwen/Qwen3-0.6B

Text Generation • 0.8B • Updated Jul 26 • 7.6M • • 883

Qwen/Qwen3-4B-Instruct-2507

Text Generation • 4B • Updated Sep 17 • 4.78M • • 562

Motif-Technologies/Motif-2-12.7B-Reasoning

Text Generation • 13B • Updated 4 days ago • 356 • 30

cerebras/DeepSeek-V3.2-REAP-345B-A37B

Text Generation • 345B • Updated 7 days ago • 757 • 25

nn-tech/MetalGPT-1

Text Generation • 33B • Updated 6 days ago • 12.2k • 27

google/gemma-3-4b-it

Image-Text-to-Text • 4B • Updated Mar 21 • 973k • 1.04k

Nanbeige/Nanbeige4-3B-Base

Text Generation • 4B • Updated 3 days ago • 321 • 23

Cognitive-Lab/NetraEmbed

Visual Document Retrieval • 4B • Updated 6 days ago • 588 • 22

meta-llama/Llama-3.2-1B-Instruct

Text Generation • 1B • Updated Oct 24, 2024 • 3.42M • • 1.21k

meta-llama/Llama-3.1-8B

Text Generation • 8B • Updated Oct 16, 2024 • 710k • • 1.97k

microsoft/Fara-7B

Image-Text-to-Text • 8B • Updated 4 days ago • 97.8k • 442

Qwen/Qwen2.5-7B-Instruct

Text Generation • 8B • Updated Jan 12 • 6.96M • • 948

Qwen/Qwen3-8B

Text Generation • 8B • Updated Jul 26 • 4.19M • • 808

maya-research/maya1

Text-to-Speech • 3B • Updated Nov 12 • 74.3k • • 821

meta-llama/Llama-3.2-3B-Instruct

Text Generation • 3B • Updated Oct 24, 2024 • 1.65M • • 1.87k

google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21 • 1.69M • • 1.74k

Qwen/Qwen3-Embedding-0.6B

Feature Extraction • 0.6B • Updated Jun 20 • 3.68M • • 781

OctoMed/OctoMed-7B

Image-Text-to-Text • 8B • Updated 9 days ago • 1.04k • 16

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 3.07M • • 1.39k

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18 • 327k • 455

dphn/Dolphin-Mistral-24B-Venice-Edition

Text Generation • 24B • Updated Sep 8 • 8.8k • • 337

cerebras/DeepSeek-V3.2-REAP-508B-A37B

Text Generation • 508B • Updated 7 days ago • 228 • 13