Edit Models filters

Apps

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

2,117

Full-text search

Active filters: multimodal

ZJU-AI4H/Hulu-Med-7B

Image-Text-to-Text • 8B • Updated 4 days ago • 1.23k • 24

ZJU-AI4H/Hulu-Med-14B

Image-Text-to-Text • 15B • Updated 4 days ago • 332 • 23

AvitoTech/avision

Image-Text-to-Text • 7B • Updated 6 days ago • 192 • 16

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 4.1M • • 1.33k

ZJU-AI4H/Hulu-Med-32B

Image-Text-to-Text • 33B • Updated 4 days ago • 389 • 28

Qwen/Qwen3-Omni-30B-A3B-Instruct

Any-to-Any • 35B • Updated Sep 22 • 345k • 688

IDEA-Research/Rex-Omni

Image-Text-to-Text • 4B • Updated 13 days ago • 18.5k • 30

ByteDance/Dolphin

Image-Text-to-Text • 0.4B • Updated Jul 16 • 7.95k • 505

vogent/Vogent-Turn-80M

79.2M • Updated 6 days ago • 275 • 9

ByteDance/Dolphin-1.5

Image-Text-to-Text • 0.4B • Updated 12 days ago • 642 • 15

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • 4B • Updated Apr 6 • 6.53M • 543

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18 • 131k • 420

Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 245k • 1.81k

Qwen/Qwen2-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Jan 12 • 2.23M • 462

Qwen/Qwen2.5-VL-32B-Instruct

Image-Text-to-Text • 33B • Updated Apr 14 • 2M • • 459

internlm/CapRL-3B

Image-Text-to-Text • 4B • Updated 8 days ago • 3.67k • 42

Lamapi/next-12b

Image-Text-to-Text • 12B • Updated 1 day ago • 67 • 3

lmms-lab/LLaVA-Video-7B-Qwen2

Video-Text-to-Text • 8B • Updated Oct 25, 2024 • 39.5k • 114

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • 73B • Updated Jun 6 • 591k • • 555

unsloth/Qwen2.5-Omni-7B-GGUF

Any-to-Any • 8B • Updated May 28 • 11.2k • 40

lingshu-medical-mllm/Lingshu-7B

Image-Text-to-Text • 8B • Updated Sep 17 • 29.1k • 61

NCSOFT/VARCO-VISION-2.0-14B

Image-Text-to-Text • 15B • Updated Sep 15 • 860 • 42

internlm/Spark-VL-7B

Video-Text-to-Text • 8B • Updated 7 days ago • 120 • 10

utter-project/TowerVideo-9B

Video-Text-to-Text • 10B • Updated 1 day ago • 12 • 2

General-Medical-AI/UniMedVL

Any-to-Any • Updated 5 days ago • 4

aquif-ai/aquif-Dream-6B-Exp

Text-to-Video • Updated 8 days ago • 2

Cogent-ai/cogent-csp-15m

Text Generation • Updated 1 day ago • 2

remyxai/SpaceQwen3-VL-2B-Thinking

Image-Text-to-Text • 2B • Updated 6 days ago • 28 • 2

thesby/Qwen3-VL-8B-NSFW-Caption-V4

Image-to-Text • 9B • Updated 6 days ago • 796 • 6

imageomics/bioclip

Zero-Shot Image Classification • Updated 27 days ago • 6.41k • 54