Edit Models filters

Apps

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

2,110

Full-text search

Active filters: multimodal

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 4.05M • • 1.33k

ZJU-AI4H/Hulu-Med-14B

Image-Text-to-Text • 15B • Updated 4 days ago • 330 • 22

AvitoTech/avision

Image-Text-to-Text • 7B • Updated 6 days ago • 77 • 15

ZJU-AI4H/Hulu-Med-7B

Image-Text-to-Text • 8B • Updated 4 days ago • 1.17k • 22

Qwen/Qwen3-Omni-30B-A3B-Instruct

Any-to-Any • 35B • Updated Sep 22 • 364k • 687

ZJU-AI4H/Hulu-Med-32B

Image-Text-to-Text • 33B • Updated 4 days ago • 366 • 27

IDEA-Research/Rex-Omni

Image-Text-to-Text • 4B • Updated 12 days ago • 17.8k • 29

ByteDance/Dolphin

Image-Text-to-Text • 0.4B • Updated Jul 16 • 8.48k • 505

vogent/Vogent-Turn-80M

79.2M • Updated 5 days ago • 213 • 9

ByteDance/Dolphin-1.5

Image-Text-to-Text • 0.4B • Updated 11 days ago • 583 • 14

Qwen/Qwen2-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Jan 12 • 2.21M • 462

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • 4B • Updated Apr 6 • 6.56M • 541

Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 245k • 1.81k

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18 • 130k • 419

Qwen/Qwen2.5-VL-32B-Instruct

Image-Text-to-Text • 33B • Updated Apr 14 • 2M • • 459

General-Medical-AI/UniMedVL

Any-to-Any • Updated 4 days ago • 4

racineai/QwenAmann-4B-dse

Visual Document Retrieval • 4B • Updated 9 days ago • 162 • 14

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • 73B • Updated Jun 6 • 602k • • 555

unsloth/Qwen2.5-Omni-7B-GGUF

Any-to-Any • 8B • Updated May 28 • 11.1k • 40

lingshu-medical-mllm/Lingshu-7B

Image-Text-to-Text • 8B • Updated Sep 17 • 28.9k • 61

NCSOFT/VARCO-VISION-2.0-14B

Image-Text-to-Text • 15B • Updated Sep 15 • 842 • 42

internlm/CapRL-3B

Image-Text-to-Text • 4B • Updated 7 days ago • 3.81k • 41

internlm/Spark-VL-7B

Video-Text-to-Text • 8B • Updated 6 days ago • 141 • 10

utter-project/TowerVideo-9B

Video-Text-to-Text • 10B • Updated about 4 hours ago • 4 • 2

aquif-ai/aquif-Dream-6B-Exp

Text-to-Video • Updated 7 days ago • 2

Cogent-ai/cogent-csp-15m

Text Generation • Updated about 10 hours ago • 2

remyxai/SpaceQwen3-VL-2B-Thinking

Image-Text-to-Text • 2B • Updated 5 days ago • 28 • 2

thesby/Qwen3-VL-8B-NSFW-Caption-V4

Image-to-Text • 9B • Updated 5 days ago • 552 • 6

imageomics/bioclip

Zero-Shot Image Classification • Updated 26 days ago • 5.27k • 54

openvla/openvla-7b

Image-Text-to-Text • 8B • Updated Sep 16, 2024 • 702k • 148