Models: 555

Active filters: multimodal

unsloth/Qwen2-VL-7B-Instruct-bnb-4bit

Image-Text-to-Text • Updated Nov 22, 2024 • 1.44k • 5

unsloth/Pixtral-12B-Base-2409

Image-Text-to-Text • Updated Nov 21, 2024 • 25 • 1

unsloth/Pixtral-12B-2409

Image-Text-to-Text • Updated Nov 21, 2024 • 1.03k • 3

unsloth/Pixtral-12B-Base-2409-bnb-4bit

Image-Text-to-Text • Updated Nov 21, 2024 • 278 • 1

unsloth/Pixtral-12B-2409-bnb-4bit

Image-Text-to-Text • Updated Nov 21, 2024 • 1.08k • 3

unsloth/Qwen2-VL-72B-Instruct

Image-Text-to-Text • Updated Nov 22, 2024 • 122 • 1

unsloth/Qwen2-VL-72B-Instruct-bnb-4bit

Image-Text-to-Text • Updated Nov 22, 2024 • 790 • 3

unsloth/llava-1.5-7b-hf

Image-Text-to-Text • Updated Nov 22, 2024 • 43 • 1

unsloth/llava-v1.6-mistral-7b-hf

Image-Text-to-Text • Updated Nov 21, 2024 • 221 • 1

NCSOFT/VARCO-VISION-14B

Image-Text-to-Text • Updated Dec 31, 2024 • 648 • 22

NCSOFT/VARCO-VISION-14B-HF

Image-Text-to-Text • Updated Dec 31, 2024 • 1.54k • 22

Flex-Data/bm-v1

Audio-Text-to-Text • Updated Dec 4, 2024 • 2

unsloth/Qwen2-VL-2B-Instruct-unsloth-bnb-4bit

Image-Text-to-Text • Updated Dec 4, 2024 • 26.8k • 5

unsloth/Qwen2-VL-7B-Instruct-unsloth-bnb-4bit

Image-Text-to-Text • Updated Dec 4, 2024 • 49.1k • 9

CogACT/CogACT-Large

Robotics • Updated Dec 4, 2024 • 531 • 1

rhymes-ai/Aria-Base-64K

Image-Text-to-Text • Updated Dec 1, 2024 • 741 • 12

rhymes-ai/Aria-Chat

Image-Text-to-Text • Updated Dec 15, 2024 • 100 • 10

AnyModal/LaTeX-OCR-Llama-3.2-1B

Updated Dec 23, 2024 • 2

Qwen/Qwen2-VL-72B

Image-Text-to-Text • Updated Dec 6, 2024 • 2.21k • 71

unsloth/Pixtral-12B-2409-unsloth-bnb-4bit

Image-Text-to-Text • Updated Dec 4, 2024 • 5.05k • 5

unsloth/Llama-3.2-11B-Vision-unsloth-bnb-4bit

Image-Text-to-Text • Updated Dec 4, 2024 • 1.33k • 3

AI-Safeguard/Ivy-VL-llava

Visual Question Answering • Updated Dec 31, 2024 • 955 • 59

bartowski/Qwen2-VL-2B-Instruct-GGUF

Image-Text-to-Text • Updated Dec 17, 2024 • 5.48k • 20

lmstudio-community/Qwen2-VL-7B-Instruct-GGUF

Image-Text-to-Text • Updated Jan 6 • 11.6k • 2

bartowski/Qwen2-VL-7B-Instruct-GGUF

Image-Text-to-Text • Updated Dec 17, 2024 • 22.3k • 34

bartowski/Qwen2-VL-72B-Instruct-GGUF

Image-Text-to-Text • Updated Dec 18, 2024 • 4.31k • 11

second-state/Qwen2-VL-7B-Instruct-GGUF

Image-Text-to-Text • Updated Jan 11 • 614 • 3

mradermacher/Qwen2-VL-72B-Instruct-abliterated-i1-GGUF

Updated Dec 15, 2024 • 262 • 1

GoodiesHere/Apollo-LMMs-Apollo-1_5B-t32

Video-Text-to-Text • Updated Dec 18, 2024 • 126 • 9

GoodiesHere/Apollo-LMMs-Apollo-3B-t32

Text Generation • Updated Dec 18, 2024 • 124 • 17