Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
multimodal
Inference Endpoints
AutoTrain Compatible
custom_code
text-generation-inference
4-bit precision
Merge
Eval Results
8-bit precision
Mixture of Experts
Misc with no match
text-embeddings-inference
Carbon Emissions
Apply filters
Models
279
Full-text search
Edit filters
Sort: Trending
Active filters:
multimodal
Clear all
NexaAIDev/omnivision-968M
Updated
1 day ago
•
8.9k
•
400
jinaai/jina-clip-v2
Feature Extraction
•
Updated
3 days ago
•
1.99k
•
55
Qwen/Qwen2-VL-7B-Instruct
Image-Text-to-Text
•
Updated
Sep 21
•
1.74M
•
•
843
Qwen/Qwen2-VL-2B-Instruct
Image-Text-to-Text
•
Updated
Sep 21
•
981k
•
283
Qwen/Qwen2-VL-72B-Instruct
Image-Text-to-Text
•
Updated
Sep 21
•
78.4k
•
171
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-Text-to-Text
•
Updated
4 days ago
•
39.9k
•
52
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
Updated
Oct 10
•
70.5k
•
441
robotics-diffusion-transformer/rdt-1b
Robotics
•
Updated
Oct 17
•
3.47k
•
40
allenai/Molmo-72B-0924
Image-Text-to-Text
•
Updated
Oct 10
•
6.55k
•
264
qnguyen3/nanoLLaVA
Text Generation
•
Updated
29 days ago
•
28.6k
•
148
Qwen/Qwen2-VL-72B-Instruct-AWQ
Image-Text-to-Text
•
Updated
Sep 25
•
18.9k
•
36
lmms-lab/llava-onevision-qwen2-7b-ov
Text Generation
•
Updated
Sep 2
•
88.5k
•
39
allenai/MolmoE-1B-0924
Image-Text-to-Text
•
Updated
Oct 10
•
9.65k
•
132
allenai/Molmo-7B-O-0924
Image-Text-to-Text
•
Updated
10 days ago
•
32.4k
•
142
unsloth/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text
•
Updated
4 days ago
•
42.6k
•
59
rhymes-ai/Aria
Image-Text-to-Text
•
Updated
13 days ago
•
15.9k
•
585
BAAI/Aquila-VL-2B-llava-qwen
Visual Question Answering
•
Updated
about 4 hours ago
•
3.28k
•
47
mlx-community/Molmo-7B-D-0924-4bit
Image-Text-to-Text
•
Updated
4 days ago
•
72
•
3
marcosv/InstructIR
Image-to-Image
•
Updated
Jan 31
•
29
lmms-lab/llava-onevision-qwen2-0.5b-si
Text Generation
•
Updated
Sep 2
•
41.5k
•
11
lmms-lab/llava-onevision-qwen2-0.5b-ov
Text Generation
•
Updated
Sep 2
•
23.9k
•
15
Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int4
Image-Text-to-Text
•
Updated
Sep 21
•
14k
•
17
lmms-lab/LLaVA-Video-7B-Qwen2
Video-Text-to-Text
•
Updated
Oct 25
•
40.8k
•
36
ibm/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-BACE-101
Updated
24 days ago
•
114
•
2
ibm/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-MUV-101
Updated
24 days ago
•
81
•
2
ibm/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-SIDER-101
Updated
24 days ago
•
90
•
2
Cylingo/XinYuan-VL-2B-GGUF
Image-Text-to-Text
•
Updated
2 days ago
•
100
•
2
unsloth/Pixtral-12B-2409-bnb-4bit
Image-Text-to-Text
•
Updated
4 days ago
•
595
•
2
LABahasa/llama-labahasa-11B
Updated
3 days ago
•
48
•
2
imageomics/bioclip
Zero-Shot Image Classification
•
Updated
May 17
•
9.67k
•
42
Previous
1
2
3
...
10
Next