Hugging Face
Models: 555
Sort: Trending
Active filters: multimodal
unsloth/Qwen2-VL-7B-Instruct-bnb-4bit • Image-Text-to-Text • Updated Nov 22, 2024 • 1.44k • 5
unsloth/Pixtral-12B-Base-2409 • Image-Text-to-Text • Updated Nov 21, 2024 • 25 • 1
unsloth/Pixtral-12B-2409 • Image-Text-to-Text • Updated Nov 21, 2024 • 1.03k • 3
unsloth/Pixtral-12B-Base-2409-bnb-4bit • Image-Text-to-Text • Updated Nov 21, 2024 • 278 • 1
unsloth/Pixtral-12B-2409-bnb-4bit • Image-Text-to-Text • Updated Nov 21, 2024 • 1.08k • 3
unsloth/Qwen2-VL-72B-Instruct • Image-Text-to-Text • Updated Nov 22, 2024 • 122 • 1
unsloth/Qwen2-VL-72B-Instruct-bnb-4bit • Image-Text-to-Text • Updated Nov 22, 2024 • 790 • 3
unsloth/llava-1.5-7b-hf • Image-Text-to-Text • Updated Nov 22, 2024 • 43 • 1
unsloth/llava-v1.6-mistral-7b-hf • Image-Text-to-Text • Updated Nov 21, 2024 • 221 • 1
NCSOFT/VARCO-VISION-14B • Image-Text-to-Text • Updated Dec 31, 2024 • 648 • 22
NCSOFT/VARCO-VISION-14B-HF • Image-Text-to-Text • Updated Dec 31, 2024 • 1.54k • 22
Flex-Data/bm-v1 • Audio-Text-to-Text • Updated Dec 4, 2024 • 2
unsloth/Qwen2-VL-2B-Instruct-unsloth-bnb-4bit • Image-Text-to-Text • Updated Dec 4, 2024 • 26.8k • 5
unsloth/Qwen2-VL-7B-Instruct-unsloth-bnb-4bit • Image-Text-to-Text • Updated Dec 4, 2024 • 49.1k • 9
CogACT/CogACT-Large • Robotics • Updated Dec 4, 2024 • 531 • 1
rhymes-ai/Aria-Base-64K • Image-Text-to-Text • Updated Dec 1, 2024 • 741 • 12
rhymes-ai/Aria-Chat • Image-Text-to-Text • Updated Dec 15, 2024 • 100 • 10
AnyModal/LaTeX-OCR-Llama-3.2-1B • Updated Dec 23, 2024 • 2
Qwen/Qwen2-VL-72B • Image-Text-to-Text • Updated Dec 6, 2024 • 2.21k • 71
unsloth/Pixtral-12B-2409-unsloth-bnb-4bit • Image-Text-to-Text • Updated Dec 4, 2024 • 5.05k • 5
unsloth/Llama-3.2-11B-Vision-unsloth-bnb-4bit • Image-Text-to-Text • Updated Dec 4, 2024 • 1.33k • 3
AI-Safeguard/Ivy-VL-llava • Visual Question Answering • Updated Dec 31, 2024 • 955 • 59
bartowski/Qwen2-VL-2B-Instruct-GGUF • Image-Text-to-Text • Updated Dec 17, 2024 • 5.48k • 20
lmstudio-community/Qwen2-VL-7B-Instruct-GGUF • Image-Text-to-Text • Updated Jan 6 • 11.6k • 2
bartowski/Qwen2-VL-7B-Instruct-GGUF • Image-Text-to-Text • Updated Dec 17, 2024 • 22.3k • 34
bartowski/Qwen2-VL-72B-Instruct-GGUF • Image-Text-to-Text • Updated Dec 18, 2024 • 4.31k • 11
second-state/Qwen2-VL-7B-Instruct-GGUF • Image-Text-to-Text • Updated Jan 11 • 614 • 3
mradermacher/Qwen2-VL-72B-Instruct-abliterated-i1-GGUF • Updated Dec 15, 2024 • 262 • 1
GoodiesHere/Apollo-LMMs-Apollo-1_5B-t32 • Video-Text-to-Text • Updated Dec 18, 2024 • 126 • 9
GoodiesHere/Apollo-LMMs-Apollo-3B-t32 • Text Generation • Updated Dec 18, 2024 • 124 • 17