johannhartmann's Collections
Multimodal Models
Updated 27 days ago
A collection of multimodal models for the GPU poor.
google/paligemma-3b-pt-896
Image-Text-to-Text
•
Updated
Jul 19
•
2.55k
•
111
OpenGVLab/InternVL-Chat-V1-5
Image-Text-to-Text
•
Updated
4 days ago
•
2.45k
•
404
alexshengzhili/llava-v1.5-13b-dpo
Text Generation
•
Updated
Apr 13
•
16
•
5
llava-hf/llava-v1.6-mistral-7b-hf
Image-Text-to-Text
•
Updated
30 days ago
•
243k
•
242
Qwen/Qwen-VL
Text Generation
•
Updated
Jan 25
•
17.2k
•
219
THUDM/cogvlm2-llama3-chat-19B
Text Generation
•
Updated
Sep 3
•
65.1k
•
202
BK-Lee/MoAI-7B
Image-Text-to-Text
•
Updated
Oct 2
•
402
•
45
01-ai/Yi-VL-34B
Image-Text-to-Text
•
Updated
Jun 26
•
75
•
260
mPLUG/DocOwl1.5-Omni
Updated
Apr 10
•
57
•
16
google/paligemma-3b-ft-docvqa-896
Image-Text-to-Text
•
Updated
Jul 19
•
587
•
8
Lin-Chen/open-llava-next-llama3-8b
Image-Text-to-Text
•
Updated
May 27
•
60
•
26
Mizukiluke/mplug_owl_2_1
Updated
Jan 31
•
47
•
11
HuanjinYao/DenseConnector-v1.5-8B
Image-to-Text
•
Updated
May 26
•
15
•
7
microsoft/Phi-3-vision-128k-instruct
Text Generation
•
Updated
Aug 20
•
82.2k
•
940
tiiuae/falcon-11B-vlm
Image-Text-to-Text
•
Updated
Jun 12
•
1.8k
•
46
AIDC-AI/Ovis1.5-Llama3-8B
Image-Text-to-Text
•
Updated
Aug 2
•
64
•
25
HuggingFaceM4/Idefics3-8B-Llama3
Image-Text-to-Text
•
Updated
20 days ago
•
18.8k
•
257
openbmb/MiniCPM-V-2_6
Image-Text-to-Text
•
Updated
Nov 15
•
55.1k
•
869
microsoft/Florence-2-large
Image-Text-to-Text
•
Updated
13 days ago
•
339k
•
1.29k
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
Updated
Oct 10
•
230k
•
460
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text
•
Updated
18 days ago
•
2.81M
•
1.14k
BAAI/Emu3-Gen
Any-to-Any
•
Updated
Oct 23
•
3.76k
•
194
vidore/colpali-v1.2
Updated
2 days ago
•
160k
•
96
Qwen/Qwen2-VL-2B-Instruct
Image-Text-to-Text
•
Updated
16 days ago
•
926k
•
325
deepseek-ai/Janus-1.3B
Any-to-Any
•
Updated
Nov 14
•
6.89k
•
481
NexaAIDev/OmniVLM-968M
Updated
5 days ago
•
7.45k
•
479
Xkev/Llama-3.2V-11B-cot
Image-Text-to-Text
•
Updated
6 days ago
•
13.8k
•
129
alibaba-damo/mgp-str-base
Image-to-Text
•
Updated
Dec 11, 2023
•
8.36k
•
63