ceyda's Collections
VQA (Image captioning,QA)
Updated May 26
📊 FuseCap • Running • 33
💻 Kosmos 2 • Running on T4 • 394
🚀 Vilt Nlvr • Running • 5
⚡ Qwen VL • Sleeping • 122
🔥 LLaVA • Running on T4 • 349
👁 Fuyu Multimodal • Running on A10G • 308
📚 Chat-UniVi • Build error • 21
🚀 MoE LLaVA • Runtime error • 159
🐨 IDEFICS2 Playground • Running on Zero • 155
🐐 CuMo 7b Zero • Runtime error • 82
🐬 Chat with DeepSeek VL 7B • Running on Zero • 252
What matters when building vision-language models? • Paper • 2405.02246 • Published May 3 • 91
🌔 moondream2: a tiny vision language model • Running on Zero • 298