Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
carlizor
's Collections
Flux
Image restoration
3D Generation
LLM
Embedding
LLM - Small
Video vision
To Read
Video
Image Segmentation
Image Generation (Fast)
Image Depth
Image caption
Audio
Image Generation
Image that talks
Image Enhance
Image Vision
Image editing
Image upscaling
Face Recognition
Multimodal
LLM - Medium
Image Vision
updated
1 day ago
Upvote
-
Salesforce/xgen-mm-phi3-mini-instruct-r-v1
Image-Text-to-Text
•
Updated
Sep 18, 2024
•
1.24k
•
185
AIDC-AI/Ovis1.6-Gemma2-9B
Image-Text-to-Text
•
Updated
Nov 27, 2024
•
3.6k
•
265
nvidia/NVLM-D-72B
Image-Text-to-Text
•
Updated
3 days ago
•
6.29k
•
764
microsoft/OmniParser
Image-Text-to-Text
•
Updated
Dec 2, 2024
•
1.17k
•
1.52k
deepseek-ai/Janus-1.3B
Any-to-Any
•
Updated
Nov 14, 2024
•
9.23k
•
498
deepseek-ai/JanusFlow-1.3B
Any-to-Any
•
Updated
Nov 18, 2024
•
2.76k
•
80
NexaAIDev/OmniVLM-968M
Updated
25 days ago
•
1.68k
•
493
vikhyatk/moondream2
Image-Text-to-Text
•
Updated
1 day ago
•
115k
•
845
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
•
Updated
Sep 18, 2024
•
719k
•
1.32k
jiuhai/florence-vl-8b-sft
Updated
Dec 3, 2024
•
101
•
18
AI-Safeguard/Ivy-VL-llava
Visual Question Answering
•
Updated
11 days ago
•
3.34k
•
56
OpenGVLab/InternVL2_5-78B
Image-Text-to-Text
•
Updated
23 days ago
•
4.61k
•
159
Qwen/QVQ-72B-Preview
Image-Text-to-Text
•
Updated
17 days ago
•
86.9k
•
480
deepseek-ai/deepseek-vl2
Image-Text-to-Text
•
Updated
23 days ago
•
2.15k
•
128
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
Updated
Oct 10, 2024
•
317k
•
488
prithivMLmods/Qwen2-VL-OCR-2B-Instruct
Image-Text-to-Text
•
Updated
7 days ago
•
3.08k
•
25
ByteDance/Sa2VA-1B
Image-Text-to-Text
•
Updated
about 10 hours ago
•
123
•
10
Upvote
-
Share collection
View history
Collection guide
Browse collections