Hugging Face
Models: 40
Sort: Trending
Active filters: vlm
unum-cloud/uform-gen2-qwen-500m • Image-to-Text • Updated Apr 24, 2024 • 24.3k • 76
unum-cloud/uform-gen • Image-to-Text • Updated Dec 31, 2023 • 782 • 43
TIGER-Lab/Mantis-8B-Idefics2 • Image-Text-to-Text • Updated Nov 15, 2024 • 459 • 13
BUAADreamer/PaliGemma-3B-Chat-v0.2 • Image-Text-to-Text • Updated Jun 5, 2024 • 129 • 8
cyan2k/molmo-7B-O-bnb-4bit • Text Generation • Updated Sep 26, 2024 • 1.16k • 9
AnyModal/LaTeX-OCR-Llama-3.2-1B • Updated Dec 23, 2024 • 2
prithivMLmods/Blazer.1-2B-Vision • Image-Text-to-Text • Updated Jan 15 • 620 • 8
mradermacher/Blazer.1-2B-Vision-GGUF • Updated Jan 16 • 611 • 1
unum-cloud/uform-gen-chat • Visual Question Answering • Updated Dec 31, 2023 • 139 • 20
4bit/uform-gen • Image-to-Text • Updated Dec 31, 2023 • 43 • 2
unum-cloud/uform-gen2-dpo • Image-to-Text • Updated Apr 24, 2024 • 1.02k • 43
MonolithFoundation/Bumblebee • Text Generation • Updated Apr 28, 2024 • 9 • 4
sujet-ai/Lutece-Vision-Base • Image-to-Text • Updated Jul 14, 2024 • 142 • 6
TIGER-Lab/Mantis-8B-siglip-llama3 • Image-Text-to-Text • Updated Nov 15, 2024 • 15.3k • 32
TIGER-Lab/Mantis-8B-clip-llama3 • Image-Text-to-Text • Updated Nov 15, 2024 • 614 • 1
TIGER-Lab/Mantis-8B-Fuyu • Text Generation • Updated May 4, 2024 • 73 • 4
MischaQI/SNIFFER • Updated May 15, 2024 • 1
hiyouga/PaliGemma-3B-Chat-v0.1 • Image-Text-to-Text • Updated Jul 1, 2024 • 41 • 11
JosefAlbers/Phi-3-vision-128k-instruct-mlx • Updated Jun 16, 2024 • 33 • 1
AlanaAI/AlanaVLM • Updated Jul 4, 2024
amitha/mllava-baichuan2-en • Visual Question Answering • Updated Jun 19, 2024 • 13
amitha/mllava-baichuan2-zh • Visual Question Answering • Updated Jun 19, 2024 • 7
amitha/mllava-baichuan2-en-zh • Visual Question Answering • Updated Jun 19, 2024 • 8
amitha/mllava-llama2-en • Visual Question Answering • Updated Jun 19, 2024 • 12
amitha/mllava-llama2-zh • Visual Question Answering • Updated Jun 19, 2024 • 10
amitha/mllava-llama2-en-zh • Visual Question Answering • Updated Jun 19, 2024 • 12
variante/llava-1.5-7b-llara-D-inBC-VIMA-80k • Image-Text-to-Text • Updated Jul 13, 2024 • 7 • 1
variante/llava-1.5-7b-llara-D-inBC-Aux-D-VIMA-80k • Image-Text-to-Text • Updated Jul 13, 2024 • 4 • 1
variante/llara-maskrcnn • Object Detection • Updated Jul 1, 2024 • 1
variante/llava-1.5-7b-llara-D-inBC-Aux-B-VIMA-80k • Image-Text-to-Text • Updated Jul 15, 2024 • 10 • 1