Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
1
Libraries
Datasets
Languages
Licenses
Other
Reset Tasks
Multimodal
Image-Text-to-Text
Visual Question Answering
Document Question Answering
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Models
366
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
google/paligemma-3b-ft-coco35l-224
Image-Text-to-Text
•
Updated
2 days ago
•
193
•
1
google/paligemma-3b-ft-science-qa-448
Image-Text-to-Text
•
Updated
2 days ago
•
22
•
1
google/paligemma-3b-ft-aokvqa-da-224
Image-Text-to-Text
•
Updated
2 days ago
•
628
google/paligemma-3b-ft-stvqa-224
Image-Text-to-Text
•
Updated
2 days ago
•
8
google/paligemma-3b-ft-textvqa-896
Image-Text-to-Text
•
Updated
2 days ago
•
16
•
1
google/paligemma-3b-ft-stvqa-448
Image-Text-to-Text
•
Updated
2 days ago
•
7
google/paligemma-3b-ft-widgetcap-224
Image-Text-to-Text
•
Updated
2 days ago
•
2
Zery/MV-LLaVA-7B
Image-Text-to-Text
•
Updated
25 days ago
•
34
google/paligemma-3b-ft-ai2d-448
Image-Text-to-Text
•
Updated
2 days ago
•
46
mlx-community/paligemma-3b-mix-224-8bit
Image-Text-to-Text
•
Updated
May 15
•
21
•
1
tinyllava/TinyLLaVA-Gemma-SigLIP-2.4B
Image-Text-to-Text
•
Updated
May 18
•
123
gokaygokay/paligemma-docci-transformers
Image-Text-to-Text
•
Updated
May 16
•
379
•
1
leo009/paligemma-3b-pt-224
Image-Text-to-Text
•
Updated
May 18
•
133
leo009/paligemma-3b-mix-224
Image-Text-to-Text
•
Updated
May 17
•
180
•
1
RichardLuo/Shotluck-Holmes-1.5
Image-Text-to-Text
•
Updated
May 18
•
4
•
2
Xenova/tiny-random-PaliGemmaForConditionalGeneration
Image-Text-to-Text
•
Updated
May 19
•
7
firqaaa/vsft-llava-1.5-7b-hf-liveness
Image-Text-to-Text
•
Updated
May 22
•
121
•
1
stanrom/ShareGPT4V-7B
Image-Text-to-Text
•
Updated
May 20
•
4
aloobun/F18
Image-Text-to-Text
•
Updated
May 20
ayoubkirouane/llava-phi3-instruct-Lora
Image-Text-to-Text
•
Updated
May 22
•
2
Lin-Chen/open-llava-next-vicuna-7b
Image-Text-to-Text
•
Updated
May 27
•
57
•
1
ayoubkirouane/Idefics2-8b-Finetuned-Lora
Image-Text-to-Text
•
Updated
16 days ago
abhi-8/Age-gender-predictor
Image-Text-to-Text
•
Updated
May 23
•
1
lamm-mit/Cephalo-Idefics-2-vision-8b-alpha
Image-Text-to-Text
•
Updated
30 days ago
•
2
Reverb/Idefics2-8b-docVQA-finetuned
Image-Text-to-Text
•
Updated
May 25
•
2
lamm-mit/Cephalo-Phi-3-vision-128k-4b-beta
Image-Text-to-Text
•
Updated
27 days ago
•
177
LINs-lab/DynMoE-StableLM-1.6B
Image-Text-to-Text
•
Updated
29 days ago
•
29
•
2
LINs-lab/DynMoE-Qwen-1.8B
Image-Text-to-Text
•
Updated
29 days ago
•
3
•
1
LINs-lab/DynMoE-Phi-2-2.7B
Image-Text-to-Text
•
Updated
29 days ago
•
1
•
1
ucsahin/paligemma-3b-mix-448-ft-TableDetection
Image-Text-to-Text
•
Updated
May 26
•
65
•
3
Previous
1
...
9
10
11
12
13
Next