Models
5,638
Sort: Trending
Active filters:
image-text-to-text
AIDC-AI/Ovis1.6-Llama3.2-3B • Image-Text-to-Text • Updated 21 days ago • 2.69k • 45
rhymes-ai/Aria-sequential_mlp • Image-Text-to-Text • Updated 26 days ago • 180 • 15
mlx-community/Llama-3.2-11B-Vision-Instruct-4bit • Image-Text-to-Text • Updated Oct 18 • 673 • 4
mlx-community/Llama-3.2-11B-Vision-Instruct-8bit • Image-Text-to-Text • Updated Oct 18 • 897 • 9
thwin27/Aria-sequential_mlp-FP8-dynamic • Image-Text-to-Text • Updated Oct 23 • 86 • 5
PULSE-ECG/PULSE-7B • Image-Text-to-Text • Updated Oct 28 • 631 • 8
Kadins/Llama-3.2-Vision-chinese-lora • Image-Text-to-Text • Updated 8 days ago • 52 • 3
thwin27/Aria-sequential_mlp-bnb_nf4 • Image-Text-to-Text • Updated Oct 23 • 277 • 10
llm-jp/llm-jp-3-vila-14b • Image-Text-to-Text • Updated about 1 month ago • 1.81k • 6
yuchenxie/ArlowGPT-VL-OCR • Image-Text-to-Text • Updated Nov 17 • 8 • 1
AIDC-AI/Ovis1.6-Llama3.2-3B-GPTQ-Int4 • Image-Text-to-Text • Updated Nov 11 • 214 • 4
shikiw/LLaVA-v1.5-MoCa-7B-pretrain • Image-Text-to-Text • Updated Oct 28 • 67 • 1
shikiw/LLaVA-v1.5-MoCa-7B • Image-Text-to-Text • Updated Oct 28 • 56 • 2
yifeihu/Florence-2-DocLayNet-Fixed • Image-Text-to-Text • Updated Oct 29 • 137 • 8
calcuis/llava-gguf • Image-Text-to-Text • Updated Nov 2 • 622 • 1
Vikhrmodels/Vikhr-2-VL-2b-Instruct-experimental • Image-Text-to-Text • Updated Nov 3 • 295 • 14
rafox2005/Ovis1.6-Llama3.2-3B-GGUF • Image-Text-to-Text • Updated Nov 2 • 1
pdufour/Qwen2-VL-7B-Instruct-onnx • Image-Text-to-Text • Updated 29 days ago • 53 • 2
OS-Copilot/OS-Atlas-Base-7B • Image-Text-to-Text • Updated 29 days ago • 4.59k • 23
OS-Copilot/OS-Atlas-Base-4B • Image-Text-to-Text • Updated 29 days ago • 691 • 5
Aranya31/llava-pruned • Image-Text-to-Text • Updated Nov 5 • 11 • 1
c01zaut/MiniCPM-V-2_6-rk3588-1.1.4 • Image-Text-to-Text • Updated 3 days ago • 16 • 2
OpenGVLab/InternVL2-8B-MPO • Image-Text-to-Text • Updated about 1 month ago • 3.37k • 22
Infinirc/Llama-3.2-Infinirc-11B-Vision-Instruct • Image-Text-to-Text • Updated 6 days ago • 44 • 3
spow12/Pixtral-12b-korean-preview • Image-Text-to-Text • Updated Nov 13 • 34 • 2
eltorio/IDEFICS3_ROCO • Image-Text-to-Text • Updated Nov 14 • 132 • 9
BAAI/Aquila-VL-2B-Intermediate • Image-Text-to-Text • Updated 29 days ago • 2
pdsdpo/PDS-DPO-7B • Image-Text-to-Text • Updated 1 day ago • 16 • 1
OS-Copilot/OS-Atlas-Pro-7B • Image-Text-to-Text • Updated 29 days ago • 621 • 9
OS-Copilot/OS-Atlas-Pro-4B • Image-Text-to-Text • Updated 29 days ago • 72 • 1