Collection of the most popular vision models including Llama 3.2, LlaVa, Qwen2 VL, Pixtral, PaliGemma and more!
-
unsloth/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text • Updated • 42.9k • 57 -
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-Text-to-Text • Updated • 36.1k • 49 -
unsloth/Qwen2-VL-7B-Instruct-bnb-4bit
Image-Text-to-Text • Updated • 24 -
unsloth/Pixtral-12B-2409-bnb-4bit
Image-Text-to-Text • Updated • 8 • 2