SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 21 days ago • 129
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google 23 days ago • 65
view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 867
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated 1 day ago • 145