UI Agent Collection A collection of algorithmic agents for user interfaces/interactions and program synthesis • 236 items • Updated 1 day ago • 38
Dataset Creation Collection Spaces and utilities for creating datasets and getting them on the Hub • 3 items • Updated Nov 10, 2024 • 10
Llama 3.2 Collection This collection hosts the transformers and original repos of Llama 3.2 and Llama Guard 3 • 15 items • Updated 30 days ago • 551
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated Nov 27, 2024 • 290
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Paper • 2409.08264 • Published Sep 12, 2024 • 43
LLaVA-Video Collection Models focused on video understanding (previously known as LLaVA-NeXT-Video). • 6 items • Updated Oct 5, 2024 • 56
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated 23 days ago • 143
Vision Language Models Papers 🖼️💬📝 Collection Papers about vision-language models; the most important ones are at the top of the list. • 27 items • Updated Apr 30, 2024 • 35