Two Giraffes in a Dirt Field: Using Game Play to Investigate Situation Modelling in Large Multimodal Models Paper • 2406.14035 • Published Jun 20, 2024 • 12
Qwen2-VL Collection Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 188
Vision Language Leaderboards Collection This collection has all the vision language leaderboards. • 7 items • Updated Aug 24, 2024 • 15