Two Giraffes in a Dirt Field: Using Game Play to Investigate Situation Modelling in Large Multimodal Models Paper β’ 2406.14035 β’ Published Jun 20, 2024 β’ 13
How Many Parameters Does it Take to Change a Light Bulb? Evaluating Performance in Self-Play of Conversational Games as a Function of Model Characteristics Paper β’ 2406.14051 β’ Published Jun 20, 2024 β’ 9
How Many Parameters Does it Take to Change a Light Bulb? Evaluating Performance in Self-Play of Conversational Games as a Function of Model Characteristics Paper β’ 2406.14051 β’ Published Jun 20, 2024 β’ 9
Two Giraffes in a Dirt Field: Using Game Play to Investigate Situation Modelling in Large Multimodal Models Paper β’ 2406.14035 β’ Published Jun 20, 2024 β’ 13
Two Giraffes in a Dirt Field: Using Game Play to Investigate Situation Modelling in Large Multimodal Models Paper β’ 2406.14035 β’ Published Jun 20, 2024 β’ 13
Clembench: Using Game Play to Evaluate Chat-Optimized Language Models as Conversational Agents Paper β’ 2305.13455 β’ Published May 22, 2023 β’ 3
Images in Language Space: Exploring the Suitability of Large Language Models for Vision & Language Tasks Paper β’ 2305.13782 β’ Published May 23, 2023 β’ 1
Clembench: Using Game Play to Evaluate Chat-Optimized Language Models as Conversational Agents Paper β’ 2305.13455 β’ Published May 22, 2023 β’ 3