ShowUI: One Vision-Language-Action Model for GUI Visual Agent Paper β’ 2411.17465 β’ Published about 1 month ago β’ 76
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper β’ 2412.04454 β’ Published 21 days ago β’ 50
CogAgent: A Visual Language Model for GUI Agents Paper β’ 2312.08914 β’ Published Dec 14, 2023 β’ 29