Large Language Model-Brained GUI Agents: A Survey Paper • 2411.18279 • Published Nov 27, 2024 • 29
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs Paper • 2404.05719 • Published Apr 8, 2024 • 83
ShowUI: One Vision-Language-Action Model for GUI Visual Agent Paper • 2411.17465 • Published Nov 26, 2024 • 78
UI Agent Collection a collection of algorithmic agents for user interfaces/interactions and program synthesis • 239 items • Updated 12 minutes ago • 41