One Vision-Language-Action Model for GUI Agent
Qinghong (Kevin) Lin PRO
KevinQHLin
AI & ML interests
Vision-Language Model, Video Understanding, Human-AI Interaction
Recent Activity
updated
a dataset
5 days ago
KevinQHLin/showui_traj
published
a dataset
5 days ago
KevinQHLin/showui_traj
upvoted
a
paper
9 days ago
WorldGUI: Dynamic Testing for Comprehensive Desktop GUI Automation