arxiv:2405.14974
Hengyuan Zhao
hhenryz
AI & ML interests
Multimodal Understanding, AI Automation
Recent Activity
upvoted
a
paper
about 1 month ago
ROICtrl: Boosting Instance Control for Visual Generation
liked
a Space
about 1 month ago
showlab/ShowUI
upvoted
a
paper
about 1 month ago
ShowUI: One Vision-Language-Action Model for GUI Visual Agent