arxiv:2407.02392
YuqianYuan
CircleRadon
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 14 hours ago
2.5 Years in Class: A Multimodal Textbook for Vision-Language
Pretraining
upvoted
a
paper
about 17 hours ago
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with
Video LLM
commented
a paper
about 17 hours ago
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with
Video LLM