arxiv:2410.16198
Haotian Zhang
haotiz
AI & ML interests
Vision and Language
Recent Activity
upvoted
a
paper
12 days ago
STIV: Scalable Text and Image Conditioned Video Generation
authored
a paper
about 2 months ago
Improve Vision Language Model Chain-of-thought Reasoning
upvoted
a
paper
2 months ago
Improve Vision Language Model Chain-of-thought Reasoning
Organizations
Papers
13
spaces
1
models
None public yet
datasets
None public yet