arxiv:2412.07589
Xiangtai Li
LXT
AI & ML interests
Computer Vision, Multi-Modal Understanding, Generative AI
Recent Activity
liked
a dataset
4 days ago
zhangtao-whu/OMG-LLaVA
upvoted
a
paper
9 days ago
Multimodal Latent Language Modeling with Next-Token Diffusion
commented
a paper
11 days ago
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for
Customized Manga Generation
Organizations
Papers
27
datasets
None public yet