Zirui Wang

zwcolin

https://zwcolin.github.io/

zwcolin
zwcolin

AI & ML interests

My general research interest lies in two directions (1) understand and harness the synergy between generative and understanding modeling objectives, and (2) align image and text in different modalities, especially when texts (and other arbitrary, non-natural structures such as graphs and flowcharts) appear in the visual representation.

Organizations

zwcolin's activity

New activity in princeton-nlp/CharXiv 5 months ago

Upload 12 files

#3 opened 5 months ago by

zwcolin

New activity in princeton-nlp/CharXiv 6 months ago

Answer of the first image is wrong

#2 opened 7 months ago by

linus106

commented a paper 7 months ago

CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

Paper • 2406.18521 • Published Jun 26, 2024 • 29 •