Wenqi Zhang's picture

Wenqi Zhang

zwq2018

·

zwq2018

AI & ML interests

LLM, Multimodal, Robotics

Recent Activity

updated a dataset 2 days ago

DAMO-NLP-SG/multimodal_textbook

commented a paper 2 days ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

commented a paper 2 days ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

View all activity

Organizations

zwq2018's activity

upvoted 2 papers 2 days ago

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Paper • 2501.00599 • Published 5 days ago • 31

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published 4 days ago • 69

upvoted 4 papers 2 months ago

Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data

Paper • 2410.18558 • Published Oct 24, 2024 • 18

Distill Visual Chart Reasoning Ability from LLMs to MLLMs

Paper • 2410.18798 • Published Oct 24, 2024 • 19

Can Knowledge Editing Really Correct Hallucinations?

Paper • 2410.16251 • Published Oct 21, 2024 • 54

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22, 2024 • 89

upvoted a paper 4 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136

upvoted a paper 5 months ago

SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages

Paper • 2407.19672 • Published Jul 29, 2024 • 56

upvoted a paper 6 months ago

Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

Paper • 2407.07053 • Published Jul 9, 2024 • 43

upvoted a paper 12 months ago

Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives

Paper • 2401.02009 • Published Jan 4, 2024 • 1