Shuhuai Ren's picture

5 6 35

Shuhuai Ren

ShuhuaiRen

·

https://renshuhuai-andy.github.io/

AI & ML interests

NLP, Multi-modal

Recent Activity

liked a dataset 11 days ago

CSU-JPG/TextAtlas5M

liked a dataset 15 days ago

AIDC-AI/Ovis-dataset

updated a collection 23 days ago

next-block-prediction

View all activity

Organizations

ShuhuaiRen's activity

upvoted a paper 29 days ago

Next Block Prediction: Video Generation via Semi-Autoregressive Modeling

Paper • 2502.07737 • Published about 1 month ago • 9

upvoted a paper about 1 month ago

EVEv2: Improved Baselines for Encoder-Free Vision-Language Models

Paper • 2502.06788 • Published Feb 10 • 12

upvoted a paper 3 months ago

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published Dec 19, 2024 • 51

upvoted 2 papers 9 months ago

M^3IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning

Paper • 2306.04387 • Published Jun 7, 2023 • 8

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

Paper • 2405.21075 • Published May 31, 2024 • 24

upvoted a paper about 1 year ago

TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Paper • 2312.02051 • Published Dec 4, 2023 • 1