11 16 29

Yizhi Li

yizhilll

https://yizhilll.github.io

AI & ML interests

None yet

Recent Activity

liked a dataset 18 days ago

O1-OPEN/OpenO1-SFT

upvoted a paper 25 days ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

View all activity

Organizations

yizhilll's activity

upvoted a paper 25 days ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published 28 days ago • 46

upvoted a paper about 1 month ago

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

Paper • 2411.13281 • Published Nov 20, 2024 • 17

upvoted 3 papers about 2 months ago

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models

Paper • 2411.07140 • Published Nov 11, 2024 • 33

OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

Paper • 2411.07199 • Published Nov 11, 2024 • 46

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published Nov 7, 2024 • 111

upvoted 2 papers 3 months ago

Pyramidal Flow Matching for Efficient Video Generative Modeling

Paper • 2410.05954 • Published Oct 8, 2024 • 38

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Paper • 2409.16191 • Published Sep 24, 2024 • 41

upvoted a collection 3 months ago

M-A-P Full Paper List

Collection

26 items • Updated Oct 8, 2024 • 6

upvoted a paper 3 months ago

OmniBench: Towards The Future of Universal Omni-Language Models

Paper • 2409.15272 • Published Sep 23, 2024 • 26

upvoted a paper 6 months ago

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models

Paper • 2407.12772 • Published Jul 17, 2024 • 33

upvoted 3 papers 7 months ago

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

Paper • 2405.19327 • Published May 29, 2024 • 46

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16, 2024 • 126

LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters

Paper • 2405.16287 • Published May 25, 2024 • 10

upvoted an article 8 months ago

Article

A Dive into Pretraining Strategies for Vision-Language Models

Feb 3, 2023

• 52

upvoted a paper 8 months ago

TextSquare: Scaling up Text-Centric Visual Instruction Tuning

Paper • 2404.12803 • Published Apr 19, 2024 • 29

upvoted a paper 10 months ago

ChatMusician: Understanding and Generating Music Intrinsically with LLM

Paper • 2402.16153 • Published Feb 25, 2024 • 56