Yuanxin Liu

lyx97

https://llyx97.github.io/

llyx97

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

updated a Space 23 days ago

lyx97/TempCompass

liked a Space about 1 month ago

opencompass/open_vlm_leaderboard

View all activity

Organizations

None yet

lyx97's activity

upvoted a paper 8 days ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published 16 days ago • 116

updated a Space 23 days ago

Running

🥇

TempCompass

liked 2 Spaces about 1 month ago

Running on CPU Upgrade

541

🌎

Open VLM Leaderboard

VLMEvalKit Evaluation Results Collection

Running

🌎

Open VLM Video Leaderboard

VLMEvalKit Eval Results in video understanding benchmark

liked a dataset 2 months ago

tobiaslee/text_temporal

Viewer • Updated Sep 27 • 12.5k • 115 • 2

upvoted 2 papers 2 months ago

Video Instruction Tuning With Synthetic Data

Paper • 2410.02713 • Published Oct 3 • 38

Pixtral 12B

Paper • 2410.07073 • Published Oct 9 • 62

authored 4 papers 2 months ago

VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models

Paper • 2311.17404 • Published Nov 29, 2023

TempCompass: Do Video LLMs Really Understand Videos?

Paper • 2403.00476 • Published Mar 1

COST-EFF: Collaborative Optimization of Spatial and Temporal Efficiency with Slenderized Multi-exit Language Models

Paper • 2210.15523 • Published Oct 27, 2022 • 1

Temporal Reasoning Transfer from Text to Video

Paper • 2410.06166 • Published Oct 8 • 12

upvoted 2 papers 2 months ago

Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation

Paper • 2410.05363 • Published Oct 7 • 44

Temporal Reasoning Transfer from Text to Video

Paper • 2410.06166 • Published Oct 8 • 12

liked a model 4 months ago

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • Updated 17 days ago • 2.53M • • 965

liked a dataset 4 months ago

lmms-lab/Video-MME

Viewer • Updated Jul 4 • 2.7k • 9.28k • 30

liked a dataset 5 months ago

lmms-lab/TempCompass

Viewer • Updated Jun 10 • 7.54k • 516 • 5

upvoted a paper 6 months ago

MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation

Paper • 2407.00468 • Published Jun 29 • 34

liked 2 datasets 6 months ago

MLVU/MVLU

Preview • Updated Sep 18 • 4.61k • 16

ShareGPT4Video/ShareGPT4Video

Viewer • Updated Jul 8 • 40.2k • 2.79k • 183

liked a Space 7 months ago

Running on Zero

🎞️🍿