Haoning Wu, Teo PRO

teowu

https://teowu.github.io

AI & ML interests

Lead of Q-Future: https://github.com/Q-Future. I love MLLMs/LMMs/LVLMs/(any names you call them). Core builder of Aria (best open-source video LMM).

Recent Activity

updated a dataset about 14 hours ago

VideoLMMs/LongVideoCaptions-371K

updated a model 5 days ago

rhymes-ai/Aria

updated a model 7 days ago

rhymes-ai/Aria-Chat

View all activity

Organizations

teowu's activity

updated a dataset about 14 hours ago

VideoLMMs/LongVideoCaptions-371K

Updated about 14 hours ago • 3

updated a model 5 days ago

rhymes-ai/Aria

Image-Text-to-Text • Updated 5 days ago • 18.6k • 598

updated a model 7 days ago

rhymes-ai/Aria-Chat

Image-Text-to-Text • Updated 7 days ago • 118 • 4

upvoted a paper 12 days ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published 16 days ago • 44

upvoted a paper 19 days ago

VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation

Paper • 2412.00927 • Published 21 days ago • 26

New activity in zero-gpu-explorers/README 21 days ago

No space left on device when running ZeroGpu

#132 opened 26 days ago by

xiaozaa

updated a Space 21 days ago

Running

🔥

README

updated 2 models 21 days ago

rhymes-ai/Aria-Base-8K

Image-Text-to-Text • Updated 21 days ago • 145 • 6

rhymes-ai/Aria-Base-64K

Image-Text-to-Text • Updated 21 days ago • 143 • 10

liked 2 models 21 days ago

rhymes-ai/Aria-Base-64K

Image-Text-to-Text • Updated 21 days ago • 143 • 10

rhymes-ai/Aria-Base-8K

Image-Text-to-Text • Updated 21 days ago • 145 • 6

updated a model 22 days ago

teowu/Aria-Midtrain

Updated 22 days ago

liked a model 22 days ago

Qwen/QwQ-32B-Preview

Text Generation • Updated 23 days ago • 119k • • 1.38k

liked a dataset 26 days ago

allenai/pixmo-docs

Viewer • Updated 17 days ago • 255k • 4.65k • 19

updated a dataset 26 days ago

teowu/TOMATO_benchmark_videos

Viewer • Updated 26 days ago • 1.42k • 27

liked a Space 26 days ago

Running

🌎

Open VLM Video Leaderboard

VLMEvalKit Eval Results in video understanding benchmark

upvoted a paper 27 days ago

Data Engineering for Scaling Language Models to 128K Context

Paper • 2402.10171 • Published Feb 15 • 23

authored 3 papers about 1 month ago

Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare

Paper • 2405.19298 • Published May 29

LIME: Less Is More for MLLM Evaluation

Paper • 2409.06851 • Published Sep 10

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

Paper • 2411.13281 • Published Nov 20 • 17