Xinyu Fang's picture

1 15 7

Xinyu Fang

nebulae09

·

FangXinyu-0913

AI & ML interests

None yet

Recent Activity

liked a Space 14 days ago

opencompass/Open_LMM_Reasoning_Leaderboard

upvoted a paper 14 days ago

Are Your LLMs Capable of Stable Reasoning?

authored a paper about 1 month ago

MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs

View all activity

Organizations

nebulae09's activity

upvoted a paper 14 days ago

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published 14 days ago • 90

upvoted a paper about 1 month ago

MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs

Paper • 2411.15296 • Published Nov 22 • 19

upvoted a paper 2 months ago

CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution

Paper • 2410.16256 • Published Oct 21 • 58

upvoted a paper 3 months ago

ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs

Paper • 2410.12405 • Published Oct 16 • 13

upvoted a collection 3 months ago

LLaVA-Video

Models focus on video understanding (previously known as LLaVA-NeXT-Video). • 6 items • Updated Oct 5 • 55

upvoted 2 papers 3 months ago

Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation

Paper • 2410.05363 • Published Oct 7 • 44

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Paper • 2409.16191 • Published Sep 24 • 41

upvoted 5 papers 6 months ago

NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?

Paper • 2407.11963 • Published Jul 16 • 43

MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning

Paper • 2406.17770 • Published Jun 25 • 18

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Paper • 2406.04325 • Published Jun 6 • 72

Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs

Paper • 2406.14544 • Published Jun 20 • 34

MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding

Paper • 2406.14515 • Published Jun 20 • 32

upvoted 3 articles 8 months ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14

• 229

Article

Vision Language Models Explained

Apr 11

• 238

Article

RealWorldQA, What's New?

By

•

Apr 25

• 5