Shijia Yang's picture

1 5

Shijia Yang

shijiay

·

AI & ML interests

None yet

Organizations

None yet

shijiay's activity

upvoted a paper 3 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 24

upvoted an article 6 months ago

Article

Key Insights into the Law of Vision Representations in MLLMs

By

•

Sep 2, 2024

• 18

upvoted 3 papers 6 months ago

Law of Vision Representation in MLLMs

Paper • 2408.16357 • Published Aug 29, 2024 • 93

Multitask Vision-Language Prompt Tuning

Paper • 2211.11720 • Published Nov 21, 2022 • 2

HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption

Paper • 2310.01779 • Published Oct 3, 2023 • 4