PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos Paper • 2412.01800 • Published Dec 2, 2024 • 6
PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance Paper • 2411.02327 • Published Nov 4, 2024 • 11
PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance Paper • 2411.02327 • Published Nov 4, 2024 • 11 • 1
Mug-STAN: Adapting Image-Language Pretrained Models for General Video Understanding Paper • 2311.15075 • Published Nov 25, 2023 • 1
ST-LLM: Large Language Models Are Effective Temporal Learners Paper • 2404.00308 • Published Mar 30, 2024 • 8
PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance Paper • 2411.02327 • Published Nov 4, 2024 • 11