Forgotten Polygons: Multimodal Large Language Models are Shape-Blind • Paper • 2502.15969 • Published 18 days ago
What Do VLMs NOTICE? A Mechanistic Interpretability Pipeline for Noise-free Text-Image Corruption and Evaluation • Paper • 2406.16320 • Published Jun 24, 2024