Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines Paper โข 2410.21220 โข Published Oct 28 โข 10