Trust but Verify: Programmatic VLM Evaluation in the Wild Paper • 2410.13121 • Published Oct 17, 2024 • 2
BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions Paper • 2411.07461 • Published Nov 12, 2024 • 22
BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions Paper • 2411.07461 • Published Nov 12, 2024 • 22
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16, 2024 • 98
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16, 2024 • 98
Salesforce/xgen-mm-phi3-mini-instruct-r-v1 Image-Text-to-Text • Updated Sep 18, 2024 • 1.29k • 185
List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs Paper • 2404.16375 • Published Apr 25, 2024 • 16
List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs Paper • 2404.16375 • Published Apr 25, 2024 • 16
Learning Concise and Descriptive Attributes for Visual Recognition Paper • 2308.03685 • Published Aug 7, 2023