PaliGemma 2: A Family of Versatile VLMs for Transfer • Paper • 2412.03555 • Published 18 days ago • 118
NVILA: Efficient Frontier Visual Language Models • Paper • 2412.04468 • Published 17 days ago • 54
VisionZip: Longer is Better but Not Necessary in Vision Language Models • Paper • 2412.04467 • Published 17 days ago • 103
Teach Multimodal LLMs to Comprehend Electrocardiographic Images • Paper • 2410.19008 • Published Oct 21 • 23
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss • Paper • 2410.17243 • Published Oct 22 • 89