Multimodal Models Collection Multimodal models with leading performance. • 17 items • Updated about 13 hours ago • 27
Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants Paper • 2310.00653 • Published Oct 1, 2023 • 3
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages Paper • 2308.12038 • Published Aug 23, 2023 • 2
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies Paper • 2404.06395 • Published Apr 9, 2024 • 22
GUICourse: From General Vision Language Models to Versatile GUI Agents Paper • 2406.11317 • Published Jun 17, 2024 • 1
Multimodal Models Collection Multimodal models with leading performance. • 17 items • Updated about 13 hours ago • 27