Multimodal Models Collection Multimodal models with leading performance. • 17 items • Updated Jan 17 • 32
We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning? Paper • 2407.01284 • Published Jul 1, 2024 • 77
Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking Paper • 2502.02339 • Published 15 days ago • 21
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 24 days ago • 350
Qwen2-VL Collection Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 205