microsoft/BiomedCLIP-PubMedBERT_256-vit_base_patch16_224 Zero-Shot Image Classification β’ Updated Sep 27, 2024 β’ 94.4k β’ 249
PowerInfer/SmallThinker-3B-Preview Text Generation β’ Updated about 5 hours ago β’ 6.37k β’ β’ 257
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey Paper β’ 2412.18619 β’ Published 21 days ago β’ 49
view article Article π¦Έπ»#2: Your Go-To Vocabulary to Navigate the World of AI Agents and Agentic Workflows By Kseniase β’ 9 days ago β’ 8
Vision Language Models Papers πΌοΈπ¬π Collection Papers about vision-language models, most important ones are on top of the list. β’ 27 items β’ Updated Apr 30, 2024 β’ 35