ChatRex: Taming Multimodal LLM for Joint Perception and Understanding Paper โข 2411.18363 โข Published 24 days ago โข 9 โข 3
ChatRex: Taming Multimodal LLM for Joint Perception and Understanding Paper โข 2411.18363 โข Published 24 days ago โข 9 โข 3
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog โข 9 items โข Updated 23 days ago โข 49
Molmo Collection Artifacts for open multimodal language models. โข 5 items โข Updated 23 days ago โข 289
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection Paper โข 2405.10300 โข Published May 16 โข 26
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection Paper โข 2405.10300 โข Published May 16 โข 26
Runtime error 27 ๐ Grounding DINO 1.5 IDEA Research's Most Capable Open-Set Object Detection Model