Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper • 2311.06242 • Published Nov 10, 2023
Inject Semantic Concepts into Image Tagging for Open-Set Recognition Paper • 2310.15200 • Published Oct 23, 2023
SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding Paper • 2310.15308 • Published Oct 23, 2023