This model is the stage 1 checkpoint of one of the thirteen settings, SigLIP, used in the Law of Vision Representation in MLLMs.

Downloads last month
14
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.