This model is the stage 1 checkpoint for the DINOv2 setting, one of the thirteen settings used in "Law of Vision Representation in MLLMs".
