Which Vision Encoder was used here?

by floschne - opened Mar 18, 2024

Do you have any information about the exact vision encoder which was used?

nielsr

Llava Hugging Face org Mar 18, 2024

Hi,

The CLIP vision encoder by OpenAI was used, as can be seen here in the original implementation.
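
For anyone who wants to check this themselves: LLaVA checkpoints in the Hugging Face format usually record the vision tower under a `vision_config` entry in `config.json`. Below is a minimal sketch of reading that entry using only the standard library. The sample JSON and the helper name `describe_vision_encoder` are illustrative assumptions, not values copied from this repository, so compare against the actual `config.json` of the checkpoint you are using.

```python
import json

# Illustrative excerpt only: these field values are assumptions for the
# sketch, not copied from this repository's config.json.
SAMPLE_CONFIG = """
{
  "model_type": "llava",
  "vision_config": {
    "model_type": "clip_vision_model",
    "image_size": 336,
    "patch_size": 14
  }
}
"""


def describe_vision_encoder(config_text):
    """Return (model_type, image_size, patch_size) from a config.json string.

    Hypothetical helper: missing fields come back as "unknown" / None
    instead of raising, since not every checkpoint sets every key.
    """
    config = json.loads(config_text)
    vision = config.get("vision_config") or {}
    return (
        vision.get("model_type", "unknown"),
        vision.get("image_size"),
        vision.get("patch_size"),
    )


print(describe_vision_encoder(SAMPLE_CONFIG))
```

A `model_type` of `clip_vision_model` would be consistent with the answer above; the image and patch sizes help narrow down which CLIP variant is meant.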
