Libra-Base

This model was trained on image-text pairs for basic multi-modal understanding ability.

In addition to the pretrained weights in this repo, please download the pretrained CLIP model in huggingface and merge it into the path, as:

libra-base/
├── ...
└── openai-clip-vit-large-patch14-336/
    └── ...

The CLIP model can be downloaded here.

Safetensors

Model size

11B params

Tensor type

BF16

Collection including YifanXu/libra-11b-base