BAAI
/

BoyaWu10 commited on
Commit
07938a4
1 Parent(s): aea3510

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -15,7 +15,7 @@ This is Bunny-v1.1-4B.
15
 
16
  Bunny is a family of lightweight but powerful multimodal models. It offers multiple plug-and-play vision encoders, like EVA-CLIP, SigLIP and language backbones, including Phi-3-mini, Llama-3-8B, Phi-1.5, StableLM-2 and Phi-2. To compensate for the decrease in model size, we construct more informative training data by curated selection from a broader data source.
17
 
18
- We provide Bunny-v1.1-4B, which is built upon [SigLIP](https://huggingface.co/google/siglip-so400m-patch14-384) and [Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) with [S\\(^{2}\\)-Wrapper](https://github.com/bfshi/scaling_on_scales), supporting 1152x1152 resolution. More details about this model can be found in [GitHub](https://github.com/BAAI-DCAI/Bunny).
19
 
20
  | | MME \\(^{\text{P}}\\) | MME \\(^{\text{C}}\\) | MMB \\(^{\text{T/D}}\\) | MMB-CN \\(^{\text{T/D}}\\) |SEED(-IMG) | MMMU \\(^{\text{V/T}}\\) | VQA \\(^{\text{v2}}\\) | GQA | SQA \\(^{\text{I}}\\) | POPE |
21
  | ------------------ | :--------------: | :--------------: |:--------------: | :----------------: | :--: | :-----------------: | :---------------: | :--: | :--------------: | :--: |
 
15
 
16
  Bunny is a family of lightweight but powerful multimodal models. It offers multiple plug-and-play vision encoders, like EVA-CLIP, SigLIP and language backbones, including Phi-3-mini, Llama-3-8B, Phi-1.5, StableLM-2 and Phi-2. To compensate for the decrease in model size, we construct more informative training data by curated selection from a broader data source.
17
 
18
+ We provide Bunny-v1.1-4B, which is built upon [SigLIP](https://huggingface.co/google/siglip-so400m-patch14-384) and [Phi-3-mini-4k-instruct](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct) with [S \\(^{2}\\)-Wrapper](https://github.com/bfshi/scaling_on_scales), supporting 1152x1152 resolution. More details about this model can be found in [GitHub](https://github.com/BAAI-DCAI/Bunny).
19
 
20
  | | MME \\(^{\text{P}}\\) | MME \\(^{\text{C}}\\) | MMB \\(^{\text{T/D}}\\) | MMB-CN \\(^{\text{T/D}}\\) |SEED(-IMG) | MMMU \\(^{\text{V/T}}\\) | VQA \\(^{\text{v2}}\\) | GQA | SQA \\(^{\text{I}}\\) | POPE |
21
  | ------------------ | :--------------: | :--------------: |:--------------: | :----------------: | :--: | :-----------------: | :---------------: | :--: | :--------------: | :--: |