|
--- |
|
license: gpl-3.0 |
|
datasets: |
|
- JosephusCheung/GuanacoVQADataset |
|
language: |
|
- en |
|
- zh |
|
- ja |
|
- de |
|
pipeline_tag: visual-question-answering |
|
--- |
|
|
|
The following content is currently a work in progress and does not represent the final quality. |
|
|
|
Alignment for the multilingual VQA tasks is being conducted on blip2-flan-t5-xxl and Guanaco using only Linear Layers. |
|
|
|
The latest weight file is provided here, based on the implementation of MiniGPT-4. |
|
|
|
This model supports English, Chinese, Japanese, and German languages and requires the combined use of the Guanaco 7B LLM model. |
|
|
|
A portion of the dataset has already been released. |