LinWeizheDragon committed
Commit 67d9380 · 1 Parent(s): f4edb9c
Update README.md

README.md CHANGED
@@ -3,6 +3,12 @@ library_name: transformers
 license: mit
 language:
 - en
+tags:
+- retrieval
+- multi-modal
+- knowledge-based visual question answering
+- FLMR
+- PreFLMR
 ---
 
 # PreFLMR model card
@@ -37,11 +43,11 @@ This model can be used combined with language models to create a retrieval-augme
 
 ## How to Get Started with the Model
 
-For details of training, indexing and performing retrieval, please refer to [here](https://github.com/LinWeizheDragon/FLMR).
+For details of training, indexing, and performing retrieval, please refer to [here](https://github.com/LinWeizheDragon/FLMR).
 
 ## Training datasets
-The model is
-1. Image to Text retrieval: WIT, KVQA and CC3M
+The model is pre-trained on three types of tasks with a total of nine datasets:
+1. Image to Text retrieval: WIT, KVQA, and CC3M
 2. Question to Text retrieval: MSMARCO
 3. Image & Question to Text retrieval: LLaVA, OVEN, OKVQA, Infoseek and E-VQA