Update README.md
Browse files
README.md
CHANGED
@@ -54,8 +54,13 @@ The primary intended users of the model are researchers and hobbyists in compute
|
|
54 |
|
55 |
## Training dataset
|
56 |
595K filtered image-text pairs from CC3M.
|
|
|
57 |
150K GPT-generated multimodal instruction-following chat data.
|
|
|
58 |
83K VQA v2 instruction-following VQA data.
|
|
|
59 |
16K A-OKVQA instruction-following CoT-VQA data.
|
|
|
60 |
23K FLICKR instruction-following spotting captioning data.
|
|
|
61 |
10K LLaVA-based human preference data
|
|
|
54 |
|
55 |
## Training dataset
|
56 |
595K filtered image-text pairs from CC3M.
|
57 |
+
|
58 |
150K GPT-generated multimodal instruction-following chat data.
|
59 |
+
|
60 |
83K VQA v2 instruction-following VQA data.
|
61 |
+
|
62 |
16K A-OKVQA instruction-following CoT-VQA data.
|
63 |
+
|
64 |
23K FLICKR instruction-following spotting captioning data.
|
65 |
+
|
66 |
10K LLaVA-based human preference data
|