xurantju commited on
Commit
78f61d3
1 Parent(s): 48ec795

update readme

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -16,15 +16,15 @@ These models have been trained at scale on high-quality image caption datasets a
16
  * The **instruct** fine-tuned model, `xgen-mm-phi3-mini-instruct-r-v1`, achieves state-of-the-art performance among open-source and closed-source VLMs under 5b parameters.
17
  * `xgen-mm-phi3-mini-instruct-r-v1` supports flexible high-resolution image encoding with efficient visual token sampling.
18
 
19
- More technical details will come with a technical report soon.
20
 
21
 
22
  # Datasets
23
 
24
  | Dataset Type| Dataset(s) Used |
25
  |--------|------------------------------------------|
26
- | Pretrain | caption data: (datacomp, cc12m, cc3m, SBU, vg) && interleaved data: obelics |
27
- | Instruction Tuning | LLaVA-Instruct-150K, ShareGPT4V captions, a mixture of academic VQA data including OCR/Document/Chart-focused tasks, publicly available text-only instruction data |
28
 
29
  # Results
30
 
 
16
  * The **instruct** fine-tuned model, `xgen-mm-phi3-mini-instruct-r-v1`, achieves state-of-the-art performance among open-source and closed-source VLMs under 5b parameters.
17
  * `xgen-mm-phi3-mini-instruct-r-v1` supports flexible high-resolution image encoding with efficient visual token sampling.
18
 
19
+ The model is for research purposes, more technical details will come with a technical report soon.
20
 
21
 
22
  # Datasets
23
 
24
  | Dataset Type| Dataset(s) Used |
25
  |--------|------------------------------------------|
26
+ | Pretrain | caption data: high-quality image caption datasets and interleaved datasets |
27
+ | Instruction Tuning | visual instruction following and caption datasets, a mixture of academic VQA data including OCR/Document/Chart-focused tasks, publicly available text-only instruction data |
28
 
29
  # Results
30