LZHgrla commited on
Commit
46c88df
1 Parent(s): f65b684

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +80 -0
README.md ADDED
@@ -0,0 +1,80 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ datasets:
3
+ - Lin-Chen/ShareGPT4V
4
+ pipeline_tag: image-text-to-text
5
+ library_name: xtuner
6
+ ---
7
+
8
+ <div align="center">
9
+ <img src="https://github.com/InternLM/lmdeploy/assets/36994684/0cf8d00f-e86b-40ba-9b54-dc8f1bc6c8d8" width="600"/>
10
+
11
+
12
+ [![Generic badge](https://img.shields.io/badge/GitHub-%20XTuner-black.svg)](https://github.com/InternLM/xtuner)
13
+
14
+
15
+ </div>
16
+
17
+ ## Model
18
+
19
+ llava-llama-3-8b-v1_1 is a LLaVA model fine-tuned from [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) and [CLIP-ViT-Large-patch14-336](https://huggingface.co/openai/clip-vit-large-patch14-336) with [ShareGPT4V-PT](https://huggingface.co/datasets/Lin-Chen/ShareGPT4V) and [InternVL-SFT](https://github.com/OpenGVLab/InternVL/tree/main/internvl_chat#prepare-training-datasets) by [XTuner](https://github.com/InternLM/xtuner).
20
+
21
+ ## Results
22
+
23
+
24
+ ## Quickstart
25
+
26
+ ### Installation
27
+
28
+ ```shell
29
+ pip install 'git+https://github.com/InternLM/xtuner.git#egg=xtuner[deepspeed]'
30
+ ```
31
+
32
+ ### Chat
33
+
34
+ ```shell
35
+ xtuner chat xtuner/llava-llama-3-8b-v1_1 \
36
+ --visual-encoder openai/clip-vit-large-patch14-336 \
37
+ --llava xtuner/llava-llama-3-8b-v1_1 \
38
+ --prompt-template llama3_chat \
39
+ --image $IMAGE_PATH
40
+ ```
41
+
42
+ ### MMBench Evaluation
43
+
44
+ XTuner integrates the MMBench evaluation, and you can perform evaluations with the following command!
45
+
46
+ ```bash
47
+ xtuner mmbench xtuner/llava-llama-3-8b-v1_1 \
48
+ --visual-encoder openai/clip-vit-large-patch14-336 \
49
+ --llava xtuner/llava-llama-3-8b-v1_1 \
50
+ --prompt-template llama3_chat \
51
+ --data-path $MMBENCH_DATA_PATH \
52
+ --work-dir $RESULT_PATH
53
+ ```
54
+
55
+ After the evaluation is completed, if it's a development set, it will directly print out the results; If it's a test set, you need to submit `mmbench_result.xlsx` to the official MMBench for final evaluation to obtain precision results!
56
+
57
+ ### Training
58
+
59
+ 1. Pretrain (saved by default in `./work_dirs/llava_llama3_8b_instruct_clip_vit_large_p14_336_e1_gpu8_sharegpt4v_pretrain/`)
60
+
61
+ ```bash
62
+ NPROC_PER_NODE=8 xtuner train llava_llama3_8b_instruct_clip_vit_large_p14_336_e1_gpu8_sharegpt4v_pretrain --deepspeed deepspeed_zero2 --seed 1234
63
+ ```
64
+
65
+ 2. Fine-tune (saved by default in `./work_dirs/llava_llama3_8b_instruct_full_clip_vit_large_p14_336_lora_e1_gpu8_internvl_finetune/`)
66
+
67
+ ```bash
68
+ NPROC_PER_NODE=8 xtuner train llava_llama3_8b_instruct_full_clip_vit_large_p14_336_lora_e1_gpu8_internvl_finetune --deepspeed deepspeed_zero2 --seed 1234
69
+ ```
70
+
71
+ ## Citation
72
+
73
+ ```bibtex
74
+ @misc{2023xtuner,
75
+ title={XTuner: A Toolkit for Efficiently Fine-tuning LLM},
76
+ author={XTuner Contributors},
77
+ howpublished = {\url{https://github.com/InternLM/xtuner}},
78
+ year={2023}
79
+ }
80
+ ```