Update README.md
```
git clone https://huggingface.co/datasets/liuhaotian/LLaVA-Pretrain --depth=1
```
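If you prefer not to use git, the Hugging Face CLI can fetch the same dataset. A minimal sketch, assuming a recent `huggingface_hub` is installed; the local directory name is just an example:

```
# Alternative download of the same pretraining dataset
# (requires: pip install -U huggingface_hub); ./LLaVA-Pretrain is just an example path.
huggingface-cli download liuhaotian/LLaVA-Pretrain --repo-type dataset --local-dir ./LLaVA-Pretrain
```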
3. Finetune Data

Please check the final release version

## Cheers! Now train your own model!
```
NPROC_PER_NODE=8 xtuner train ./llava_internlm2_chat_7b_dinov2_e1_gpu8_pretrain.
```

The checkpoints and TensorBoard logs are saved in `./work_dirs/` by default. I train for only 1 epoch, matching the original LLaVA paper. Some studies also report that training for multiple epochs makes the model overfit the training dataset and perform worse on other domains.
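To monitor training, you can point TensorBoard at the saved logs. A minimal sketch, assuming TensorBoard is installed (`pip install tensorboard`) and the default output root is unchanged:

```
# Launch TensorBoard on the default output root;
# open the printed URL and pick the run subdirectory created for your config.
tensorboard --logdir ./work_dirs
```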
This is my loss curve for llava-clip-internlm2-1_8b-pretrain-v1:
![image/png](https://cdn-uploads.huggingface.co/production/uploads/642a298ae5f33939cf3ee600/iNxPxfOvSJq8ZPz8uP_sP.png)

And the learning rate curve:
![image/png](https://cdn-uploads.huggingface.co/production/uploads/642a298ae5f33939cf3ee600/U1U9Kapcd6AIEUySvt2RS.png)

2. Instruction following fine-tuning

Please check the final release version