weizhiwang
/

LLaVA-Video-Llama-3

Video-Text-to-Text

text-generation

Inference Endpoints

Model card Files Files and versions Community

weizhiwang commited on 28 days ago

Commit

9bc8bda

•

1 Parent(s): 139d1e2

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -12,7 +12,7 @@ pipeline_tag: video-text-to-text
 <!-- Provide a quick summary of what the model is/does. -->
-Please follow my github repo [LLaVA-Video-Llama-3](https://github.com/Victorwz/LLaVA-Video-Llama-3/) for more details on fine-tuning LLaVA model with Llama-3 as the foundatiaon LLM.
 ## Updates
 - [6/4/2024] The codebase supports the video data fine-tuning for video understanding tasks.
@@ -111,7 +111,7 @@ The video is funny because it shows a baby girl wearing glasses and reading a bo
 ```
 # Fine-Tune LLaVA-Llama-3 on Your Video Instruction Data
-Please refer to a forked [LLaVA-Video-Llama-3](https://github.com/Victorwz/LLaVA-Video-Llama-3) git repo for fine-tuning data preparation and scripts. The data loading function and fastchat conversation template are changed due to a different tokenizer.
 ## Citation

 <!-- Provide a quick summary of what the model is/does. -->
+Please follow my github repo [LLaVA-Unified](https://github.com/Victorwz/LLaVA-Unified) for more details on fine-tuning LLaVA model with Llama-3 as the foundatiaon LLM.
 ## Updates
 - [6/4/2024] The codebase supports the video data fine-tuning for video understanding tasks.
 ```
 # Fine-Tune LLaVA-Llama-3 on Your Video Instruction Data
+Please refer to our [LLaVA-Unified](https://github.com/Victorwz/LLaVA-Unified) git repo for fine-tuning data preparation and scripts. The data loading function and fastchat conversation template are changed due to a different tokenizer.
 ## Citation