lmms-lab
/

LLaVA-Video-7B-Qwen2

Video-Text-to-Text

text-generation

Inference Endpoints

Model card Files Files and versions Community

ZhangYuanhan commited on Oct 4, 2024

Commit

4fd4fba

·

verified ·

1 Parent(s): 5a16914

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -134,6 +134,8 @@ The LLaVA-Video models are 7/72B parameter models trained on [LLaVA-Video-178K](
 This model support at most 64 frames.
 - **Repository:** [LLaVA-VL/LLaVA-NeXT](https://github.com/LLaVA-VL/LLaVA-NeXT?tab=readme-ov-file)
 - **Point of Contact:** [Yuanhan Zhang](https://zhangyuanhan-ai.github.io/)
 - **Languages:** English, Chinese

 This model support at most 64 frames.
+- **Project Page:** [Project Page](https://llava-vl.github.io/blog/2024-09-30-llava-video/).
+- **Paper** For more details, please check our [paper](arxiv.org/abs/2410.02713)
 - **Repository:** [LLaVA-VL/LLaVA-NeXT](https://github.com/LLaVA-VL/LLaVA-NeXT?tab=readme-ov-file)
 - **Point of Contact:** [Yuanhan Zhang](https://zhangyuanhan-ai.github.io/)
 - **Languages:** English, Chinese