Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
DAMO-NLP-SG
/
VideoLLaMA2-7B-16F
like
7
Visual Question Answering
Transformers
Safetensors
OpenGVLab/VideoChat2-IT
Lin-Chen/ShareGPT4V
liuhaotian/LLaVA-Instruct-150K
English
mistral
text-generation
multimodal large language model
large video-language model
Inference Endpoints
text-generation-inference
arxiv:
2406.07476
arxiv:
2306.02858
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
2f713ad
VideoLLaMA2-7B-16F
3 contributors
History:
3 commits
lixin4ever
Update README.md
2f713ad
verified
23 days ago
.gitattributes
1.52 kB
initial commit
25 days ago
README.md
7.84 kB
Update README.md
23 days ago
config.json
1.2 kB
Model upload.
24 days ago
generation_config.json
132 Bytes
Model upload.
24 days ago
model-00001-of-00004.safetensors
4.94 GB
LFS
Model upload.
24 days ago
model-00002-of-00004.safetensors
5 GB
LFS
Model upload.
24 days ago
model-00003-of-00004.safetensors
4.99 GB
LFS
Model upload.
24 days ago
model-00004-of-00004.safetensors
1.14 GB
LFS
Model upload.
24 days ago
model.safetensors.index.json
82.2 kB
Model upload.
24 days ago
special_tokens_map.json
438 Bytes
Model upload.
24 days ago
tokenizer.json
1.8 MB
Model upload.
24 days ago
tokenizer.model
493 kB
LFS
Model upload.
24 days ago
tokenizer_config.json
1.46 kB
Model upload.
24 days ago