Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
YanweiLi
/
llama-vid-7b-pretrain-224
like
0
Text Generation
Transformers
llava
vision-language model
llama
video understanding
Inference Endpoints
arxiv:
2311.17043
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
main
llama-vid-7b-pretrain-224
1 contributor
History:
4 commits
YanweiLi
Update README.md
89e94e6
11 months ago
.gitattributes
Safe
1.52 kB
initial commit
11 months ago
README.md
Safe
1.52 kB
Update README.md
11 months ago
config.json
Safe
1.21 kB
Upload 3 files
11 months ago
mm_projector.bin
Safe
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.BFloat16Storage"
What is a pickle import?
434 MB
LFS
Upload 3 files
11 months ago
trainer_state.json
Safe
263 kB
Upload 3 files
11 months ago