Hugging Face
zhibinlan/LLaVE-7B
Tags: Image-Text-to-Text, Transformers, Safetensors, English, llava, text-generation, Sentence Similarity, Embedding, zero-shot-image-classification, video-text-to-text, conversational, Inference Endpoints
arxiv: 2503.04812
License: apache-2.0
Branch: main
Path: LLaVE-7B/figures
2 contributors
History: 1 commit
Latest commit: zhibinlan, "Upload 3 files", 01296af (verified), 2 days ago
leaderboard.png    220 kB    LFS    Upload 3 files    2 days ago
results.png        335 kB    LFS    Upload 3 files    2 days ago
zero-shot-vr.png   124 kB    LFS    Upload 3 files    2 days ago