Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
zhibinlan
/
LLaVE-7B
like
3
Image-Text-to-Text
Transformers
Safetensors
English
llava
text-generation
Sentence Similarity
Embedding
zero-shot-image-classification
video-text-to-text
conversational
Inference Endpoints
arxiv:
2503.04812
License:
apache-2.0
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
01296af
LLaVE-7B
/
figures
3 contributors
History:
1 commit
zhibinlan
Upload 3 files
01296af
verified
3 days ago
leaderboard.png
220 kB
LFS
Upload 3 files
3 days ago
results.png
335 kB
LFS
Upload 3 files
3 days ago
zero-shot-vr.png
124 kB
LFS
Upload 3 files
3 days ago