SakanaAI/Llama-3-EvoVLM-JP-v2
Image-to-Text
Transformers
Safetensors
Japanese
llava
image-text-to-text
multimodal
vision-language
mantis
llama3
siglip
Inference Endpoints
arxiv: 2403.13187
License: llama3
Commit History (branch: main)
Delete the repository (b34a669, verified), Inoichan committed on Aug 1, 2024
Update license (97c3229, verified), Inoichan committed on Aug 1, 2024
Update usage (0f8098c, verified), Inoichan committed on Aug 1, 2024
Fix device (75d77d8, verified), Inoichan committed on Aug 1, 2024
Update the links to the blog (2be6ebf, verified), Inoichan committed on Jul 31, 2024
Update README.md (0469061, verified), Inoichan committed on Jul 31, 2024
Update README.md (9b4dcdb, verified), Inoichan committed on Jul 30, 2024
Update README.md (a768111, verified), Inoichan committed on Jul 29, 2024
Upload LlavaForConditionalGeneration (9cf9a20, verified), Inoichan committed on Jul 29, 2024
initial commit (f35dc74, verified), Inoichan committed on Jul 29, 2024