Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Xenova
/
vit-gpt2-image-captioning
like
21
Image-to-Text
Transformers.js
ONNX
vision-encoder-decoder
image-text-to-text
image-captioning
Model card
Files
Files and versions
Community
1
Use this model
ba08b91
vit-gpt2-image-captioning
/
onnx
2 contributors
History:
3 commits
Xenova
HF staff
Upload folder using huggingface_hub
ba08b91
about 1 year ago
decoder_model.onnx
Safe
613 MB
LFS
Upload folder using huggingface_hub
about 1 year ago
decoder_model_merged.onnx
Safe
615 MB
LFS
Upload folder using huggingface_hub
about 1 year ago
decoder_model_merged_quantized.onnx
Safe
159 MB
LFS
Upload folder using huggingface_hub
about 1 year ago
decoder_model_quantized.onnx
Safe
156 MB
LFS
Upload folder using huggingface_hub
about 1 year ago
decoder_with_past_model.onnx
Safe
613 MB
LFS
Upload folder using huggingface_hub
about 1 year ago
decoder_with_past_model_quantized.onnx
Safe
156 MB
LFS
Upload folder using huggingface_hub
about 1 year ago
encoder_model.onnx
Safe
343 MB
LFS
Upload folder using huggingface_hub
over 1 year ago
encoder_model_quantized.onnx
Safe
87.5 MB
LFS
Upload folder using huggingface_hub
over 1 year ago