deepklarity
/

poster2plot

Image Classification

vision-encoder-decoder

image-text-to-text

image-captioning

Inference Endpoints

Model card Files Files and versions Community

Deepak Singh Rawat commited on Nov 22, 2021

Commit

749bdc7

·

1 Parent(s): 47962d5

Add Huggingface Spaces link

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -10,6 +10,8 @@ tags:
 An image captioning model to generate movie/t.v show plot from poster. It generates decent plots but is no way perfect. We are still working on improving the model.
 # Model Details
 The base model uses a Vision Transformer (ViT) model as an image encoder and GPT-2 as a decoder.

 An image captioning model to generate movie/t.v show plot from poster. It generates decent plots but is no way perfect. We are still working on improving the model.
+## Live demo on Hugging Face Spaces: https://huggingface.co/spaces/deepklarity/poster2plot
 # Model Details
 The base model uses a Vision Transformer (ViT) model as an image encoder and GPT-2 as a decoder.