Deepak Singh Rawat
commited on
Commit
•
749bdc7
1
Parent(s):
47962d5
Add Huggingface Spaces link
Browse files
README.md
CHANGED
@@ -10,6 +10,8 @@ tags:
|
|
10 |
|
11 |
An image captioning model to generate movie/t.v show plot from poster. It generates decent plots but is no way perfect. We are still working on improving the model.
|
12 |
|
|
|
|
|
13 |
# Model Details
|
14 |
|
15 |
The base model uses a Vision Transformer (ViT) model as an image encoder and GPT-2 as a decoder.
|
|
|
10 |
|
11 |
An image captioning model to generate movie/t.v show plot from poster. It generates decent plots but is no way perfect. We are still working on improving the model.
|
12 |
|
13 |
+
## Live demo on Hugging Face Spaces: https://huggingface.co/spaces/deepklarity/poster2plot
|
14 |
+
|
15 |
# Model Details
|
16 |
|
17 |
The base model uses a Vision Transformer (ViT) model as an image encoder and GPT-2 as a decoder.
|