File size: 1,403 Bytes
6765b11 7b8e1eb 6765b11 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 |
---
license: cc-by-4.0
tags:
- yahoo-open-source-software-incubator
- image-to-text
- image-captioning
inference: false
---
# Object Relation Transformer
The Object Relation Transformer is a Transformer-based image captioning model.
You can find more details about the model in our [NeurIPS 2019
paper](https://papers.nips.cc/paper/9293-image-captioning-transforming-objects-into-words.pdf).
This model repository contains two variants of the Object Relation
Transformer, as well as a couple of baseline models. Please find more
details about all these models within the [README of our Github
repository](https://github.com/yahoo/object_relation_transformer?tab=readme-ov-file#model-zoo-and-results).
## Citation
If you find these models useful, please consider citing (no obligation at all):
```
@article{herdade2019image,
title={Image Captioning: Transforming Objects into Words},
author={Herdade, Simao and Kappeler, Armin and Boakye, Kofi and Soares, Joao},
journal={arXiv preprint arXiv:1906.05963},
year={2019}
}
```
## Maintainers
- Joao Soares: jvbsoares@yahooinc.com
## License
The contents of this repository are (c) by Verizon Media.
The contents of this repository are licensed under a Creative Commons
Attribution 4.0 International License.
You should have received a copy of the license along with this
work. If not, see <https://creativecommons.org/licenses/by/4.0/>.
|