license: other | |
license_name: tongyi-qwen | |
license_link: >- | |
https://github.com/QwenLM/Qwen/blob/main/Tongyi%20Qianwen%20LICENSE%20AGREEMENT | |
pipeline_tag: image-text-to-text | |
This repository contains the model described in [Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning](https://huggingface.co/papers/2412.03565). | |
Project page: https://inst-it.github.io/ | |
Code: https://github.com/inst-it/inst-it |