---
license: other
license_name: tongyi-qwen
license_link: >-
  https://github.com/QwenLM/Qwen/blob/main/Tongyi%20Qianwen%20LICENSE%20AGREEMENT
pipeline_tag: image-text-to-text
---

This repository contains the model described in the paper "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning".
Project page: https://inst-it.github.io/