--- license: other license_name: tongyi-qwen license_link: >- https://github.com/QwenLM/Qwen/blob/main/Tongyi%20Qianwen%20LICENSE%20AGREEMENT pipeline_tag: image-text-to-text --- This repository contains the model described in [Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning](https://huggingface.co/papers/2412.03565). Project page: https://inst-it.github.io/ Code: https://github.com/inst-it/inst-it