Citation

If you use this finetuned model checkpoint in your research, please cite our paper as follows:

    @misc{zhang2024visualquestiondecompositionmultimodal,
      title={Visual Question Decomposition on Multimodal Large Language Models},
      author={Haowei Zhang and Jianzhe Liu and Zhen Han and Shuo Chen and Bailan He and Volker Tresp and Zhiqiang Xu and Jindong Gu},
      year={2024},
      eprint={2409.19339},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2409.19339},
    }

Safetensors
Model size: 25.5B params
Tensor type: BF16

Model tree for freesky/InternVL-Chat-V1-5_ft_by_DecoVQAplus_SelectiveLoss

Base model: OpenGVLab/InternVL-Chat-V1-5
Finetuned (3): this model