ZechenBai/LOVA3-llava-v1.5-phi1.5-gemini


Create README.md

#2

by hhenryz - opened 3 days ago

base: refs/heads/main ← from: refs/pr/2


README.md +25 -0


	@@ -0,0 +1,25 @@

+---
+license: apache-2.0
+task_categories:
+- image-text-to-text
+---
+This repository contains the data for [LOVA3: Learning to Visual Question Answering, Asking and Assessment](https://huggingface.co/papers/2405.14974).
+LOVA3 is a framework designed to equip MLLMs with the capabilities to answer, ask, and assess questions in the context of images.
+Code: https://github.com/showlab/LOVA3
+## 🎓 Citation
+If you find LOVA3 useful, please cite using this BibTeX:
+```bibtex
+@inproceedings{
+    zhao2024lova,
+    title={{LOVA}3: Learning to Visual Question Answering, Asking and Assessment},
+    author={Hengyuan Zhao and Pan Zhou and Difei Gao and Zechen Bai and Mike Zheng Shou},
+    booktitle={The Thirty-eighth Annual Conference on Neural Information Processing Systems},
+    year={2024},
+    url={https://openreview.net/forum?id=vIOKLMl6wu}
+}
+```