Create README.md (#2), opened by hhenryz
README.md ADDED
@@ -0,0 +1,25 @@
+---
+license: apache-2.0
+task_categories:
+- image-text-to-text
+---
+
+This repository contains the data for [LOVA3: Learning to Visual Question Answering, Asking and Assessment](https://huggingface.co/papers/2405.14974).
+LOVA3 is a framework designed to equip MLLMs with the capabilities to answer, ask, and assess questions in the context of images.
+
+Code: https://github.com/showlab/LOVA3
+
+## 🎓 Citation
+
+If you find LOVA3 useful, please cite using this BibTeX:
+
+```bibtex
+@inproceedings{
+zhao2024lova,
+title={{LOVA}3: Learning to Visual Question Answering, Asking and Assessment},
+author={Hengyuan Zhao and Pan Zhou and Difei Gao and Zechen Bai and Mike Zheng Shou},
+booktitle={The Thirty-eighth Annual Conference on Neural Information Processing Systems},
+year={2024},
+url={https://openreview.net/forum?id=vIOKLMl6wu}
+}
+```