HaoyeZhang committed
Commit 23d9724 • 1 Parent(s): e0c7f4a
Update README.md
README.md CHANGED
@@ -14,6 +14,9 @@ language:
 
 We utilize a novel framework, [RLAIF-V](https://github.com/RLHF-V/RLAIF-V), which **aligns MLLMs in a fully open-source paradigm**. This framework maximally exploits the [open-source feedback](https://huggingface.co/datasets/HaoyeZhang/RLAIF-V-Dataset) from two key perspectives, including **high-quality feedback data** and an **online feedback learning algorithm**.
 
+<p align="center">
+<img src="https://cdn-uploads.huggingface.co/production/uploads/6566e0c493e30c8a60048eb3/T4hALrgNdXKHnkvb-27bA.png" alt="fig1" width="70%"/>
+</p>
 
 ## Model Details
 
@@ -35,4 +38,7 @@ We utilize a novel framework, [RLAIF-V](https://github.com/RLHF-V/RLAIF-V), whic
 
 ### Model Description
 - **Related model:** [OmniLMM-12B](https://huggingface.co/openbmb/OmniLMM-12B)
-- **Trained on data:** [RLAIF-V-Dataset](https://huggingface.co/datasets/HaoyeZhang/RLAIF-V-Dataset)
+- **Trained on data:** [RLAIF-V-Dataset](https://huggingface.co/datasets/HaoyeZhang/RLAIF-V-Dataset)
+
+## Usage
+Please look at [GitHub](https://github.com/RLHF-V/RLAIF-V) for more details about usage.