openbmb
/

RLAIF-V-12B

Text Generation

Inference Endpoints

Model card Files Files and versions Community

HaoyeZhang commited on May 25

Commit

23d9724

•

1 Parent(s): e0c7f4a

Update README.md

Files changed (1) hide show

README.md +7 -1

README.md CHANGED Viewed

@@ -14,6 +14,9 @@ language:
 We utilize a novel framework, [RLAIF-V](https://github.com/RLHF-V/RLAIF-V), which **aligns MLLMs in a fully open-source paradigm**. This framework maximally exploits the [open-source feedback](https://huggingface.co/datasets/HaoyeZhang/RLAIF-V-Dataset) from two key perspectives, including **high-quality feedback data** and an **online feedback learning algorithm**.
 ## Model Details
@@ -35,4 +38,7 @@ We utilize a novel framework, [RLAIF-V](https://github.com/RLHF-V/RLAIF-V), whic
 ### Model Description
 - **Related model:** [OmniLMM-12B](https://huggingface.co/openbmb/OmniLMM-12B)
-- **Trained on data:** [RLAIF-V-Dataset](https://huggingface.co/datasets/HaoyeZhang/RLAIF-V-Dataset)

 We utilize a novel framework, [RLAIF-V](https://github.com/RLHF-V/RLAIF-V), which **aligns MLLMs in a fully open-source paradigm**. This framework maximally exploits the [open-source feedback](https://huggingface.co/datasets/HaoyeZhang/RLAIF-V-Dataset) from two key perspectives, including **high-quality feedback data** and an **online feedback learning algorithm**.
+<p align="center">
+  <img src="https://cdn-uploads.huggingface.co/production/uploads/6566e0c493e30c8a60048eb3/T4hALrgNdXKHnkvb-27bA.png" alt="fig1" width="70%"/>
+</p>
 ## Model Details
 ### Model Description
 - **Related model:** [OmniLMM-12B](https://huggingface.co/openbmb/OmniLMM-12B)
+- **Trained on data:** [RLAIF-V-Dataset](https://huggingface.co/datasets/HaoyeZhang/RLAIF-V-Dataset)
+## Usage
+Please look at [GitHub](https://github.com/RLHF-V/RLAIF-V) for more details about usage.