File size: 583 Bytes
871457d 6a7fde3 871457d 6a7fde3 871457d 6a7fde3 871457d 6a7fde3 871457d 6a7fde3 871457d 6a7fde3 871457d 6a7fde3 871457d 6a7fde3 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 |
This is a Vanilla BT based Reward model based on Gemma-2-9B. The recipes are from RLHF Workflow.
We have the reward-bench result:
Chat: 98.04
Chat Hard: 65.35
Safety: 89.54
Reasoning: 92.31
Please refer to
```bibtex
@misc{dong2024rlhf,
title={RLHF Workflow: From Reward Modeling to Online RLHF},
author={Hanze Dong and Wei Xiong and Bo Pang and Haoxiang Wang and Han Zhao and Yingbo Zhou and Nan Jiang and Doyen Sahoo and Caiming Xiong and Tong Zhang},
year={2024},
eprint={2405.07863},
archivePrefix={arXiv},
primaryClass={cs.LG}
}
``` |