M-RewardBench: Evaluating Reward Models in Multilingual Settings Paper • 2410.15522 • Published Oct 20 • 10
Multilingual RewardBench Collection Multilingual Reward Model Evaluation Dataset and Results • 2 items • Updated about 1 month ago • 4
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 • 101