general-preference/GPM-Gemma-2B

kirigayahitsugi committed 29 days ago

Commit 17553f0 (1 parent: 894bd99)

Update README.md

Files changed (1)

README.md +1 -1

@@ -27,7 +27,7 @@ The General Preference Representation Model (GPM) improves preference-based rewa
 ## Evaluation
-The GPM is evaluated using the [RewardBench](https://github.com/allenai/reward-bench) leaderboard, showing significant improvements over the BT model, with a performance margin of up to 5.6%. GPM also excels in modeling cyclic preferences, achieving 100% accuracy on cyclic datasets.
+The GPM is evaluated using the [RewardBench](https://github.com/allenai/reward-bench) leaderboard, showing significant improvements over the BT model, with a performance margin of up to 9.11%. GPM also excels in modeling cyclic preferences, achieving 100% accuracy on cyclic datasets.
 ## Usage