chrisliu298 commited on
Commit
730eec6
1 Parent(s): e5fc468

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -46,9 +46,9 @@ We evaluate our model on [RewardBench](https://huggingface.co/spaces/allenai/rew
46
 
47
  | Rank | Model | Chat | Chat Hard | Safety | Reasoning | Score |
48
  | :---: | --------------------------- | :---: | :-------: | :----: | :-------: | :---: |
49
- | 1 | Skywork-Reward-Gemma-2-27B | 95.8 | 91.4 | 92.0 | 96.2 | 93.9 |
50
  | 2 | SFR-LLaMa-3.1-70B-Judge-r | 96.9 | 84.8 | 92.2 | 97.6 | 92.8 |
51
- | 3 | Skywork-Reward-Llama-3.1-8B | 96.1 | 87.3 | 90.6 | 96.1 | 92.5 |
52
  | 4 | Nemotron-4-340B-Reward | 95.8 | 87.1 | 92.2 | 93.6 | 92.2 |
53
  | 5 | ArmoRM-Llama3-8B-v0.1 | 96.9 | 76.8 | 92.2 | 97.3 | 90.8 |
54
  | 6 | internlm2-20b-reward | 98.9 | 76.5 | 89.9 | 95.8 | 90.3 |
 
46
 
47
  | Rank | Model | Chat | Chat Hard | Safety | Reasoning | Score |
48
  | :---: | --------------------------- | :---: | :-------: | :----: | :-------: | :---: |
49
+ | 1 | Skywork-Reward-Gemma-2-27B | 95.8 | 91.4 | 92.0 | 96.1 | 93.8 |
50
  | 2 | SFR-LLaMa-3.1-70B-Judge-r | 96.9 | 84.8 | 92.2 | 97.6 | 92.8 |
51
+ | 3 | Skywork-Reward-Llama-3.1-8B | 95.8 | 87.3 | 90.6 | 96.2 | 92.5 |
52
  | 4 | Nemotron-4-340B-Reward | 95.8 | 87.1 | 92.2 | 93.6 | 92.2 |
53
  | 5 | ArmoRM-Llama3-8B-v0.1 | 96.9 | 76.8 | 92.2 | 97.3 | 90.8 |
54
  | 6 | internlm2-20b-reward | 98.9 | 76.5 | 89.9 | 95.8 | 90.3 |