Haoxiang-Wang
commited on
Commit
•
7a8c806
1
Parent(s):
f39a6b6
Update README.md
Browse files
README.md
CHANGED
@@ -23,8 +23,8 @@ license: llama3
|
|
23 |
|
24 |
## RewardBench LeaderBoard
|
25 |
|
26 |
-
| Base Model
|
27 |
-
|
28 |
| ArmoRM-Llama3-8B-v0.1 | Llama-3 8B | ArmoRM + MoE | **88.97** | 96.9 | **76.8** | **92.2** | **97.3** | 74.3 |
|
29 |
| Cohere May 2024 | Unknown | Unknown | 88.25 | 96.4 | 71.3 | **92.7** | **97.7** | **78.2** |
|
30 |
| GPT-4 Turbo (0125 version) | GPT-4 Turbo | LLM-as-a-Judge | 84.25 | 95.3 | 74.3 | 87.2 | 86.9 | 70.9 |
|
|
|
23 |
|
24 |
## RewardBench LeaderBoard
|
25 |
|
26 |
+
| Model | Base Model | Method | Score | Chat | Chat Hard | Safety | Reasoning | Prior Sets (0.5 weight) |
|
27 |
+
|:--------------------------------------------------------------------------------|:-----------------------------------------------------------------------|:-----:|:-----|:----------|:-------|:----------|:-----------------------|:------------------------|
|
28 |
| ArmoRM-Llama3-8B-v0.1 | Llama-3 8B | ArmoRM + MoE | **88.97** | 96.9 | **76.8** | **92.2** | **97.3** | 74.3 |
|
29 |
| Cohere May 2024 | Unknown | Unknown | 88.25 | 96.4 | 71.3 | **92.7** | **97.7** | **78.2** |
|
30 |
| GPT-4 Turbo (0125 version) | GPT-4 Turbo | LLM-as-a-Judge | 84.25 | 95.3 | 74.3 | 87.2 | 86.9 | 70.9 |
|