Update README.md
Browse files
README.md
CHANGED
@@ -24,6 +24,8 @@ Llama-3.1-Nemotron-70B-Reward is a large language model customized using develop
|
|
24 |
By accessing this model, you are agreeing to the LLama 3.1 terms and conditions of the [license](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/LICENSE), [acceptable use policy](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/USE_POLICY.md) and [Meta’s privacy policy](https://www.facebook.com/privacy/policy/)
|
25 |
|
26 |
|
|
|
|
|
27 |
| Model | Type of Data Used For Training | Overall | Chat | Chat-Hard | Safety | Reasoning |
|
28 |
|:-----------------------------|:----------------|:-----|:----------|:-------|:----------|:-----------------------|
|
29 |
| _**Llama-3.1-Nemotron-70B-Reward**_ |Permissive Licensed Data Only (CC-BY-4.0) | **94.1** | **97.5** | 85.8 | **95.1** | **98.1** |
|
|
|
24 |
By accessing this model, you are agreeing to the LLama 3.1 terms and conditions of the [license](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/LICENSE), [acceptable use policy](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/USE_POLICY.md) and [Meta’s privacy policy](https://www.facebook.com/privacy/policy/)
|
25 |
|
26 |
|
27 |
+
## RewardBench Primary Dataset LeaderBoard
|
28 |
+
|
29 |
| Model | Type of Data Used For Training | Overall | Chat | Chat-Hard | Safety | Reasoning |
|
30 |
|:-----------------------------|:----------------|:-----|:----------|:-------|:----------|:-----------------------|
|
31 |
| _**Llama-3.1-Nemotron-70B-Reward**_ |Permissive Licensed Data Only (CC-BY-4.0) | **94.1** | **97.5** | 85.8 | **95.1** | **98.1** |
|