NeMo
English
nvidia
llama3.1
reward model
zhilinw commited on
Commit
1bd9327
·
verified ·
1 Parent(s): dd48622

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -24,6 +24,8 @@ Llama-3.1-Nemotron-70B-Reward is a large language model customized using develop
24
  By accessing this model, you are agreeing to the LLama 3.1 terms and conditions of the [license](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/LICENSE), [acceptable use policy](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/USE_POLICY.md) and [Meta’s privacy policy](https://www.facebook.com/privacy/policy/)
25
 
26
 
 
 
27
  | Model | Type of Data Used For Training | Overall | Chat | Chat-Hard | Safety | Reasoning |
28
  |:-----------------------------|:----------------|:-----|:----------|:-------|:----------|:-----------------------|
29
  | _**Llama-3.1-Nemotron-70B-Reward**_ |Permissive Licensed Data Only (CC-BY-4.0) | **94.1** | **97.5** | 85.8 | **95.1** | **98.1** |
 
24
  By accessing this model, you are agreeing to the LLama 3.1 terms and conditions of the [license](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/LICENSE), [acceptable use policy](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/USE_POLICY.md) and [Meta’s privacy policy](https://www.facebook.com/privacy/policy/)
25
 
26
 
27
+ ## RewardBench Primary Dataset LeaderBoard
28
+
29
  | Model | Type of Data Used For Training | Overall | Chat | Chat-Hard | Safety | Reasoning |
30
  |:-----------------------------|:----------------|:-----|:----------|:-------|:----------|:-----------------------|
31
  | _**Llama-3.1-Nemotron-70B-Reward**_ |Permissive Licensed Data Only (CC-BY-4.0) | **94.1** | **97.5** | 85.8 | **95.1** | **98.1** |