nvidia
/

Llama-3.1-Nemotron-70B-Reward

Model card Files Files and versions Community

zhilinw commited on Sep 28, 2024

Commit

1bd9327

·

verified ·

1 Parent(s): dd48622

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -24,6 +24,8 @@ Llama-3.1-Nemotron-70B-Reward is a large language model customized using develop
 By accessing this model, you are agreeing to the LLama 3.1 terms and conditions of the [license](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/LICENSE), [acceptable use policy](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/USE_POLICY.md) and [Meta’s privacy policy](https://www.facebook.com/privacy/policy/)
  | Model  | Type of Data Used For Training |  Overall | Chat | Chat-Hard | Safety | Reasoning |
 |:-----------------------------|:----------------|:-----|:----------|:-------|:----------|:-----------------------|
 | _**Llama-3.1-Nemotron-70B-Reward**_ |Permissive Licensed Data Only (CC-BY-4.0) | **94.1** | **97.5** | 85.8 | **95.1** | **98.1** |

 By accessing this model, you are agreeing to the LLama 3.1 terms and conditions of the [license](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/LICENSE), [acceptable use policy](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/USE_POLICY.md) and [Meta’s privacy policy](https://www.facebook.com/privacy/policy/)
+## RewardBench Primary Dataset LeaderBoard
  | Model  | Type of Data Used For Training |  Overall | Chat | Chat-Hard | Safety | Reasoning |
 |:-----------------------------|:----------------|:-----|:----------|:-------|:----------|:-----------------------|
 | _**Llama-3.1-Nemotron-70B-Reward**_ |Permissive Licensed Data Only (CC-BY-4.0) | **94.1** | **97.5** | 85.8 | **95.1** | **98.1** |