NeMo
English
nvidia
llama3.1
reward model
zhilinw commited on
Commit
838ba71
·
verified ·
1 Parent(s): 8ac3d10

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -26,7 +26,7 @@ By accessing this model, you are agreeing to the LLama 3.1 terms and conditions
26
 
27
  ## RewardBench Primary Dataset LeaderBoard
28
 
29
- As of 27 Sept 2024, Llama-3.1-Nemotron-70B-Reward performs best Overall on RewardBench as well as in Chat, Safety and Reasoning category.
30
 
31
  | Model | Type of Data Used For Training | Overall | Chat | Chat-Hard | Safety | Reasoning |
32
  |:-----------------------------|:----------------|:-----|:----------|:-------|:----------|:-----------------------|
 
26
 
27
  ## RewardBench Primary Dataset LeaderBoard
28
 
29
+ As of 27 Sept 2024, Llama-3.1-Nemotron-70B-Reward performs best Overall on RewardBench as well as with strong performance in Chat, Safety and Reasoning categories among the models below.
30
 
31
  | Model | Type of Data Used For Training | Overall | Chat | Chat-Hard | Safety | Reasoning |
32
  |:-----------------------------|:----------------|:-----|:----------|:-------|:----------|:-----------------------|