nicolinho committed
Commit 26fa9d4 · verified · 1 Parent(s): 35c5679

Update README.md

Files changed (1)
  1. README.md +1 -0
README.md CHANGED
@@ -4,6 +4,7 @@ license: llama3
  # Quantile Regression for Distributional Reward Models in RLHF

+ # (This is an old version. The new one trained on the decontaminated version of the Skywork dataset is [nicolinho/QRM-Llama3.1-8B-v2](https://huggingface.co/nicolinho/QRM-Llama3.1-8B-v2))