RuiyangSun commited on
Commit
32e35c1
1 Parent(s): 588a9a4

docs: update readme

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -20,7 +20,7 @@ library_name: safe-rlhf
20
 
21
  ## Model Details
22
 
23
- The Beaver Cost model is a preference model trained using the [PKU-SafeRLHF](https://huggingface.co/datasets/PKU-Alignment/PKU-SafeRLHF dataset).
24
  It can play a role in the safe RLHF algorithm, helping the Beaver model become more safe and harmless.
25
 
26
  - **Developed by:** the [PKU-Alignment](https://github.com/PKU-Alignment) Team.
 
20
 
21
  ## Model Details
22
 
23
+ The Beaver Cost model is a preference model trained using the [PKU-SafeRLHF](https://huggingface.co/datasets/PKU-Alignment/PKU-SafeRLHF) dataset.
24
  It can play a role in the safe RLHF algorithm, helping the Beaver model become more safe and harmless.
25
 
26
  - **Developed by:** the [PKU-Alignment](https://github.com/PKU-Alignment) Team.