hendrydong commited on
Commit
524cd28
1 Parent(s): d6f5e57

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -1,4 +1,4 @@
1
- The reward model may be used for iterative SFT/DPO
2
 
3
  ```
4
  @article{dong2023raft,
 
1
+ The reward model can be used for iterative SFT/DPO
2
 
3
  ```
4
  @article{dong2023raft,