hendrydong/Mistral-RM-for-RAFT-GSHF-v0


hendrydong committed on Mar 22

Commit 524cd28 • 1 Parent(s): d6f5e57

Update README.md

Files changed (1)

README.md +1 -1

README.md CHANGED

@@ -1,4 +1,4 @@
-The reward model may be used for iterative SFT/DPO
+The reward model can be used for iterative SFT/DPO
 ```
 @article{dong2023raft,