Xiaodong/Next-DPO-iter1
Safetensors
Xiaodong/DPO_sdf_17k
Branch: main
2 contributors
History: 4 commits
Latest commit: Xiaodong, "Update README.md" (920c957, verified), about 2 months ago
Name             Size      Last commit                      Updated
checkpoint-3000  -         upload DPO-reproduce checkpoint  about 2 months ago
.gitattributes   1.52 kB   initial commit                   about 2 months ago
README.md        76 Bytes  Update README.md                 about 2 months ago