Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
THU-KEG
/
Mistral-Crab-DPO
like
2
Follow
Knowledge Engineer Group @ Tsinghua University
19
Text Generation
Safetensors
English
mistral
alignment-handbook
Generated from Trainer
conversational
arxiv:
2410.24175
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
main
Mistral-Crab-DPO
Commit History
Update README.md
1d83ca3
verified
Kikkk
commited on
14 days ago
Update README.md
6666436
verified
Kikkk
commited on
14 days ago
commit
c1412d7
Kikkk
commited on
14 days ago
initial commit
16b295e
verified
Kikkk
commited on
14 days ago