THU-KEG/Mistral-Crab-DPO
Knowledge Engineer Group @ Tsinghua University
Tags: Text Generation, Safetensors, English, mistral, alignment-handbook, Generated from Trainer, conversational
arxiv: 2410.24175
License: apache-2.0
Commit History
1d83ca3 (verified): Update README.md. Kikkk committed on Nov 1, 2024
6666436 (verified): Update README.md. Kikkk committed on Nov 1, 2024
c1412d7: commit. Kikkk committed on Nov 1, 2024
16b295e (verified): initial commit. Kikkk committed on Nov 1, 2024