Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
THU-KEG
/
Mistral-Crab-DPO
like
4
Follow
Knowledge Engineer Group @ Tsinghua University
23
Text Generation
Safetensors
English
mistral
alignment-handbook
Generated from Trainer
conversational
arxiv:
2410.24175
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
6666436
Mistral-Crab-DPO
Commit History
Update README.md
6666436
verified
Kikkk
commited on
Nov 1, 2024
commit
c1412d7
Kikkk
commited on
Nov 1, 2024
initial commit
16b295e
verified
Kikkk
commited on
Nov 1, 2024