TTTXXX01/DPO-Zephyr-7B-baseline
Text Generation
Transformers
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
License: mit
main
DPO-Zephyr-7B-baseline
Commit History
Training in progress, step 1900
b6adea8
verified
TTTXXX01
committed on
Jun 2
Training in progress, step 1800
6747241
verified
TTTXXX01
committed on
Jun 2
Training in progress, step 1700
7a4e6f4
verified
TTTXXX01
committed on
Jun 2
Training in progress, step 1600
a3279ae
verified
TTTXXX01
committed on
Jun 2
Training in progress, step 1500
a3ea139
verified
TTTXXX01
committed on
Jun 2
Training in progress, step 1400
9f32122
verified
TTTXXX01
committed on
Jun 2
Training in progress, step 1300
93bfb27
verified
TTTXXX01
committed on
Jun 2
Training in progress, step 1200
b273439
verified
TTTXXX01
committed on
Jun 2
Training in progress, step 1100
ab191ce
verified
TTTXXX01
committed on
Jun 2
Training in progress, step 1000
d30dfbc
verified
TTTXXX01
committed on
Jun 2
Training in progress, step 900
0973733
verified
TTTXXX01
committed on
Jun 2
Training in progress, step 800
3f11b1f
verified
TTTXXX01
committed on
Jun 2
Training in progress, step 700
bf7dd45
verified
TTTXXX01
committed on
Jun 2
Training in progress, step 600
a14177e
verified
TTTXXX01
committed on
Jun 2
Training in progress, step 500
116bc4e
verified
TTTXXX01
committed on
Jun 2
Training in progress, step 400
25a90d5
verified
TTTXXX01
committed on
Jun 2
Training in progress, step 300
8c4d4ad
verified
TTTXXX01
committed on
Jun 2
Training in progress, step 200
fc245f3
verified
TTTXXX01
committed on
Jun 2
Training in progress, step 100
fb02c2b
verified
TTTXXX01
committed on
Jun 2
initial commit
dbfb0e1
verified
TTTXXX01
committed on
Jun 2