Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
DUAL-GPO
/
zephyr-7b-gpo-v3-2-i2
like
0
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
main
zephyr-7b-gpo-v3-2-i2
Commit History
End of training
c9e726a
verified
lole25
commited on
May 19
Model save
0ee6071
verified
lole25
commited on
May 19
Training in progress, step 3600
291388e
verified
lole25
commited on
May 19
Training in progress, step 3400
623ef19
verified
lole25
commited on
May 19
Training in progress, step 3300
db331ff
verified
lole25
commited on
May 19
Training in progress, step 3100
fd49b4e
verified
lole25
commited on
May 19
Training in progress, step 2900
af9e5ad
verified
lole25
commited on
May 19
Training in progress, step 2800
13ea6c9
verified
lole25
commited on
May 19
Training in progress, step 2700
0c16e2b
verified
lole25
commited on
May 19
Training in progress, step 2500
ae7ac20
verified
lole25
commited on
May 19
Training in progress, step 2400
6375717
verified
lole25
commited on
May 19
Training in progress, step 2300
240c41c
verified
lole25
commited on
May 19
Training in progress, step 2200
d62b7c3
verified
lole25
commited on
May 19
Training in progress, step 2100
373b9cc
verified
lole25
commited on
May 19
Training in progress, step 2000
168f4ec
verified
lole25
commited on
May 19
Training in progress, step 1900
a194866
verified
lole25
commited on
May 19
Training in progress, step 1800
d39929b
verified
lole25
commited on
May 19
Training in progress, step 1700
6f967fc
verified
lole25
commited on
May 19
Training in progress, step 1600
c2a7a1a
verified
lole25
commited on
May 19
Training in progress, step 1500
c27c7a3
verified
lole25
commited on
May 19
Training in progress, step 1400
b6ba78b
verified
lole25
commited on
May 19
Training in progress, step 1300
1064c6e
verified
lole25
commited on
May 19
Training in progress, step 1200
e6184e8
verified
lole25
commited on
May 19
Training in progress, step 1100
cc1e974
verified
lole25
commited on
May 19
Training in progress, step 1000
a4e6c91
verified
lole25
commited on
May 19
Training in progress, step 900
e7e5257
verified
lole25
commited on
May 19
Training in progress, step 800
072bf11
verified
lole25
commited on
May 19
Training in progress, step 700
4342d24
verified
lole25
commited on
May 19
Training in progress, step 600
86b4c45
verified
lole25
commited on
May 19
Training in progress, step 500
0f42be0
verified
lole25
commited on
May 19
Training in progress, step 400
d5c2d98
verified
lole25
commited on
May 19
Training in progress, step 300
6cf1b3a
verified
lole25
commited on
May 19
Training in progress, step 200
6cad95a
verified
lole25
commited on
May 19
Training in progress, step 100
301a79c
verified
lole25
commited on
May 19
initial commit
96d65a0
verified
lole25
commited on
May 19