DUAL-GPO-2/phi-2-irepo-chatml-v9-i1
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
phi
alignment-handbook
Generated from Trainer
trl
dpo
custom_code
main
Commit History
f86d371  End of training (verified; lole25 committed on May 20)
b7fe50c  Model save (verified; lole25 committed on May 20)
cdcdbd4  Training in progress, step 1800 (verified; lole25 committed on May 20)
6c1e070  Training in progress, step 1700 (verified; lole25 committed on May 20)
d119678  Training in progress, step 1500 (verified; lole25 committed on May 20)
f5d3cdc  Training in progress, step 1400 (verified; lole25 committed on May 20)
1a5783d  Training in progress, step 1300 (verified; lole25 committed on May 20)
2a2199d  Training in progress, step 1200 (verified; lole25 committed on May 20)
1e25128  Training in progress, step 1100 (verified; lole25 committed on May 20)
d524774  Training in progress, step 1000 (verified; lole25 committed on May 20)
f25e6b5  Training in progress, step 900 (verified; lole25 committed on May 20)
baee088  Training in progress, step 800 (verified; lole25 committed on May 20)
29bc4e1  Training in progress, step 700 (verified; lole25 committed on May 20)
2fac341  Training in progress, step 600 (verified; lole25 committed on May 20)
be47031  Training in progress, step 500 (verified; lole25 committed on May 20)
eb49ff9  Training in progress, step 400 (verified; lole25 committed on May 20)
c396fb1  Training in progress, step 300 (verified; lole25 committed on May 20)
cd048f5  Training in progress, step 200 (verified; lole25 committed on May 20)
eee1790  Training in progress, step 100 (verified; lole25 committed on May 20)
4f15868  initial commit (verified; lole25 committed on May 20)