Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
DUAL-GPO
/
phi-2-ipo-chatml
like
0
Follow
DUAL Group
2
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
phi
alignment-handbook
Generated from Trainer
trl
dpo
custom_code
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Use this model
main
phi-2-ipo-chatml
/
adapter_model.safetensors
Commit History
Model save
21f5880
verified
lole25
commited on
May 19
Training in progress, step 1900
b7f762d
verified
lole25
commited on
May 19
Training in progress, step 1800
e15cbdf
verified
lole25
commited on
May 19
Training in progress, step 1700
08d9ece
verified
lole25
commited on
May 19
Training in progress, step 1500
23afdd1
verified
lole25
commited on
May 19
Training in progress, step 1400
a72833e
verified
lole25
commited on
May 19
Training in progress, step 1300
711c1a0
verified
lole25
commited on
May 19
Training in progress, step 1200
0cae008
verified
lole25
commited on
May 19
Training in progress, step 1100
a6066bc
verified
lole25
commited on
May 19
Training in progress, step 1000
5551a6c
verified
lole25
commited on
May 19
Training in progress, step 900
d62a216
verified
lole25
commited on
May 19
Training in progress, step 800
6184dd0
verified
lole25
commited on
May 19
Training in progress, step 700
479a94c
verified
lole25
commited on
May 19
Training in progress, step 600
d3e1bcf
verified
lole25
commited on
May 19
Training in progress, step 500
ca8d06d
verified
lole25
commited on
May 19
Training in progress, step 400
01bfda6
verified
lole25
commited on
May 19
Training in progress, step 300
efc5599
verified
lole25
commited on
May 19
Training in progress, step 200
db37c63
verified
lole25
commited on
May 19
Training in progress, step 100
56eeb1b
verified
lole25
commited on
May 19