Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
EllieS
/
zephyr-7b-dpo-lora-pubmedqa-ultrafeedback-mix
like
0
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
main
zephyr-7b-dpo-lora-pubmedqa-ultrafeedback-mix
/
adapter_model.safetensors
Commit History
Model save
b314125
verified
EllieS
commited on
Mar 5
Training in progress, step 15000
f1a9fbb
verified
EllieS
commited on
Mar 5
Training in progress, step 14000
3f8753c
verified
EllieS
commited on
Mar 5
Training in progress, step 13000
a09a103
verified
EllieS
commited on
Mar 5
Training in progress, step 12000
f3d4911
verified
EllieS
commited on
Mar 5
Training in progress, step 11000
9d3eb1f
verified
EllieS
commited on
Mar 5
Training in progress, step 10000
17c211a
verified
EllieS
commited on
Mar 5
Training in progress, step 9000
e358781
verified
EllieS
commited on
Mar 5
Training in progress, step 8000
7af5356
verified
EllieS
commited on
Mar 5
Training in progress, step 7000
915fb57
verified
EllieS
commited on
Mar 5
Training in progress, step 6000
49a8e2f
verified
EllieS
commited on
Mar 5
Training in progress, step 5000
3de6b79
verified
EllieS
commited on
Mar 5
Training in progress, step 4000
30a8b00
verified
EllieS
commited on
Mar 5
Training in progress, step 3000
ed231f4
verified
EllieS
commited on
Mar 5
Training in progress, step 2000
687d13f
verified
EllieS
commited on
Mar 5
Training in progress, step 1000
26ba566
verified
EllieS
commited on
Mar 5