Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
chrlu
/
zephyr-7b-dpo-qlora
like
0
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
trl
dpo
Generated from Trainer
4-bit precision
bitsandbytes
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
main
zephyr-7b-dpo-qlora
Commit History
End of training
e4b7667
verified
chrlu
commited on
Apr 27
Model save
60a5aab
verified
chrlu
commited on
Apr 27
Training in progress, step 1900
4895620
verified
chrlu
commited on
Apr 27
Training in progress, step 1800
79a685f
verified
chrlu
commited on
Apr 27
Training in progress, step 1700
d5785d8
verified
chrlu
commited on
Apr 27
Training in progress, step 1600
8d13dc5
verified
chrlu
commited on
Apr 27
Training in progress, step 1500
443ee46
verified
chrlu
commited on
Apr 27
Training in progress, step 1400
4bf7948
verified
chrlu
commited on
Apr 27
Training in progress, step 1300
8c23ac6
verified
chrlu
commited on
Apr 27
Training in progress, step 1200
6575778
verified
chrlu
commited on
Apr 27
Training in progress, step 1100
d869c8c
verified
chrlu
commited on
Apr 27
Training in progress, step 1000
b74f51b
verified
chrlu
commited on
Apr 27
Training in progress, step 900
f6d10c0
verified
chrlu
commited on
Apr 27
Training in progress, step 800
54e9ff0
verified
chrlu
commited on
Apr 27
Training in progress, step 700
8428f95
verified
chrlu
commited on
Apr 27
Training in progress, step 600
382caf0
verified
chrlu
commited on
Apr 27
Training in progress, step 500
2d9574e
verified
chrlu
commited on
Apr 27
Training in progress, step 400
6d2e1ad
verified
chrlu
commited on
Apr 27
Training in progress, step 300
4d86d8b
verified
chrlu
commited on
Apr 27
Training in progress, step 200
a8f661d
verified
chrlu
commited on
Apr 27
Training in progress, step 100
e198ffc
verified
chrlu
commited on
Apr 27
initial commit
a6891b4
verified
chrlu
commited on
Apr 26