Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
just1nseo
/
zephyr-dpo-qlora-uf-ours-5e-7-epoch1
like
0
PEFT
TensorBoard
Safetensors
generation/UF
mistral
alignment-handbook
trl
dpo
Generated from Trainer
4-bit precision
bitsandbytes
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
main
zephyr-dpo-qlora-uf-ours-5e-7-epoch1
/
runs
/
Jul29_11-02-19_notebook-deployment-48-7d9b6c99-khd85
/
events.out.tfevents.1722251039.notebook-deployment-48-7d9b6c99-khd85.3446412.0
Commit History
Model save
6da2f8e
verified
just1nseo
commited on
Jul 29
Training in progress, step 300
0cef3e4
verified
just1nseo
commited on
Jul 29
Training in progress, step 200
487d1a1
verified
just1nseo
commited on
Jul 29
Training in progress, step 100
ba95ad1
verified
just1nseo
commited on
Jul 29