Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
statking
/
Meta-Llama-3-8B-Instruct-ORPO-QLoRA
like
0
PEFT
Safetensors
HuggingFaceH4/ultrafeedback_binarized
llama
alignment-handbook
trl
orpo
Generated from Trainer
License:
llama3
Model card
Files
Files and versions
Community
Train
Use this model
main
Meta-Llama-3-8B-Instruct-ORPO-QLoRA
Commit History
End of training
7986c4c
verified
statking
commited on
May 21
Model save
d21fe82
verified
statking
commited on
May 21
Training in progress, step 1900
de4580b
verified
statking
commited on
May 21
Training in progress, step 1800
3139762
verified
statking
commited on
May 21
Training in progress, step 1700
cd41a23
verified
statking
commited on
May 21
Training in progress, step 1600
f48378e
verified
statking
commited on
May 21
Training in progress, step 1500
568cb91
verified
statking
commited on
May 21
Training in progress, step 1400
eee18c3
verified
statking
commited on
May 21
Training in progress, step 1300
6201d98
verified
statking
commited on
May 21
Training in progress, step 1200
bf55e07
verified
statking
commited on
May 21
Training in progress, step 1100
145c5b5
verified
statking
commited on
May 21
Training in progress, step 1000
1468f54
verified
statking
commited on
May 21
Training in progress, step 900
8966322
verified
statking
commited on
May 21
Training in progress, step 800
405464f
verified
statking
commited on
May 21
Training in progress, step 700
8fa359d
verified
statking
commited on
May 21
Training in progress, step 600
0499a63
verified
statking
commited on
May 21
Training in progress, step 500
2fcc57e
verified
statking
commited on
May 21
Training in progress, step 400
ee6bea8
verified
statking
commited on
May 21
Training in progress, step 300
dafc170
verified
statking
commited on
May 21
Training in progress, step 200
b799d74
verified
statking
commited on
May 21
Training in progress, step 100
a34ec4d
verified
statking
commited on
May 21
initial commit
2717d03
verified
statking
commited on
May 21