PEFT
Safetensors
mixtral
alignment-handbook
trl
dpo
Generated from Trainer
4-bit precision
bitsandbytes

Commit History

End of training
8a75540
verified

jan-hq commited on

Model save
c7f70d3
verified

jan-hq commited on