PEFT
Safetensors
mixtral
alignment-handbook
trl
dpo
Generated from Trainer
4-bit precision
bitsandbytes
stealth-finance-v2-dpo-adapter / train_results.json

Commit History

Model save
c7f70d3
verified

jan-hq committed on