floleuerer
/

SausageLM-7b-Instruct-v0.01-dpo-qlora

alignment-handbook

Generated from Trainer

4-bit precision

Model card Files Files and versions Metrics Training metrics Community

SausageLM-7b-Instruct-v0.01-dpo-qlora / runs /Jan14_23-44-38_bronxx

1 contributor

History: 10 commits

floleuerer's picture

Model save

2c6e000 verified 10 months ago

events.out.tfevents.1705272324.bronxx.25818.0

257 kB
LFS

Model save 10 months ago
events.out.tfevents.1705364890.bronxx.25818.1

828 Bytes
LFS

Model save 10 months ago