dball/zephyr-7b-dpo-qlora-no-sft

alignment-handbook

Generated from Trainer

4-bit precision


zephyr-7b-dpo-qlora-no-sft/runs/Feb08_09-39-21_7dec04cc21c9

1 contributor

History: 72 commits


Model save

42d7b6a verified 9 months ago

events.out.tfevents.1707385209.7dec04cc21c9.25454.0    546 kB       LFS    Model save    9 months ago
events.out.tfevents.1707549958.7dec04cc21c9.25454.1    828 Bytes    LFS    Model save    9 months ago