gabrielmbmb/smollm2-1.7B-8k-mix7-ep2-v2-qlora-r16-a16-lr3e4-mix1-dpo

alignment-handbook

Generated from Trainer

4-bit precision

smollm2-1.7B-8k-mix7-ep2-v2-qlora-r16-a16-lr3e4-mix1-dpo / vocab.json

gabrielmbmb HF staff

Training in progress, step 100

2c8f376 verified 20 days ago

801 kB
