v1_1000_STEPS_1e5_rate_03_beta_DPO / model-00003-of-00003.safetensors

Commit History

End of training
b36409d
verified

tsavage68 commited on