DPO-3-1k-25steps-nowarmup / model-00001-of-00004.safetensors

Commit History

(Trained with Unsloth)
381921d
verified

ksw1 commited on