llama3-sudo-dpo-5epochs-forget10mix400-1sft-2fullpara / model-00001-of-00004.safetensors

Commit History

Training in progress, step 125
170d3f1
verified

Qin Liu commited on