llama-3-8b-instruct-sppo-iter1 / model-00003-of-00004.safetensors

Commit History

End of training
0f9de24
verified

jcmei committed on