reward-gpt-duplicate-answer / checkpoint-400 /model-00002-of-00002.safetensors

Commit History

Training in progress, step 400, checkpoint
43e61a2

bradmin commited on