longt5_xl_sfd_memsum_40 / all_results.json
learn3r's picture
End of training
a2b5bb6 verified
raw
history blame contribute delete
361 Bytes
{
"epoch": 38.96,
"eval_loss": 5.267875671386719,
"eval_runtime": 14.0885,
"eval_samples": 338,
"eval_samples_per_second": 23.991,
"eval_steps_per_second": 3.052,
"train_loss": 0.32971865670822026,
"train_runtime": 25027.7089,
"train_samples": 3673,
"train_samples_per_second": 5.87,
"train_steps_per_second": 0.045
}