tFINE-680m-e32-d16-gqa-flan / all_results.json
amazingvince's picture
End of training
0c608d2 verified
raw
history blame contribute delete
296 Bytes
{
"epoch": 0.9999466719486831,
"num_input_tokens_seen": 2313940996,
"total_flos": 9.029436409798197e+18,
"train_loss": 0.7368571982491552,
"train_runtime": 130509.2663,
"train_samples": 4200414,
"train_samples_per_second": 32.185,
"train_steps_per_second": 0.126
}