jamba-900M-v0.13-KIx2 / train_results.json
pszemraj's picture
End of training
1058833 verified
raw
history blame contribute delete
289 Bytes
{
"epoch": 1.997349589186324,
"num_input_tokens_seen": 1975517184,
"total_flos": 9.732316690586272e+18,
"train_loss": 3.186332613039928,
"train_runtime": 34789.3198,
"train_samples": 60368,
"train_samples_per_second": 3.47,
"train_steps_per_second": 0.027
}