iteration,consumed_tokens,elapsed_time_per_iteration_ms,tokens_per_sec,tokens_per_sec_per_gpu,global_batch_size,lm_loss,lr,model_tflops_per_gpu,hardware_tflops_per_gpu,grad_norm,memory_usage_MiB,peak_allocated_MiB,peak_reserved_MiB | |
1,4190000.0000000005,83600.0,50200.0,3140.0,1020.0,11.1,0.0001,28.5,28.5,24.9,3168.14,4459.01,13244.0 | |