logical-reasoning / data /Llama3.1-8B-Chinese-Chat_metrics.csv
dh-mc's picture
ready for bf16 tuning
e656f92
raw
history blame
872 Bytes
epoch,model,accuracy,precision,recall,f1
0.0,shenzhi-wang/Llama3.1-8B-Chinese-Chat_torch.float16_lf,0.23666666666666666,0.7457179631400438,0.23666666666666666,0.33962354850065374
0.2,shenzhi-wang/Llama3.1-8B-Chinese-Chat/checkpoint-35_torch.float16_lf,0.6256666666666667,0.827414387212707,0.6256666666666667,0.6935695138877099
0.4,shenzhi-wang/Llama3.1-8B-Chinese-Chat/checkpoint-70_torch.float16_lf,0.762,0.7899461556934093,0.762,0.7667008346960339
0.6,shenzhi-wang/Llama3.1-8B-Chinese-Chat/checkpoint-105_torch.float16_lf,0.6803333333333333,0.79802978899557,0.6803333333333333,0.7212437740051865
0.8,shenzhi-wang/Llama3.1-8B-Chinese-Chat/checkpoint-140_torch.float16_lf,0.7523333333333333,0.8074258170836324,0.7523333333333333,0.7736442997308933
1.0,shenzhi-wang/Llama3.1-8B-Chinese-Chat/checkpoint-175_torch.float16_lf,0.737,0.8090588922502886,0.737,0.7637837184140026