logical-reasoning / data /Qwen2-7B-Instruct_shots_metrics.csv
dh-mc's picture
10-shot results ready for 7/8 B models
3db2ae5
raw
history blame
341 Bytes
shots,model,run,accuracy,precision,recall,f1,ratio_valid_classifications
0,Qwen2-7B-Instruct,Qwen/Qwen2-7B-Instruct/shots-00,0.683,0.7493103872717293,0.683,0.710140098232232,0.9996666666666667
10,Qwen2-7B-Instruct,Qwen/Qwen2-7B-Instruct/shots-10,0.5646666666666667,0.7391197908117386,0.5646666666666667,0.6064049121095652,0.9896666666666667