logical-reasoning/data/Mistral-7B-v0.3-Chinese-Chat_metrics.csv

Commit History

ready for qwen2.5
d5ab5d2

dh-mc committed on

10-shot results ready for 7/8 B models
3db2ae5

dh-mc committed on

completed eval/analysis
468b88d

dh-mc committed on

open source LLM results almost done
5a8f8d2

dh-mc committed on