noneUsername/Mistral-Small-Drummer-22B-W8A8

vllm (pretrained=/root/autodl-tmp/Mistral-Small-Drummer-22B,add_bos_token=true,tensor_parallel_size=4,max_model_len=2048,dtype=bfloat16), gen_kwargs: (None), limit: 250.0, num_fewshot: 5, batch_size: auto

Tasks	Version	Filter	n-shot	Metric		Value		Stderr
gsm8k	3	flexible-extract	5	exact_match	↑	0.840	±	0.0232
		strict-match	5	exact_match	↑	0.832	±	0.0237

Tasks	Version	Filter	n-shot	Metric		Value		Stderr
gsm8k	3	flexible-extract	5	exact_match	↑	0.842	±	0.0163
		strict-match	5	exact_match	↑	0.830	±	0.0168

vllm (pretrained=/root/autodl-tmp/Mistral-Small-Drummer-22B-89,add_bos_token=true,tensor_parallel_size=4,max_model_len=2048,dtype=bfloat16), gen_kwargs: (None), limit: 250.0, num_fewshot: 5, batch_size: auto

Tasks	Version	Filter	n-shot	Metric		Value		Stderr
gsm8k	3	flexible-extract	5	exact_match	↑	0.860	±	0.0220
		strict-match	5	exact_match	↑	0.852	±	0.0225

Tasks	Version	Filter	n-shot	Metric		Value		Stderr
gsm8k	3	flexible-extract	5	exact_match	↑	0.854	±	0.0158
		strict-match	5	exact_match	↑	0.840	±	0.0164

noneUsername
/

Mistral-Small-Drummer-22B-W8A8

Model tree for noneUsername/Mistral-Small-Drummer-22B-W8A8