alibaba-pai
/

DistilQwen2-7B-Instruct

Model card Files Files and versions Community

Bohr commited on Nov 4, 2024

Commit

7bce9e1

·

verified ·

1 Parent(s): 747e42f

Update README.md

Files changed (1) hide show

README.md +1 -2

README.md CHANGED Viewed

@@ -47,8 +47,7 @@ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
 ## 🔍 Evaluation
-We used single-turn instructions from MT-Bench as input for Qwen2-1.5B-Instruct and Qwen2-7B-Instruct. GPT4-turbo is used to evaluate the changes in the level of detail and truthfulness of responses to our model's revised instructions.
 | Model | AlpacaEval 2.0 (length-controlled) | MT-Bench | MT-Bench (single) | IFEval (instruction-loose) | IFEval (strict-prompt) |
 |------|-----------------------------------|----------|-------------------|---------------------------|------------------------|

 ## 🔍 Evaluation
+We evaluated our model on instruction-following leaderboards such as AlpacaEval, MT-Bench and IFEval.
 | Model | AlpacaEval 2.0 (length-controlled) | MT-Bench | MT-Bench (single) | IFEval (instruction-loose) | IFEval (strict-prompt) |
 |------|-----------------------------------|----------|-------------------|---------------------------|------------------------|