Qwen/Qwen2-1.5B-Instruct

yangapku committed on Jun 6, 2024

Commit 2ffea94 · verified · 1 parent: 1527f49

Update README.md

Files changed (1)

README.md +12 -0

@@ -69,6 +69,18 @@ generated_ids = [
 response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
 ```
 ## Citation
 If you find our work helpful, feel free to give us a cite.

 response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
 ```
+## Evaluation
+We briefly compare Qwen2-1.5B-Instruct with Qwen1.5-1.8B-Chat. The results are as follows:
+| Datasets | Qwen1.5-0.5B-Chat | **Qwen2-0.5B-Instruct** | Qwen1.5-1.8B-Chat | **Qwen2-1.5B-Instruct** |
+| :--- | :---: | :---: | :---: | :---: |
+| MMLU | 35.0 | **37.9** | 43.7 | **52.4** |
+| HumanEval | 9.1 | **17.1** | 25.0 | **37.8** |
+| GSM8K | 11.3 | **40.1** | 35.3 | **61.6** |
+| C-Eval | 37.2 | **45.2** | 55.3 | **63.8** |
+| IFEval (Prompt Strict-Acc.) | 14.6 | **20.0** | 16.8 | **29.0** |
 ## Citation
 If you find our work helpful, feel free to give us a cite.
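
As a quick reading aid for the Evaluation table added in this commit, the sketch below computes the absolute point gain of Qwen2-1.5B-Instruct over Qwen1.5-1.8B-Chat on each dataset. The scores are copied from the table; the names `SCORES` and `point_gains` are illustrative assumptions and are not part of the README or of any Qwen tooling.

```python
# Scores copied from the Evaluation table in this commit:
# dataset -> (Qwen1.5-1.8B-Chat, Qwen2-1.5B-Instruct)
SCORES = {
    "MMLU": (43.7, 52.4),
    "HumanEval": (25.0, 37.8),
    "GSM8K": (35.3, 61.6),
    "C-Eval": (55.3, 63.8),
    "IFEval (Prompt Strict-Acc.)": (16.8, 29.0),
}


def point_gains(table):
    """Return the absolute gain (new minus old) per dataset, rounded to 0.1."""
    return {name: round(new - old, 1) for name, (old, new) in table.items()}


if __name__ == "__main__":
    # Print one line per dataset, e.g. "MMLU: +8.7"
    for name, delta in point_gains(SCORES).items():
        print(f"{name}: +{delta}")
```

Under these numbers the largest gain is on GSM8K and the smallest on C-Eval. These are simple differences of the reported scores, not a statistical comparison.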