keminglu commited on
Commit
e0858ca
1 Parent(s): 78256f3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +27 -3
README.md CHANGED
@@ -1,3 +1,27 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+
5
+ see our paper in: [https://arxiv.org/abs/2310.05506](https://arxiv.org/abs/2310.05506)
6
+ ## Model Details
7
+ MuggleMATH is fully fine-tuned on the AugGSM8K and AugMATH datasets and based on the LLaMA-2 Models.
8
+ ## **Model Usage**
9
+ prompting template:
10
+ '''
11
+ "Below is an instruction that describes a task. " "Write a response that appropriately completes the request.\n\n" "### Instruction:\n{instruction}\n\n### Response:"
12
+ '''
13
+ We recommend using vllm to accelerate inference.
14
+ ## Experiment
15
+ | | GSM8K | MATH |
16
+ | --- | --- | --- |
17
+ | MuggleMATH-7B | 69.8 | 25.8 |
18
+ | MuggleMATH-13B | 74.3 | 30.7 |
19
+ | MuggleMATH-70B | 82.5 | 42.1 |
20
+
21
+ ## **Citation**
22
+ @misc{li2023query,
23
+ title={Query and Response Augmentation Cannot Help Out-of-domain Math Reasoning Generalization},
24
+ author={Chengpeng Li and Zheng Yuan and Hongyi Yuan and Guanting Dong and Keming Lu and Jiancan Wu and Chuanqi Tan and Xiang Wang and Chang Zhou},
25
+ journal={arXiv preprint arXiv: 2310.05506},
26
+ year={2023}
27
+ }