TIGER-Lab
/

MAmmoTH-7B-Mistral

Inference Endpoints

Model card Files Files and versions Community

wenhu commited on Dec 6, 2023

Commit

e9e0143

•

1 Parent(s): 1dff1a3

Update README.md

Files changed (1) hide show

README.md +8 -13

README.md CHANGED Viewed

@@ -36,19 +36,14 @@ The models are fine-tuned with the MathInstruct dataset using the original Llama
 The models are evaluated using open-ended and multiple-choice math problems from several datasets. Here are the results:
-| **Model**             	| **Decoding** 	| **GSM**  	| **MATH** 	| **MMLU** 	|
-|---------------------------|---------------|----------|------------|-----------|
-| MAmmoTH-7B             	| CoT          	| 50.5     	| 10.4     	| 43.7     	|
-|                       	| PoT          	| 51.6     	| 28.7     	| 43.3     	|
-|                       	| **Hybrid**   	| 53.6  	| 31.5 	    | 44.5   	|
-| MAmmoTH-Coder-7B  	    | CoT          	| 22.4     	| 7.9      	| 36.2     	|
-|                       	| PoT          	| 58.8     	| 32.1     	| 47.2     	|
-|                       	| **Hybrid**   	| 59.4  	| 33.4  	| 47.2  	|
-| MAmmoTH-7B-Mistral  	    | CoT          	|        	|       	|       	|
-|                       	| PoT          	|        	|       	|        	|
-|                       	| **Hybrid**   	| **75.0** 	| **40.0** 	| **52.5**  |
 ## Usage
 You can use the models through Huggingface's Transformers library. Use the pipeline function to create a text-generation pipeline with the model of your choice, then feed in a math problem to get the solution.

 The models are evaluated using open-ended and multiple-choice math problems from several datasets. Here are the results:
+| **Model**             	| **Decoding** 	| **GSM**  	| **MATH** 	| **MMLU-Math** |
+|---------------------------|---------------|-----------|-----------|-----------|
+| MAmmoTH-7B             	| **Hybrid**   	| 53.6  	| 31.5 	    | 44.5   	|
+| MAmmoTH-Coder-7B  	    | **Hybrid**   	| 59.4  	| 33.4  	| 47.2  	|
+| MetaMath-7B-Mistral       | **CoT**   	| **77.7** 	| 28.2 	    | 49.3      |
+| OpenChat-3.5-7B           | **CoT**   	| 77.3 	    | 28.6 	    | 49.6      |
+| DeepSeek-Coder-34B        | **PoT**   	| 58.2   	| 35.3 	    | 46.5      |
+| MAmmoTH-7B-Mistral  	    | **Hybrid**   	| 75.0   	| **40.0** 	| **52.5**  |
 ## Usage
 You can use the models through Huggingface's Transformers library. Use the pipeline function to create a text-generation pipeline with the model of your choice, then feed in a math problem to get the solution.