Update README.md

README.md CHANGED
@@ -1,3 +1,18 @@
---
datasets:
- TIGER-Lab/MathInstruct
---

## Introduction

The model is trained with Masked Thought Fine-Tuning (MFT), a simple variant of standard Supervised Fine-Tuning (SFT). See the code and paper linked below; a minimal sketch of the idea follows.
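
MFT, per the paper, randomly replaces a fraction of the reasoning ("thought") tokens in the training input with a mask token while leaving the labels untouched, so the model learns to predict each step from a partially corrupted context. The sketch below is a minimal illustration of that idea, not the repository's actual implementation: the helper name, the `-100` label convention, and the 0.2 mask ratio are assumptions.

```python
# Minimal sketch of the masked-thought idea, assuming a standard causal-LM SFT
# batch where prompt positions carry label -100 and solution/CoT positions carry
# real token ids. Helper name and mask ratio are illustrative, not the repo's API.
import torch

def mask_thought_tokens(input_ids: torch.Tensor,
                        labels: torch.Tensor,
                        mask_token_id: int,
                        mask_ratio: float = 0.2) -> torch.Tensor:
    """Replace a random fraction of the supervised (thought) input tokens with a
    mask token. Labels stay unchanged, so the loss still targets the original
    tokens while the visible context is partially corrupted."""
    input_ids = input_ids.clone()
    thought = labels != -100  # positions belonging to the reasoning/solution
    corrupt = (torch.rand(input_ids.shape, device=input_ids.device) < mask_ratio) & thought
    input_ids[corrupt] = mask_token_id
    return input_ids

# Usage inside a training step (sketch); the tokenizer may need a mask token
# added first, e.g. tokenizer.add_special_tokens({"mask_token": "<mask>"}).
# masked = mask_thought_tokens(batch["input_ids"], batch["labels"], mask_id)
# loss = model(input_ids=masked, labels=batch["labels"]).loss
```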

## Links

- **Code**: [https://github.com/ChangyuChen347/MaskedThought](https://github.com/ChangyuChen347/MaskedThought)
- **Paper**: [https://arxiv.org/abs/2403.02178](https://arxiv.org/abs/2403.02178)

## Results

We evaluate the model with the hybrid decoding scripts provided in [MAmmoTH](https://github.com/TIGER-AI-Lab/MAmmoTH).

| Model | GSM8K | MATH |
|-------|-------|------|
| [adalaw/MAmmoTH-7B-Mistral-MFT](https://huggingface.co/adalaw/MAmmoTH-7B-Mistral-MFT) | 77.1 | 41.2 |
| [TIGER-Lab/MAmmoTH-7B-Mistral-SFT](https://huggingface.co/TIGER-Lab/MAmmoTH-7B-Mistral) | 75.0 | 40.0 |
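
Hybrid decoding in MAmmoTH combines program-of-thought (PoT) and chain-of-thought (CoT) answers. The sketch below assumes a PoT-first scheme that falls back to CoT when the generated program fails to execute; the prompt suffixes, the `ans` result-variable convention, and the generation settings are illustrative assumptions, not the actual scripts. Use the MAmmoTH repository's scripts to reproduce the numbers above.

```python
# Rough sketch of PoT-first hybrid decoding with a CoT fallback; prompt suffixes
# and the `ans` result-variable convention are assumptions, not MAmmoTH's scripts.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "adalaw/MAmmoTH-7B-Mistral-MFT"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

def generate(prompt: str) -> str:
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    out = model.generate(**inputs, max_new_tokens=512)
    return tokenizer.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)

def hybrid_decode(question: str) -> str:
    # First attempt: ask for a program and execute it (unsandboxed exec is for
    # illustration only).
    program = generate(question + "\nLet's write a program.\n")
    try:
        scope = {}
        exec(program, scope)  # assumes the program leaves its result in `ans`
        return str(scope["ans"])
    except Exception:
        # Fallback: plain chain-of-thought answer.
        return generate(question + "\nLet's think step by step.\n")

print(hybrid_decode("A robe takes 2 bolts of blue fiber and half that much white fiber. How many bolts in total does it take?"))
```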