---
datasets:
- TIGER-Lab/MathInstruct
---
|
|
|
## Introduction
|
This model is trained with Masked Thought Fine-Tuning (MFT), a simple variant of standard Supervised Fine-Tuning (SFT) in which a fraction of the tokens in the reasoning steps is randomly masked during training. Please refer to our code and paper below for details.
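The core idea is sketched below in a minimal, self-contained form, assuming a causal-LM fine-tuning setup where the prompt and the reasoning/solution are concatenated into one sequence. The function name, masking probability, and choice of mask token are illustrative assumptions, not the exact settings used in the paper.

```python
import torch

def mask_thought(input_ids: torch.Tensor, prompt_len: int,
                 mask_token_id: int, mask_prob: float = 0.2) -> torch.Tensor:
    """Randomly replace a fraction of the solution (thought) tokens in the
    model input with a mask token. The labels are left unchanged, so the loss
    is still computed against the original tokens and the model must recover
    the masked reasoning steps from a partially corrupted context."""
    masked = input_ids.clone()
    positions = torch.arange(input_ids.size(0))
    # Only tokens after the prompt (the reasoning/solution part) are eligible.
    in_solution = positions >= prompt_len
    to_mask = (torch.rand(input_ids.size(0)) < mask_prob) & in_solution
    masked[to_mask] = mask_token_id
    return masked
```

In training, the masked sequence would be fed as `input_ids` while the original sequence serves as `labels`; everything else proceeds as ordinary SFT.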
|
|
|
## Links
|
- **Code**: [https://github.com/ChangyuChen347/MaskedThought](https://github.com/ChangyuChen347/MaskedThought)

- **Paper**: [https://arxiv.org/abs/2403.02178](https://arxiv.org/abs/2403.02178)
|
|
|
## Results
|
We evaluate the model with the hybrid decoding scripts provided in [MAmmoTH](https://github.com/TIGER-AI-Lab/MAmmoTH).
|
| Model | GSM8K (%) | MATH (%) |
|-------|-----------|----------|
| [adalaw/MAmmoTH-7B-Mistral-MFT](https://huggingface.co/adalaw/MAmmoTH-7B-Mistral-MFT) | 77.1 | 41.2 |
| [TIGER-Lab/MAmmoTH-7B-Mistral-SFT](https://huggingface.co/TIGER-Lab/MAmmoTH-7B-Mistral) | 75.0 | 40.0 |
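
For quick qualitative checks (as opposed to reproducing the numbers above, which use MAmmoTH's hybrid decoding scripts), the model can be loaded with standard `transformers` generation. The prompt and generation settings below are illustrative only; the model may expect the instruction format used during MAmmoTH training.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "adalaw/MAmmoTH-7B-Mistral-MFT"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"  # device_map requires `accelerate`
)

question = ("Natalia sold clips to 48 of her friends in April, and then she sold "
            "half as many clips in May. How many clips did Natalia sell altogether?")
inputs = tokenizer(question, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=512, do_sample=False)
# Print only the newly generated continuation, not the echoed prompt.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```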