Safetensors
OVM-Mistral-7b / README.md
OakYU's picture
add models
171155e
---
license: apache-2.0
---
The verifier model (`/mistral7b-ep2-n100-scahead-mse-lm-token`) and the generator model (`/mistral7b-ep2`) in GSM8K, finetuned from Mistral-7B. See the Llama2-7B version in [OVM-llama2-7b](https://huggingface.co/FreedomIntelligence/OVM-llama2-7b).
See the paper [Outcome-supervised Verifiers for Planning in Mathematical Reasoning](https://arxiv.org/pdf/2311.09724.pdf) and the code in [github](https://github.com/FreedomIntelligence/OVM)