license: apache-2.0 | |
The verifier model (`/mistral7b-ep2-n100-scahead-mse-lm-token`) and the generator model (`/mistral7b-ep2`) in GSM8K, finetuned from Mistral-7B. See the Llama2-7B version in [OVM-llama2-7b](https://huggingface.co/FreedomIntelligence/OVM-llama2-7b). | |
See the paper [Outcome-supervised Verifiers for Planning in Mathematical Reasoning](https://arxiv.org/pdf/2311.09724.pdf) and the code in [github](https://github.com/FreedomIntelligence/OVM) |