---
license: apache-2.0
---

This repository contains the verifier model (`/mistral7b-ep2-n100-scahead-mse-lm-token`) and the generator model (`/mistral7b-ep2`) for GSM8K, both fine-tuned from Mistral-7B. For the Llama2-7B version, see [OVM-llama2-7b](https://huggingface.co/FreedomIntelligence/OVM-llama2-7b).

For details, see the paper [Outcome-supervised Verifiers for Planning in Mathematical Reasoning](https://arxiv.org/pdf/2311.09724.pdf) and the code on [GitHub](https://github.com/FreedomIntelligence/OVM).