OVM-Mistral-7b / README.md
license: apache-2.0

This repository contains the verifier model (/mistral7b-ep2-n100-scahead-mse-lm-token) and the generator model (/mistral7b-ep2) for GSM8K, both fine-tuned from Mistral-7B. For the Llama2-7B version, see OVM-llama2-7b.
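As a rough illustration of how the two subfolders might be used, the sketch below loads the generator with the Hugging Face `transformers` library. It assumes each checkpoint sits in a subfolder of the repository named as above and that tokenizer files are stored alongside the weights; neither is confirmed here. `load_generator` and the `repo_id` argument are hypothetical names introduced for this example, and the caller must supply the actual repository id. The verifier is not loaded in this sketch, since a plain causal LM loader may not restore its scoring head; the project's code is the reference for that.

```python
# Subfolder names taken from the description above (leading slash dropped).
GENERATOR_SUBFOLDER = "mistral7b-ep2"
VERIFIER_SUBFOLDER = "mistral7b-ep2-n100-scahead-mse-lm-token"


def load_generator(repo_id: str):
    """Load the generator checkpoint from its subfolder of `repo_id`.

    `repo_id` is supplied by the caller; this sketch does not assume its value.
    """
    # Imported lazily so the constants can be used without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: tokenizer files live in the same subfolder as the weights.
    tokenizer = AutoTokenizer.from_pretrained(repo_id, subfolder=GENERATOR_SUBFOLDER)
    model = AutoModelForCausalLM.from_pretrained(repo_id, subfolder=GENERATOR_SUBFOLDER)
    return tokenizer, model
```

Calling `load_generator("<owner>/<repo>")` with the real repository id would download the weights on first use.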

See the paper "Outcome-supervised Verifiers for Planning in Mathematical Reasoning" and the code on GitHub.