longhoang06
commited on
Commit
·
2894f26
1
Parent(s):
d27cf2a
Update README.md
Browse files
README.md
CHANGED
@@ -1,6 +1,6 @@
|
|
1 |
---
|
2 |
library_name: peft
|
3 |
base_model: mistralai/Mistral-7B-v0.1
|
4 |
-
|
5 |
### Model Description
|
6 |
-
|
|
|
1 |
---
|
2 |
library_name: peft
|
3 |
base_model: mistralai/Mistral-7B-v0.1
|
4 |
+
# OVM generator
|
5 |
### Model Description
|
6 |
+
The model is trained based on [Outcome-supervised Verifiers for Planning in Mathematical Reasoning](https://arxiv.org/pdf/2311.09724v1.pdf) paper, trained on [Vi-GSM8K](https://huggingface.co/datasets/longhoang06/Vi-GSM8K) dataset
|