Update README.md
README.md
CHANGED
@@ -35,7 +35,28 @@ This is the model card of a 🤗 transformers model that has been pushed on the
 
 ## Uses
 
-
+Example from: [peiyi9979/math-shepherd-mistral-7b-prm](https://huggingface.co/peiyi9979/math-shepherd-mistral-7b-prm):
+
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer
+import torch
+
+model_name = "plaguss/mistal-7b-prm-openrlhf"
+model = AutoModelForCausalLM.from_pretrained(model_name)
+tokenizer = AutoTokenizer.from_pretrained(model_name)
+
+# question, output1, output2, candidate_tokens and step_tag_id as in the linked example
+for output in [output1, output2]:
+    input_for_prm = f"{question} {output}"
+    input_id = torch.tensor([tokenizer.encode(input_for_prm)])
+    with torch.no_grad():
+        logits = model(input_id).logits[:, :, candidate_tokens]
+        scores = logits.softmax(dim=-1)[:, :, 0]
+        step_scores = scores[input_id == step_tag_id]
+        print(step_scores)
+
+# tensor([0.9982, 0.9780, 0.9969, 0.9983])
+# tensor([0.9982, 0.9780, 0.9969, 0.0441])
+```
 
 ### Direct Use
 
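The added example scores each candidate solution by taking a softmax over the logits of two candidate tokens ("good"/"bad") at every position, then keeping the resulting probabilities only at positions holding a special step-tag token. A minimal, model-free sketch of that scoring logic, using toy logits and hypothetical token ids (none of these values come from the model card):

```python
import math

# Toy stand-ins (hypothetical, for illustration only): per-position logits
# for the two candidate tokens, and the token ids of a 4-token input where
# step_tag_id marks the end of each reasoning step.
candidate_logits = [(2.0, 0.5), (0.1, 0.3), (1.5, -1.0), (0.0, 0.0)]
input_ids = [11, 99, 11, 42]   # positions 0 and 2 carry the step tag
step_tag_id = 11

def good_prob(good, bad):
    """Softmax over the two candidate logits; returns P(good token)."""
    m = max(good, bad)
    eg, eb = math.exp(good - m), math.exp(bad - m)
    return eg / (eg + eb)

# One "good" probability per position, then keep only step-tag positions.
scores = [good_prob(g, b) for g, b in candidate_logits]
step_scores = [s for s, t in zip(scores, input_ids) if t == step_tag_id]
print(step_scores)  # one probability per step-tag position
```

Restricting the softmax to the two candidate tokens is what turns a causal LM head into a per-step binary classifier; the step-tag mask then selects exactly one score per reasoning step.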