allenai
/

llama-3-tulu-2-70b-uf-mean-rm

Text Classification

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

hamishivi commited on Jun 20, 2024

Commit

d53ea25

·

verified ·

1 Parent(s): 1c48be3

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -28,7 +28,7 @@ For more details, read the paper:
 - **Model type:** A reward model trained on UltraFeedback, designed to be used in RLHF training.
 - **Language(s) (NLP):** English
 - **License:** Apache 2.0.
-- **Finetuned from model:** [meta-llama/Meta-Llama-3-70B](https://huggingface.co/meta-llama/Meta-Llama-3-70B)
 ### Model Sources

 - **Model type:** A reward model trained on UltraFeedback, designed to be used in RLHF training.
 - **Language(s) (NLP):** English
 - **License:** Apache 2.0.
+- **Finetuned from model:** [allenai/llama-3-tulu-2-70b](https://huggingface.co/allenai/llama-3-tulu-2-70b)
 ### Model Sources