Update README.md
Browse files
README.md
CHANGED
@@ -28,7 +28,7 @@ For more details, read the paper:
|
|
28 |
- **Model type:** A reward model trained on UltraFeedback, designed to be used in RLHF training.
|
29 |
- **Language(s) (NLP):** English
|
30 |
- **License:** Apache 2.0.
|
31 |
-
- **Finetuned from model:** [
|
32 |
|
33 |
### Model Sources
|
34 |
|
|
|
28 |
- **Model type:** A reward model trained on UltraFeedback, designed to be used in RLHF training.
|
29 |
- **Language(s) (NLP):** English
|
30 |
- **License:** Apache 2.0.
|
31 |
+
- **Finetuned from model:** [allenai/llama-3-tulu-2-70b](https://huggingface.co/allenai/llama-3-tulu-2-70b)
|
32 |
|
33 |
### Model Sources
|
34 |
|