natolambert
commited on
Commit
•
ed16467
1
Parent(s):
de16483
Update README.md
Browse files
README.md
CHANGED
@@ -10,7 +10,7 @@ tags:
|
|
10 |
|
11 |
# Llama-se-rl-peft
|
12 |
Adapter weights of an RL fine-tuned model based on LLaMa. Authored by Edward Beeching, Younes Belkada, Kashif Rasul, Lewis Tunstall and Leandro von Werra.
|
13 |
-
For more info check out the [blog post]() and [github example]().
|
14 |
|
15 |
|
16 |
## Model Description
|
|
|
10 |
|
11 |
# Llama-se-rl-peft
|
12 |
Adapter weights of an RL fine-tuned model based on LLaMa. Authored by Edward Beeching, Younes Belkada, Kashif Rasul, Lewis Tunstall and Leandro von Werra.
|
13 |
+
For more info check out the [blog post](https://huggingface.co/blog/stackllama) and [github example](https://github.com/lvwerra/trl/tree/main/examples/stack_llama/scripts).
|
14 |
|
15 |
|
16 |
## Model Description
|