Upload README.md with huggingface_hub
README.md
CHANGED
@@ -1,8 +1,8 @@
-![halos](assets/
+![halos](https://gist.github.com/assets/29318529/fe2d8391-dbd1-4b7e-9dc4-7cb97e55bc06)
 
 This repo contains the model checkpoints for:
-- model family pythia12-0b
-- optimized with the loss
+- model family <b>pythia12-0b</b>
+- optimized with the loss <b>DPO</b>
 - aligned using the SHP, Anthropic HH and Open Assistant datasets.
 
 Please refer to our code repository which contains intructions for training your own HALOs and links to our model cards.