Update README.md
Browse files
README.md
CHANGED
@@ -12,11 +12,11 @@ datasets:
|
|
12 |
|
13 |
# Infos
|
14 |
|
15 |
-
Pythia-
|
16 |
|
17 |
[wandb log](https://wandb.ai/pythia_dpo/Pythia_DPO_new/runs/xm0pxfej)
|
18 |
|
19 |
-
See [Pythia-
|
20 |
|
21 |
|
22 |
# Benchmark results:
|
|
|
12 |
|
13 |
# Infos
|
14 |
|
15 |
+
Pythia-1.4b supervised finetuned with Anthropic-hh-rlhf dataset for 1 epoch.
|
16 |
|
17 |
[wandb log](https://wandb.ai/pythia_dpo/Pythia_DPO_new/runs/xm0pxfej)
|
18 |
|
19 |
+
See [Pythia-1.4b](https://huggingface.co/EleutherAI/pythia-1.4b) for model details [(paper)](https://arxiv.org/abs/2101.00027).
|
20 |
|
21 |
|
22 |
# Benchmark results:
|