Update README.md
README.md
CHANGED
@@ -20,7 +20,7 @@ pipeline_tag: text-generation
 
 # CataLlama-v0.1-Instruct-DPO
 
-**
+**CataLlama-v0.1-Instruct-DPO** is a DPO fine-tune of [catallama/CataLlama-v0.1-Instruct-SFT](https://huggingface.co/catallama/CataLlama-v0.1-Instruct-SFT) on the [catallama/Catalan-DPO](https://huggingface.co/datasets/catallama/Catalan-DPO) dataset.
 
 CataLlama was trained on roughly **445 million new tokens** in three separate stages. This is the 3rd and final stage of the training.
 
@@ -52,7 +52,7 @@ See the snippet below for usage with Transformers:
 import transformers
 import torch
 
-model_id = "catallama/CataLlama-v0.1-
+model_id = "catallama/CataLlama-v0.1-Instruct-DPO"
 
 pipeline = transformers.pipeline(
     "text-generation",