Autoupdate README.md
Browse files
README.md
CHANGED
@@ -121,14 +121,11 @@ Therefore, the model can have biased predictions. This bias will also affect all
|
|
121 |
The `ul2-large-en-nl` T5 model was pre-trained simultaneously on a combination of several datasets,
|
122 |
including the `full` config of the "mc4_nl_cleaned" dataset, which is a cleaned version of Common Crawl's web
|
123 |
crawl corpus, Dutch books, the Dutch subset of Wikipedia (2022-03-20), and a subset of "mc4_nl_cleaned"
|
124 |
-
containing only texts from Dutch
|
125 |
-
towards descriptions of events in the Netherlands and Belgium.
|
126 |
|
127 |
After pre-training, the model was
|
128 |
fine-tuned on a translation dataset containing 13 million sentence and paragraph pairs
|
129 |
sampled from books.
|
130 |
-
|
131 |
-
|
132 |
|
133 |
## Training procedure
|
134 |
|
|
|
121 |
The `ul2-large-en-nl` T5 model was pre-trained simultaneously on a combination of several datasets,
|
122 |
including the `full` config of the "mc4_nl_cleaned" dataset, which is a cleaned version of Common Crawl's web
|
123 |
crawl corpus, Dutch books, the Dutch subset of Wikipedia (2022-03-20), and a subset of "mc4_nl_cleaned"
|
124 |
+
containing only texts from Dutch newspapers.
|
|
|
125 |
|
126 |
After pre-training, the model was
|
127 |
fine-tuned on a translation dataset containing 13 million sentence and paragraph pairs
|
128 |
sampled from books.
|
|
|
|
|
129 |
|
130 |
## Training procedure
|
131 |
|