Update README.md
Browse files
README.md
CHANGED
@@ -97,16 +97,17 @@ Training code was from the Google's Jax/Flax based [t5x framework](https://githu
|
|
97 |
|
98 |
Evaluation was done by fine-tuning the model on a downstream text classification task with two different labeled Finnish datasets: [Yle News](https://github.com/spyysalo/yle-corpus) and [Eduskunta](https://github.com/aajanki/eduskunta-vkk). Classification fine-tuning was done with a sequence length of 128 bytes.
|
99 |
|
100 |
-
When fine-tuned on those datasets, this model (the
|
101 |
|
102 |
| | Model parameters | Yle News accuracy | Eduskunta accuracy |
|
103 |
|-------------------------------------------------------|------------------|---------------------|----------------------|
|
104 |
|Finnish-NLP/t5-tiny-nl6-finnish | 31 million |92.80 |69.07 |
|
105 |
|Finnish-NLP/t5-mini-nl8-finnish | 72 million |93.89 |71.43 |
|
|
|
106 |
|Finnish-NLP/t5-small-nl24-finnish | 260 million |**94.68** |74.90 |
|
107 |
|Finnish-NLP/byt5-base-finnish | 582 million |92.33 |73.13 |
|
108 |
|Finnish-NLP/t5-base-nl36-finnish | 814 million |94.40 |**75.97** |
|
109 |
-
|Finnish-NLP/t5-large-nl36-finnish | 1425 million |
|
110 |
|
111 |
|
112 |
Fine-tuning Google's multilingual mT5 models on the same datasets we can clearly see that our monolingual Finnish T5 models achieve much better results on Finnish text classification:
|
|
|
97 |
|
98 |
Evaluation was done by fine-tuning the model on a downstream text classification task with two different labeled Finnish datasets: [Yle News](https://github.com/spyysalo/yle-corpus) and [Eduskunta](https://github.com/aajanki/eduskunta-vkk). Classification fine-tuning was done with a sequence length of 128 bytes.
|
99 |
|
100 |
+
When fine-tuned on those datasets, this model (the fifth row of the table) achieves the following accuracy results compared to our other T5 models and their parameter counts:
|
101 |
|
102 |
| | Model parameters | Yle News accuracy | Eduskunta accuracy |
|
103 |
|-------------------------------------------------------|------------------|---------------------|----------------------|
|
104 |
|Finnish-NLP/t5-tiny-nl6-finnish | 31 million |92.80 |69.07 |
|
105 |
|Finnish-NLP/t5-mini-nl8-finnish | 72 million |93.89 |71.43 |
|
106 |
+
|Finnish-NLP/t5-small-nl16-finnish | 184 million |94.46 |74.00 |
|
107 |
|Finnish-NLP/t5-small-nl24-finnish | 260 million |**94.68** |74.90 |
|
108 |
|Finnish-NLP/byt5-base-finnish | 582 million |92.33 |73.13 |
|
109 |
|Finnish-NLP/t5-base-nl36-finnish | 814 million |94.40 |**75.97** |
|
110 |
+
|Finnish-NLP/t5-large-nl36-finnish | 1425 million |94.17 |73.50 |
|
111 |
|
112 |
|
113 |
Fine-tuning Google's multilingual mT5 models on the same datasets we can clearly see that our monolingual Finnish T5 models achieve much better results on Finnish text classification:
|