cartesinus
/

iva_mt_wslot-m2m100_418M-en-pl

Text2Text Generation

machine translation

virtual assistants

natural-language-understanding

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

cartesinus commited on Mar 11, 2023

Commit

44106bc

·

1 Parent(s): 30aa00b

Update README.md

Files changed (1) hide show

README.md +17 -3

README.md CHANGED Viewed

@@ -12,14 +12,18 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# iva_mt_wslot-m2m100_418M-0.1.0
-This model is a fine-tuned version of [facebook/m2m100_418M](https://huggingface.co/facebook/m2m100_418M) on the None dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.0176
 - Bleu: 61.6249
 - Gen Len: 21.157
 ## Model description
 More information needed
@@ -30,7 +34,17 @@ More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# iva_mt_wslot-m2m100_418M-0.1.0 en-pl
+This model is a fine-tuned version of [facebook/m2m100_418M](https://huggingface.co/facebook/m2m100_418M) on the [iva_mt_wslot](https://huggingface.co/datasets/cartesinus/iva_mt_wslot) dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.0176
 - Bleu: 61.6249
 - Gen Len: 21.157
+On training set:
+- translated train witout slots in input: 93.8200 Bleu
+- translated train with slots in input: 70.5597 Bleu
 ## Model description
 More information needed
 ## Training and evaluation data
+## Dataset Composition (en-pl)
+| Corpus                                                               | Train  | Dev   | Test  |
+|----------------------------------------------------------------------|--------|-------|-------|
+| [Massive 1.1](https://huggingface.co/datasets/AmazonScience/massive) | 11514  | 2033  | 2974  |
+| [Leyzer 0.2.0](https://github.com/cartesinus/leyzer/tree/0.2.0)      | 3974   | 701   | 1380  |
+| [OpenSubtitles from OPUS](https://opus.nlpl.eu/OpenSubtitles-v1.php) | 2329   | 411   | 500   |
+| [KDE from OPUS](https://opus.nlpl.eu/KDE4.php)                       | 1154   | 241   | 241   |
+| [CCMatrix from Opus](https://opus.nlpl.eu/CCMatrix.php)              | 1096   | 232   | 237   |
+| [Ubuntu from OPUS](https://opus.nlpl.eu/Ubuntu.php)                  | 281    | 60    | 59    |
+| [Gnome from OPUS](https://opus.nlpl.eu/GNOME.php)                    | 14     | 3     | 3     |
+| *total*                                                              | 20362  | 3681  | 5394  |
 ## Training procedure