cointegrated committed on
Commit a20d407 · 1 Parent(s): d663b7a
Update README.md
README.md
CHANGED
@@ -24,7 +24,7 @@ datasets:
 
 This a model to translate texts to the Erzya language (`myv`, cyrillic script) from 11 other languages: `ru,fi,de,es,en,hi,zh,tr,uk,fr,ar`.
 
-It is described in the paper
+It is described in the paper [The first neural machine translation system for the Erzya language](https://arxiv.org/abs/2209.09368).
 
 This model is based on [facebook/mbart-large-50](https://huggingface.co/facebook/mbart-large-50), but with updated vocabulary and checkpoint:
 - Added an extra language token `myv_XX` and 19K new BPE tokens for the Erzya language;