JustinLin610
update
10b0761
|
raw
history blame
4.15 kB

WMT 20

This page provides pointers to the models of Facebook-FAIR's WMT'20 news translation task submission (Chen et al., 2020).

Single best MT models (after finetuning on part of WMT20 news dev set)

Model Description Download
transformer.wmt20.ta-en Ta->En download (.tar.gz)
transformer.wmt20.en-ta En->Ta download (.tar.gz)
transformer.wmt20.iu-en.news Iu->En (News domain) download (.tar.gz)
transformer.wmt20.en-iu.news En->Iu (News domain) download (.tar.gz)
transformer.wmt20.iu-en.nh Iu->En (Nunavut Hansard domain) download (.tar.gz)
transformer.wmt20.en-iu.nh En->Iu (Nunavut Hansard domain) download (.tar.gz)

Language models

Model Description Download
transformer_lm.wmt20.en En Language Model download (.tar.gz)
transformer_lm.wmt20.ta Ta Language Model download (.tar.gz)
transformer_lm.wmt20.iu.news Iu Language Model (News domain) download (.tar.gz)
transformer_lm.wmt20.iu.nh Iu Language Model (Nunavut Hansard domain) download (.tar.gz)

Example usage (torch.hub)

Translation

import torch

# English to Tamil translation
en2ta = torch.hub.load('pytorch/fairseq', 'transformer.wmt20.en-ta')
en2ta.translate("Machine learning is great!")  # 'இயந்திரக் கற்றல் அருமை!'

# Tamil to English translation
ta2en = torch.hub.load('pytorch/fairseq', 'transformer.wmt20.ta-en')
ta2en.translate("இயந்திரக் கற்றல் அருமை!")  # 'Machine learning is great!'

# English to Inuktitut translation
en2iu = torch.hub.load('pytorch/fairseq', 'transformer.wmt20.en-iu.news')
en2iu.translate("machine learning is great!")  # 'ᖃᒧᑕᐅᔭᓄᑦ ᐃᓕᓐᓂᐊᕐᓂᖅ ᐱᐅᔪᒻᒪᕆᒃ!'

# Inuktitut to English translation
iu2en = torch.hub.load('pytorch/fairseq', 'transformer.wmt20.iu-en.news')
iu2en.translate("ᖃᒧᑕᐅᔭᓄᑦ ᐃᓕᓐᓂᐊᕐᓂᖅ ᐱᐅᔪᒻᒪᕆᒃ!")  # 'Machine learning excellence!'

Language Modeling

# Sample from the English LM
en_lm = torch.hub.load('pytorch/fairseq', 'transformer_lm.wmt20.en')
en_lm.sample("Machine learning is")  # 'Machine learning is a type of artificial intelligence that uses machine learning to learn from data and make predictions.'

# Sample from the Tamil LM
ta_lm = torch.hub.load('pytorch/fairseq', 'transformer_lm.wmt20.ta')
ta_lm.sample("இயந்திரக் கற்றல் என்பது செயற்கை நுண்ணறிவின்")  # 'இயந்திரக் கற்றல் என்பது செயற்கை நுண்ணறிவின் ஒரு பகுதியாகும்.'

# Sample from the Inuktitut LM
iu_lm = torch.hub.load('pytorch/fairseq', 'transformer_lm.wmt20.iu.news')
iu_lm.sample("ᖃᒧᑕᐅᔭᓄᑦ ᐃᓕᓐᓂᐊᕐᓂᖅ")  # 'ᖃᒧᑕᐅᔭᓄᑦ ᐃᓕᓐᓂᐊᕐᓂᖅ, ᐊᒻᒪᓗ ᓯᓚᐅᑉ ᐊᓯᙳᖅᐸᓪᓕᐊᓂᖓᓄᑦ ᖃᓄᐃᓕᐅᕈᑎᒃᓴᑦ, ᐃᓚᖃᖅᖢᑎᒃ ᐅᑯᓂᖓ:'

Citation

@inproceedings{chen2020facebook
  title={Facebook AI's WMT20 News Translation Task Submission},
  author={Peng-Jen Chen and Ann Lee and Changhan Wang and Naman Goyal and Angela Fan and Mary Williamson and Jiatao Gu},
  booktitle={Proc. of WMT},
  year={2020},
}