TY - GEN
T1 - The University of Edinburgh's English-Tamil and English-Inuktitut Submissions to the WMT20 News Translation Task
AU - Bawden, Rachel
AU - Birch, Alexandra
AU - Dobreva, Radina
AU - Oncevay, Arturo
AU - Barone, Antonio Valerio Miceli
AU - Williams, Philip
N1 - Publisher Copyright:
© 2020 Association for Computational Linguistics
PY - 2020
Y1 - 2020
N2 - We describe the University of Edinburgh's submissions to the WMT20 news translation shared task for the low resource language pair English-Tamil and the mid-resource language pair English-Inuktitut. We use the neural machine translation transformer architecture for all submissions and explore a variety of techniques to improve translation quality to compensate for the lack of parallel training data. For the very low-resource English-Tamil, this involves exploring pretraining, using both language model objectives and translation using an unrelated high-resource language pair (German-English), and iterative backtranslation. For English-Inuktitut, we explore the use of multilingual systems, which, despite not being part of the primary submission, would have achieved the best results on the test set.
AB - We describe the University of Edinburgh's submissions to the WMT20 news translation shared task for the low resource language pair English-Tamil and the mid-resource language pair English-Inuktitut. We use the neural machine translation transformer architecture for all submissions and explore a variety of techniques to improve translation quality to compensate for the lack of parallel training data. For the very low-resource English-Tamil, this involves exploring pretraining, using both language model objectives and translation using an unrelated high-resource language pair (German-English), and iterative backtranslation. For English-Inuktitut, we explore the use of multilingual systems, which, despite not being part of the primary submission, would have achieved the best results on the test set.
UR - http://www.scopus.com/inward/record.url?scp=85115690466&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85115690466
T3 - 5th Conference on Machine Translation, WMT 2020 - Proceedings
SP - 92
EP - 99
BT - 5th Conference on Machine Translation, WMT 2020 - Proceedings
A2 - Barrault, Loic
A2 - Bojar, Ondrej
A2 - Bougares, Fethi
A2 - Chatterjee, Rajen
A2 - Costa-Jussa, Marta R.
A2 - Federmann, Christian
A2 - Fishel, Mark
A2 - Fraser, Alexander
A2 - Graham, Yvette
A2 - Guzman, Paco
A2 - Haddow, Barry
A2 - Huck, Matthias
A2 - Yepes, Antonio Jimeno
A2 - Koehn, Philipp
A2 - Martins, Andre
A2 - Morishita, Makoto
A2 - Monz, Christof
A2 - Nagata, Masaaki
A2 - Nakazawa, Toshiaki
A2 - Negri, Matteo
PB - Association for Computational Linguistics (ACL)
T2 - 5th Conference on Machine Translation, WMT 2020
Y2 - 19 November 2020 through 20 November 2020
ER -