Title
The University of Edinburgh's English-Tamil and English-Inuktitut Submissions to the WMT20 News Translation Task
Date Issued
01 January 2020
Access level
metadata only access
Resource Type
conference paper
Author(s)
University of Edinburgh
Publisher(s)
Association for Computational Linguistics (ACL)
Abstract
We describe the University of Edinburgh's submissions to the WMT20 news translation shared task for the low resource language pair English-Tamil and the mid-resource language pair English-Inuktitut. We use the neural machine translation transformer architecture for all submissions and explore a variety of techniques to improve translation quality to compensate for the lack of parallel training data. For the very low-resource English-Tamil, this involves exploring pretraining, using both language model objectives and translation using an unrelated high-resource language pair (German-English), and iterative backtranslation. For English-Inuktitut, we explore the use of multilingual systems, which, despite not being part of the primary submission, would have achieved the best results on the test set.
Start page
92
End page
99
Language
English
OCDE Knowledge area
Ciencias de la computación
Lingüística
Scopus EID
2-s2.0-85115690466
ISBN
9781948087810
Resource of which it is part
5th Conference on Machine Translation, WMT 2020 - Proceedings
ISBN of the container
978-194808781-0
Conference
5th Conference on Machine Translation, WMT 2020
Sponsor(s)
This work was supported by funding from the European Union’s Horizon 2020 research and innovation programme under grant agreements No 825299 (GoURMET), 825303 and the UK Engineering and Physical Sciences Research Council (EPSRC) fellowship grant EP/S001271/1 (MT-Stretch). The research presented in this publication was conducted in cooperation with Samsung Electronics Polska sp. z o.o. - Samsung RD Institute Poland.
This work was supported by funding from the European Union's Horizon 2020 research and innovation programme under grant agreements No 825299 (GoURMET), 825303 and the UK Engineering and Physical Sciences Research Council (EPSRC) fellowship grant EP/S001271/1 (MT-Stretch). The research presented in this publication was conducted in cooperation with Samsung Electronics Polska sp. z o.o. - Samsung RD Institute Poland.
Sources of information:
Directorio de Producción Científica
Scopus