Title
Findings of the AmericasNLP 2021 Shared Task on Open Machine Translation for Indigenous Languages of the Americas
Date Issued
01 January 2021
Access level
metadata only access
Resource Type
conference paper
Author(s)
Mager M.
Ebrahimi A.
Ortega J.
Rios A.
Fan A.
Gutierrez-Vasques X.
Chiruzzo L.
Giménez-Lugo G.A.
Ramos R.
Ruiz I.V.M.
Coto-Solano R.
Palmer A.
Mager E.
Chaudhary V.
Neubig G.
Vu N.T.
Kann K.
Universidad de Edimburgo
Publisher(s)
Association for Computational Linguistics (ACL)
Abstract
This paper presents the results of the 2021 Shared Task on Open Machine Translation for Indigenous Languages of the Americas. The shared task featured two independent tracks, and participants submitted machine translation systems for up to 10 indigenous languages. Overall, 8 teams participated with a total of 214 submissions. We provided training sets consisting of data collected from various sources, as well as manually translated sentences for the development and test sets. An official baseline trained on this data was also provided. Team submissions featured a variety of architectures, including both statistical and neural models, and for the majority of languages, many teams were able to considerably improve over the baseline. The best performing systems achieved 12.97 ChrF higher than baseline, when averaged across languages.
Start page
202
End page
217
Language
English
OCDE Knowledge area
Idiomas específicos
Lingüística
Scopus EID
2-s2.0-85115703569
ISBN of the container
9781954085442
Conference
Proceedings of the 1st Workshop on Natural Language Processing for Indigenous Languages of the Americas, AmericasNLP 2021
Source funding
Microsoft Research
Sponsor(s)
We would like to thank translators of the test and development set, that made this shared task possible: Francisco Morales (Bribri), Feliciano Torres Ríos and Esau Zumaeta Rojas (Asháninka), Perla Alvarez Britez (Guarani), Silvino González de la Crúz (Wixarika), Giovany Martínez Sebastián, Pedro Kapoltitan, and José Antonio (Nahuatl), José Mateo Lino Cajero Velázquez (Otomí), Liz Chávez (Shipibo-Konibo), and María del Cármen Sotelo Holguín (Rarámuri). We also thank our sponsors for their financial support: Facebook AI Research, Microsoft Research, Google Research, the Institute of Computational Linguistics at the University of Zurich, the NAACL Emerging Regions Funding, Comunidad Elotl, and Snorkel AI. Additionally we want to thank all participants for their submissions and effort to advance NLP research for the indigenous languages of the Americas. Manuel Mager received financial support by DAAD Doctoral Research Grant for this work.
Sources of information:
Directorio de Producción Científica
Scopus