Title
Neural Borrowing Detection with Monolingual Lexical Models
Date Issued
01 January 2021
Access level
open access
Resource Type
conference paper
Author(s)
Publisher(s)
Incoma Ltd
Abstract
Identification of lexical borrowings, transfer of words between languages, is an essential practice of historical linguistics and a vital tool in analysis of language contact and cultural events in general. We seek to improve tools for automatic detection of lexical borrowings, focusing here on detecting borrowed words from monolingual wordlists. Starting with a recurrent neural network lexical model and competing entropies approach, we incorporate a more current Transformer based lexical model. From there we experiment with several different models and approaches including a lexical donor model with augmented wordlist. The Transformer model reduces execution time and minimally improves borrowing detection, and the augmented donor model shows some promise. A substantive change in approach or model seems necessary for significant gains in detection of lexical borrowings.
Start page
109
End page
117
Volume
2021-September
Language
English
OCDE Knowledge area
Lingüística
Otras ingenierías y tecnologías
Scopus EID
2-s2.0-85122955538
ISBN
9789544520762
Source
International Conference Recent Advances in Natural Language Processing, RANLP
Resource of which it is part
International Conference Recent Advances in Natural Language Processing, RANLP
ISSN of the container
13138502
ISBN of the container
978-954452073-1
Conference
2021 Student Research Workshop, SRW 2021 associated with the International Conference on Recent Advances in Natural Language Processing, RANLP 2021
Sponsor(s)
We sincerely thank the reviewers for their helpful and detailed comments. The first author has received funding and encouragement from the Graduate School of the Pontificia Universidad Católica del Perú (PUCP) through the Huiracocha-2019 scholarship program.
Sources of information:
Directorio de Producción Científica
Scopus