Title
Natural language inference for portuguese using BERT and multilingual information
Date Issued
01 January 2020
Access level
metadata only access
Resource Type
conference paper
Author(s)
Inácio M.
Rodrigues A.C.
Casanova E.
de Sousa R.F.
Interinstitutional Center for Computational Linguistics Instituto de Ciências Matemáticas e de Computação
Publisher(s)
Springer
Abstract
Recognizing Textual Entailment, also known as inference recognition, aims to identify when the meaning of a piece of text contains the meaning of another fragment of text. In this work, we investigate multiples approaches for recognizing inference in the ASSIN dataset, an entailment recognition corpus for Portuguese. We also investigate the consequences of adding external data to improve training in two different forms: multilingual data and automatically translated corpus. Our results outperform, using the multilingual pre-trained BERT model, the current state-of-the-art for the ASSIN corpus. Finally, we show that using external data did not improve the performance of the model or the improvements are not significant.
Start page
346
End page
356
Volume
12037 LNAI
Language
English
OCDE Knowledge area
Lingüística
Scopus EID
2-s2.0-85081579496
Source
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Resource of which it is part
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ISSN of the container
03029743
ISBN of the container
978-303041504-4
Conference
14th International Conference on Computational Processing of the Portuguese Language, PROPOR 2020
Sponsor(s)
The authors are grateful to CAPES for supporting this work, and would like to thank NVIDIA for donating the GPU.
Sources of information: Directorio de Producción Científica Scopus