Title
Experiments on sentence boundary detection in user-generated web content
Date Issued
01 January 2015
Access level
metadata only access
Resource Type
conference paper
Author(s)
Pardo T.A.S.
Universidad de São Paulo
Publisher(s)
Springer Verlag
Abstract
Sentence Boundary Detection (SBD) is a very important prerequisite for proper sentence analysis in different Natural Language Processing tasks. During the last years, many SBD methods have been used in the transcriptions produced by Automatic Speech Recognition systems and in well-structured texts (e.g. news, scientific texts). However, there are few researches about SBD in informal user-generated content such as web reviews, comments, and posts, which are not necessarily well written and structured. In this paper, we adapt and extend a well-known SBD method to the domain of the opinionated texts in the web. Particularly, we evaluate our proposal in a set of online product reviews and compare it with other traditional SBD methods. The experimental results show that we outperform these other methods.
Start page
227
End page
237
Volume
9041
Language
English
OCDE Knowledge area
Ciencias de la computación
Ciencias de la información
Subjects
Scopus EID
2-s2.0-84942571533
ISSN of the container
03029743
ISBN of the container
9783319181103
Conference
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Sources of information:
Directorio de Producción Científica
Scopus