Title
Assessment of supervised classifiers for the task of detecting messages with suicidal ideation
Date Issued
01 August 2020
Access level
open access
Resource Type
journal article
Author(s)
Publisher(s)
Elsevier Ltd
Abstract
According to the World Health Organization (WHO) close to 800,000 people worldwide die by suicide each year, and many more attempts to do it. In consequence, the WHO recognizes suicide as a global public health priority, which affects not only rich countries but poor and middle-income countries as well. This study makes a systematic analysis of 28 supervised classifiers using different features of the corpus Life to detect messages with suicidal ideation and depression to know if these can be used in an automatic prevention online system. The Life Corpus, used in this research, is a bilingual text corpus (English and Spanish) oriented to the detection of suicide ideation. This corpus was constructed retrieving texts from several social networks and its quality was measured using mutual annotation agreement. The different experiments determined that the classifier with the best performance was KStar, with the corpus features POS-SYNSETS-NUM, achieving the best results with the ROC Area metrics of 0,81036 and F-measure of 0,7148. The present research fulfilled the objective of discovering which supervised classifiers and which features are the most suitable for the automatic classification of messages with suicidal ideation using the Life Corpus. Also, given the imbalance of the results, a new precision measure was developed called the Two-dimensional Accuracy and Recovery Index (GDP), which can provide better results, in unbalanced systems, than the usual measures to assess the quality of the results (measure F, Area ROC), and thus increase the number of messages at risk of suicidal ideation, detected at the cost of receiving more messages that are not related to suicide or vice versa.
Volume
6
Issue
8
Language
English
OCDE Knowledge area
Salud pública, Salud ambiental
Psiquiatría
Subjects
Scopus EID
2-s2.0-85088837682
Source
Heliyon
ISSN of the container
24058440
Sponsor(s)
This research work has been partially funded by the University of Alicante (Spain) , Generalitat Valenciana and the Spanish Government through the projects “Tecnologías del Lenguaje Humano para una Sociedad Inclusiva, Igualitaria y Accesible” (PROMETEU/2018/089), “Modelado del Comportamiento de Entidades Digitales Mediante Tecnologías del Lenguaje Humano” (RTI2018-094653-B-C22) and “INTEGER: Intelligent Text Generation, Generación Inteligente de Textos” (RTI2018-094649-B-I00).
This research work has been partially funded by the University of Alicante (Spain), Generalitat Valenciana and the Spanish Government through the projects ?Tecnolog?as del Lenguaje Humano para una Sociedad Inclusiva, Igualitaria y Accesible? (PROMETEU/2018/089), ?Modelado del Comportamiento de Entidades Digitales Mediante Tecnolog?as del Lenguaje Humano? (RTI2018-094653-B-C22) and ?INTEGER: Intelligent Text Generation, Generaci?n Inteligente de Textos? (RTI2018-094649-B-I00).Authors would like to thank the University of Alicante (Spain), Departamento de Ingenier?a, Secci?n de Ingenier?a Inform?tica de la Pontificia Universidad Cat?lica del Per?, a la Universidad Estatal del Sur de Manab?, y al Senescyt ?Programa de Becas para Doctorado (PhD) para Docentes de Universidades y Escuelas Polit?cnicas? del Ecuador.
Sources of information:
Directorio de Producción Científica
Scopus