Title
Model Comparison for the Classification of Comments Containing Suicidal Traits from Reddit via NLP and Supervised Learning
Date Issued
01 January 2022
Access level
metadata only access
Resource Type
conference paper
Author(s)
Mantilla-Saavedra C.
Publisher(s)
Springer Science and Business Media Deutschland GmbH
Abstract
In recent years, suicide has become one of the most critical issues regarding public health between teenagers and adults. On the other hand, the growth and wide-spread of social networks and mobile devices have allowed us to compile relevant information that helps us understand the thoughts, feelings, and emotions extracted from these platforms. The detection of suicidal traits on social media has be-come one relevant research topic. It has permitted the identification of probable suicide traits among media users by examining their posts on known social net-works such as Reddit. For that reason, the purpose of the present research is to compare different supervised classification models such as Logistic Regression, Support Vector Machines, Random Forest, AdaBoost, Gradient Boosting, and XGBoost; together with feature extraction techniques such as TF-IDF and Glove. The results from our experiments show that the best model is SVM with TF-IDF obtaining metrics of 91.50% in Accuracy, 92.40% in Precision, 90.30% in Re-call, and 91.50% regarding the F1-score. This study also shows that TF-IDF for feature extraction outperforms Glove when applied to the different models tested.
Start page
253
End page
263
Volume
1577 CCIS
Language
English
OCDE Knowledge area
Psicología (incluye terapias de aprendizaje, habla, visual y otras discapacidades físicas y mentales)
Otras ingenierías y tecnologías
Psiquiatría
Scopus EID
2-s2.0-85128982461
ISBN
9783031044465
Source
Communications in Computer and Information Science
Resource of which it is part
Communications in Computer and Information Science
ISSN of the container
18650929
ISBN of the container
978-303104446-5
Sources of information:
Directorio de Producción Científica
Scopus