Title
Age Groups Classification in Social Network Using Deep Learning
Date Issued
2017
Access level
open access
Resource Type
journal article
Author(s)
Guimarães R.G.
Rosa R.L.
De Gaetano D.
Bressan G.
Federal University of Lavras
Publisher(s)
Institute of Electrical and Electronics Engineers Inc.
Abstract
Social networks have a large amount of data available, but often, people do not provide some of their personal data, such as age, gender, and other demographics. Although the sentiment analysis uses such data to develop useful applications in people's daily lives, there are still failures in this type of analysis, either by the restricted number of words contained in the word dictionaries or because they do not consider the most diverse parameters that can influence the sentiments in a sentence; thus, more reliable results can be obtained, if the users profile information and their writing characteristics are considered. This research suggests that one of the most relevant parameter contained in the user profile is the age group, showing that there are typical behaviors among users of the same age group, specifically, when these users write about the same topic. A detailed analysis with 7000 sentences was performed to determine which characteristics are relevant, such as, the use of punctuation, number of characters, media sharing, topics, among others; and which ones can be disregarded for the age groups classification. Different learning machine algorithms are tested for the classification of the teenager and adult age group, and the deep convolutional neural network had the best performance, reaching a precision of 0.95 in the validation tests. Furthermore, in order to validate the usefulness of the proposed model for classifying age groups, it is implemented into the enhanced sentiment metric (eSM). In the performance validation, subjective tests are performed and the eSM with the proposed model reached a root mean square error and a Pearson correlation coefficient of 0.25 and 0.94, respectively, outperforming the eSM metric, when the age group information is not available.
Start page
10805
End page
10816
Volume
5
Language
English
OCDE Knowledge area
Medios de comunicación, Comunicación socio-cultural Sociología
Scopus EID
2-s2.0-85028381978
Source
IEEE Access
ISSN of the container
21693536
Sponsor(s)
This work was supported in part by the University of São Paulo, in part by the Federal University of Lavras, and in part by the Minas Gerais State Agency for Research and Development (FAPEMIG).
Sources of information: Directorio de Producción Científica Scopus