Title
A comparative study of WHO and WHEN prediction approaches for early identification of university students at dropout risk
Date Issued
01 January 2021
Access level
metadata only access
Resource Type
conference paper
Publisher(s)
Institute of Electrical and Electronics Engineers Inc.
Abstract
Reducing the students’ dropout is one of the biggest challenges faced by educational institutions, especially in underdeveloped countries. Identification of the student with the highest risk of dropping out is generally used to apply corrective actions (WHO). Therefore, it is also important to determine WHEN a student will drop out, which is fundamental to planning preventive actions. In this work, we perform a study to quantitatively compare several approaches to address the early identification of dropout students in universities. We categorize our study into three main methods families, i.e., analytical methods, traditional classification methods, and probabilistic methods. The first is exploited at preprocessing step for selecting significant variables into the dropout identification task. The second uses machine learning models to classify students into dropout prone or non-dropout prone classes. The third family uses survival models to determine when the student would desert. To evaluate the predictive capacity of the classification models, the Kappa coefficient was incorporated into the usual machine learning metrics and shows that Kappa is handy for evaluating performance in unbalanced data. Similarly, in the survival models, the concordance index was applied to evaluate the predictive capacity. Our approach was applied over a real data set of Peruvian university graduate students to identify when and who will drop out.
Language
English
OCDE Knowledge area
Educación general (incluye capacitación, pedadogía) Ciencias de la computación
Scopus EID
2-s2.0-85123862737
Resource of which it is part
Proceedings - 2021 47th Latin American Computing Conference, CLEI 2021
ISBN of the container
9781665495035
Conference
Proceedings - 2021 47th Latin American Computing Conference, CLEI 2021
Sponsor(s)
ACKNOWLEDGMENT This research was supported by the National Fund for Scientific and Technological Development and Innovation (Fondecyt-Perú) within the framework of the ”Project of Improvement and Expansion of the Services of the National System of Science, Technology and Technological Innovation” [Grant #028-2019-FONDECYT-BM-INC.INV].
Sources of information: Directorio de Producción Científica Scopus