Title
Comparing Predictive Machine Learning Algorithms in Fit for Work Occupational Health Assessments
Date Issued
01 January 2020
Access level
metadata only access
Resource Type
conference paper
Author(s)
Charapaqui-Miranda S.
Meza-Rodriguez M.
Publisher(s)
Springer
Abstract
Some studies have tried to develop predictors for fitness for work (FFW). This study assessed the question whether factors used in the occupational medical practice could predict an individual fit for work result. We used a Peruvian occupational medical examination dataset of 33347 participants. We obtained a reduced dataset of 2650. It was split into two subsets, a training dataset and a test dataset. Using the training dataset, logistic regression, decision tree, random forest, and support vector machine models were fitted, and important variables of each model were identified. Hyperparameter tuning was an important part in these non-parametric models. Also, the Area Under the Curve (AUC) metric was used for Model Selection with a 5-fold cross validation approach. The results shows the Logistic Regression as the most powerful predictor (AUC = 60.44%, Accuracy = 68.05%). It is important to notice the best variables analysis in fitness to work evaluation by a Random Forest approach. Thus, the best model was logistic regression. This also reveals that the criteria associated with the workplace and occupational clinical criteria have a low level of prediction. Further studies should be done with imbalanced data to process bigger datasets, in consequence to obtain more robust models.
Start page
218
End page
225
Volume
1070 CCIS
Language
English
OCDE Knowledge area
Sistemas de automatización, Sistemas de control
Salud ocupacional
Subjects
Scopus EID
2-s2.0-85084850887
ISBN
9783030461393
ISSN of the container
18650929
ISBN of the container
978-303046139-3
DOI of the container
10.1007/978-3-030-46140-9_21
Conference
Communications in Computer and Information Science
Sources of information:
Directorio de Producción Científica
Scopus