Title
Model-based clustering of censored data via mixtures of factor analyzers
Date Issued
01 December 2019
Access level
metadata only access
Resource Type
journal article
Author(s)
Pontificia Universidad Católica de Chile
Publisher(s)
Elsevier B.V.
Abstract
Mixtures of factor analyzers (MFA) provide a promising tool for modeling and clustering high-dimensional data that contain an overwhelmingly large number of attributes measured on individuals arisen from a heterogeneous population. Due to the restriction of experimental apparatus, measurements can be limited to some lower and/or upper detection bounds and thus the data are possibly censored. In this paper, we extend the MFA to accommodate censored data, and the new model is called the MFA with censoring (MFAC). A computationally feasible alternating expectation conditional maximization (AECM) algorithm is developed to carry out maximum likelihood estimation of the MFAC model. Practical issues related to model-based clustering and recovery of censored data are also discussed. Simulation studies are conducted to examine the effect of censoring in classification, estimation and cluster validation. We also present an application of the proposed approach to two real data examples in which a certain number of left-censored observations are present.
Start page
104
End page
121
Volume
140
Language
English
OCDE Knowledge area
Ciencias de la computación
Estadísticas, Probabilidad
Subjects
Scopus EID
2-s2.0-85068209782
Source
Computational Statistics and Data Analysis
ISSN of the container
01679473
Sponsor(s)
We thank the Editor, Associate Editor, and two referees for their valuable comments and suggestions on the earlier version of this paper. W.L. Wang and T.I. Lin would like to acknowledge the support of the Ministry of Science and Technology of Taiwan under Grant Nos. MOST 107-2628-M-035-001-MY3 and MOST 107-2118-M-005-002-MY2, respectively. L.M. Castro acknowledges support from Grant FONDECYT1170258 and Millennium Science Initiative of the Ministry of Economy, Development and Tourism, Grant “Millennium Nucleus Center for the Discovery of Structures in Complex Data” from the Chilean government.
Sources of information:
Directorio de Producción Científica
Scopus