Title
Automated learning of t factor analysis models with complete and incomplete data
Date Issued
01 September 2017
Access level
metadata only access
Resource Type
journal article
Author(s)
Pontificia Universidad Católica de Chile
Publisher(s)
Academic Press Inc.
Abstract
The t factor analysis (tFA) model is a promising tool for robust reduction of high-dimensional data in the presence of heavy-tailed noises. When determining the number of factors of the tFA model, a two-stage procedure is commonly performed in which parameter estimation is carried out for a number of candidate models, and then the best model is chosen according to certain penalized likelihood indices such as the Bayesian information criterion. However, the computational burden of such a procedure could be extremely high to achieve the optimal performance, particularly for extensively large data sets. In this paper, we develop a novel automated learning method in which parameter estimation and model selection are seamlessly integrated into a one-stage algorithm. This new scheme is called the automated tFA (AtFA) algorithm, and it is also workable when values are missing. In addition, we derive the Fisher information matrix to approximate the asymptotic covariance matrix associated with the ML estimators of tFA models. Experiments on real and simulated data sets reveal that the AtFA algorithm not only provides identical fitting results, as compared to traditional two-stage procedures, but also runs much faster, especially when values are missing.
Start page
157
End page
171
Volume
161
Language
English
OCDE Knowledge area
Estadísticas, Probabilidad
Scopus EID
2-s2.0-85028555696
Source
Journal of Multivariate Analysis
ISSN of the container
0047259X
Sponsor(s)
We gratefully acknowledge the Editor-in-Chief, Christian Genest, the Associate Editor and two anonymous referees for their comments and suggestions that greatly improved this paper. We are also grateful to Ms. Yu-Ju Wang for her skillful assistance on preparing the initial manuscript. W.L. Wang and T.I. Lin would like to acknowledge the support of the Ministry of Science and Technology of Taiwan under Grant Nos. MOST 105-2118-M-035-004-MY2 and MOST 105-2118-M-005-003-MY2 , respectively. L.M. Castro acknowledges support from Grant FONDECYT 1170258 from Chilean government.
Sources of information: Directorio de Producción Científica Scopus