Robust Principal Component Analysis of Data with Missing Values
Kärkkäinen, T., & Saarela, M. (2015). Robust Principal Component Analysis of Data with Missing Values. In P. Perner (Ed.), Machine Learning and Data Mining in Pattern Recognition : Proceedings of the 11th International Conference, MLDM 2015, Hamburg, Germany, July 20-21, 2015 (pp. 140-154). Springer International Publishing. Lecture Notes in Computer Science, 9166. https://doi.org/10.1007/978-3-319-21024-7_10
Published in
Lecture Notes in Computer ScienceEditors
Date
2015Copyright
© Springer International Publishing Switzerland 2015. This is a final draft version of an article whose final and definitive form has been published by Springer. Published in this repository with the kind permission of the publisher.
Principal component analysis is one of the most popular machine
learning and data mining techniques. Having its origins in statistics,
principal component analysis is used in numerous applications. However,
there seems to be not much systematic testing and assessment of principal
component analysis for cases with erroneous and incomplete data.
The purpose of this article is to propose multiple robust approaches for
carrying out principal component analysis and, especially, to estimate
the relative importances of the principal components to explain the data
variability. Computational experiments are first focused on carefully designed
simulated tests where the ground truth is known and can be used
to assess the accuracy of the results of the different methods. In addition,
a practical application and evaluation of the methods for an educational
data set is given.
Publisher
Springer International PublishingParent publication ISBN
978-3-319-21024-7Conference
International conference on machine learning and data miningIs part of publication
Machine Learning and Data Mining in Pattern Recognition : Proceedings of the 11th International Conference, MLDM 2015, Hamburg, Germany, July 20-21, 2015Keywords
NB.
Also part of the Lecture Notes in Artificial Intelligence book sub series (LNAI, volume 9166)Publication in research information system
https://converis.jyu.fi/converis/portal/detail/Publication/24832083
Metadata
Show full item recordCollections
Related items
Showing items with similar title or keywords.
-
Robustifying principal component analysis with spatial sign vectors
Taskinen, Sara; Koch, Inge; Oja, Hannu (Elsevier, 2012)In this paper, we apply orthogonally equivariant spatial sign covariance matrices as well as their affine equivariant counterparts in principal component analysis. The influence functions and asymptotic covariance matrices ... -
Single-trial-based temporal principal component analysis on extracting event-related potentials of interest for an individual subject
Zhang, Guanghui; Li, Xueyan; Lu, Yingzhi; Tiihonen, Timo; Chang, Zheng; Cong, Fengyu (Elsevier, 2023)Background: Temporal principal component analysis (tPCA) has been widely used to extract event-related potentials (ERPs) at group level of multiple subjects ERP data and it assumes that the underlying factor loading is ... -
Multivariate Independent Component Analysis Identifies Patients in Newborn Screening Equally to Adjusted Reference Ranges
Kouři,l Štěpán; de Sousa, Julie; Fačevicová Kamila; Gardlo, Alžběta; Muehlmann, Christoph; Nordhausen, Klaus; Friedecký, David; Adam, Tomáš (MDPI, 2023)Newborn screening (NBS) of inborn errors of metabolism (IEMs) is based on the reference ranges established on a healthy newborn population using quantile statistics of molar concentrations of biomarkers and their ratios. ... -
Clustering Incomplete Spectral Data with Robust Methods
Äyrämö, Sami; Pölönen, Ilkka; Eskelinen, Matti (International Society for Photogrammetry and Remote Sensing, 2017)Missing value imputation is a common approach for preprocessing incomplete data sets. In case of data clustering, imputation methods may cause unexpected bias because they may change the underlying structure of the data. ...