Robust Principal Component Analysis of Data with Missing Values

Kärkkäinen, Tommi; Saarela, Mirka

doi:10.1007/978-3-319-21024-7_10

Final Draft

View/Open

325.9 Kb

Downloads:

Show download details Hide download details

Kärkkäinen, T., & Saarela, M. (2015). Robust Principal Component Analysis of Data with Missing Values. In P. Perner (Ed.), Machine Learning and Data Mining in Pattern Recognition : Proceedings of the 11th International Conference, MLDM 2015, Hamburg, Germany, July 20-21, 2015 (pp. 140-154). Springer International Publishing. Lecture Notes in Computer Science, 9166. https://doi.org/10.1007/978-3-319-21024-7_10

Date

2015

Discipline

Tietotekniikka Mathematical Information Technology

Copyright

© Springer International Publishing Switzerland 2015. This is a final draft version of an article whose final and definitive form has been published by Springer. Published in this repository with the kind permission of the publisher.

Principal component analysis is one of the most popular machine learning and data mining techniques. Having its origins in statistics, principal component analysis is used in numerous applications. However, there seems to be not much systematic testing and assessment of principal component analysis for cases with erroneous and incomplete data. The purpose of this article is to propose multiple robust approaches for carrying out principal component analysis and, especially, to estimate the relative importances of the principal components to explain the data variability. Computational experiments are first focused on carefully designed simulated tests where the ground truth is known and can be used to assess the accuracy of the results of the different methods. In addition, a practical application and evaluation of the methods for an educational data set is given.

Publisher

Springer International Publishing

Parent publication ISBN

978-3-319-21024-7

Conference

International conference on machine learning and data mining

Is part of publication

Machine Learning and Data Mining in Pattern Recognition : Proceedings of the 11th International Conference, MLDM 2015, Hamburg, Germany, July 20-21, 2015

Keywords

PCA missing data robust statistics

NB.

Also part of the Lecture Notes in Artificial Intelligence book sub series (LNAI, volume 9166)

DOI

https://doi.org/10.1007/978-3-319-21024-7_10

URI

http://urn.fi/URN:NBN:fi:jyu-201509042808

Publication in research information system

https://converis.jyu.fi/converis/portal/detail/Publication/24832083

Metadata

Show full item record

Collections

Informaatioteknologian tiedekunta [2328]