Analysis Of Randomized Robust Pca For High Dimensional Data
Keywords
Big Data; Low Rank Matrix; Matrix Decomposition; Outlier Detection; Randomized Algorithm; Subspace Learning
Abstract
Robust Principal Component Analysis (PCA) (or robust subspace recovery) is a particularly important problem in unsupervised learning pertaining to a broad range of applications. In this paper, we analyze a randomized robust subspace recovery algorithm to show that its complexity is independent of the size of the data matrix. Exploiting the intrinsic low-dimensional geometry of the low rank matrix, the big data matrix is first turned to smaller size compressed data. This is accomplished by selecting a small random subset of the columns of the given data matrix, which is then projected into a random low-dimensional subspace. In the next step, a convex robust PCA algorithm is applied to the compressed data to learn the columns subspace of the low rank matrix. We derive new sufficient conditions, which show that the number of linear observations and the complexity of the randomized algorithm do not depend on the size of the given data.
Publication Date
12-30-2015
Publication Title
2015 IEEE Signal Processing and Signal Processing Education Workshop, SP/SPE 2015
Number of Pages
25-30
Document Type
Article; Proceedings Paper
Personal Identifier
scopus
DOI Link
https://doi.org/10.1109/DSP-SPE.2015.7369522
Copyright Status
Unknown
Socpus ID
84964067983 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/84964067983
STARS Citation
Rahmani, Mostafa and Atia, George K., "Analysis Of Randomized Robust Pca For High Dimensional Data" (2015). Scopus Export 2015-2019. 1908.
https://stars.library.ucf.edu/scopus2015/1908