Abstract
The aim of this paper is to establish several theoretical properties of principal component analysis for multiple-component spike covariance models. Our results reveal an asymptotic conical structure in critical sample eigendirections under the spike models with distinguishable (or indistinguishable) eigenvalues, when the sample size and/or the number of variables (or dimension) tend to infinity. The consistency of the sample eigenvectors relative to their population counterparts is determined by the ratio between the dimension and the product of the sample size with the spike size. When this ratio converges to a nonzero constant, the sample eigenvector converges to a cone, with a certain angle to its corresponding population eigenvector. In the High Dimension, Low Sample Size case, the angle between the sample eigenvector and its population counterpart converges to a limiting distribution. Several generalizations of the multi-spike covariance models are explored, and additional theoretical results are presented.
Original language | English (US) |
---|---|
Pages (from-to) | 1747-1770 |
Number of pages | 24 |
Journal | Statistica Sinica |
Volume | 26 |
Issue number | 4 |
DOIs | |
State | Published - Oct 2016 |
Keywords
- Big data
- Conical behavior
- High dimension low sample size
- PCA
ASJC Scopus subject areas
- Statistics and Probability
- Statistics, Probability and Uncertainty