Authors
Julie Josse, François Husson
Publication date
2012/6/1
Journal
Computational Statistics & Data Analysis
Volume
56
Issue
6
Pages
1869-1879
Publisher
North-Holland
Description
Cross-validation is a tried and tested approach to select the number of components in principal component analysis (PCA), however, its main drawback is its computational cost. In a regression (or in a non parametric regression) setting, criteria such as the general cross-validation one (GCV) provide convenient approximations to leave-one-out cross-validation. They are based on the relation between the prediction error and the residual sum of squares weighted by elements of a projection matrix (or a smoothing matrix). Such a relation is then established in PCA using an original presentation of PCA with a unique projection matrix. It enables the definition of two cross-validation approximation criteria: the smoothing approximation of the cross-validation criterion (SACV) and the GCV criterion. The method is assessed with simulations and gives promising results.
Total citations
2011201220132014201520162017201820192020202120222023202413111319253418371829232410