TY - JOUR
T1 - Sparsifying the least-squares approach to PCA
T2 - Comparison of lasso and cardinality constraint
AU - Guerra-Urzola, Rosember
AU - Schipper, Niek C. de
AU - Tonne, Anya
AU - Sijtsma, Klaas
AU - Vera, J. C.
AU - Deun, Katrijn Van
PY - 2023
Y1 - 2023
N2 - Sparse PCA methods are used to overcome the difficulty of interpreting the solution obtained from PCA. However, constraining PCA to obtain sparse solutions is an intractable problem, especially in a high-dimensional setting. Penalized methods are used to obtain sparse solutions due to their computational tractability. Nevertheless, recent developments permit efficiently obtaining good solutions of cardinality-constrained PCA problems allowing comparison between these approaches. Here, we conduct a comparison between a penalized PCA method with its cardinality-constrained counterpart for the least-squares formulation of PCA imposing sparseness on the component weights. We compare the penalized and cardinality-constrained methods through a simulation study that estimates the sparse structure’s recovery, mean absolute bias, mean variance, and mean squared error. Additionally, we use a high-dimensional data set to illustrate the methods in practice. Results suggest that using cardinality-constrained methods leads to better recovery of the sparse structure.
AB - Sparse PCA methods are used to overcome the difficulty of interpreting the solution obtained from PCA. However, constraining PCA to obtain sparse solutions is an intractable problem, especially in a high-dimensional setting. Penalized methods are used to obtain sparse solutions due to their computational tractability. Nevertheless, recent developments permit efficiently obtaining good solutions of cardinality-constrained PCA problems allowing comparison between these approaches. Here, we conduct a comparison between a penalized PCA method with its cardinality-constrained counterpart for the least-squares formulation of PCA imposing sparseness on the component weights. We compare the penalized and cardinality-constrained methods through a simulation study that estimates the sparse structure’s recovery, mean absolute bias, mean variance, and mean squared error. Additionally, we use a high-dimensional data set to illustrate the methods in practice. Results suggest that using cardinality-constrained methods leads to better recovery of the sparse structure.
KW - ALGORITHMS
KW - Cardinality constraint
KW - PRINCIPAL COMPONENT ANALYSIS
KW - Penalized linear regression
KW - REGRESSION SHRINKAGE
KW - SPARSE
KW - Sparse PCA
KW - VARIABLE SELECTION
UR - http://www.scopus.com/inward/record.url?scp=85128805627&partnerID=8YFLogxK
U2 - 10.1007/s11634-022-00499-2
DO - 10.1007/s11634-022-00499-2
M3 - Article
SN - 1862-5347
VL - 17
SP - 269
EP - 286
JO - Advances in Data Analysis and Classification
JF - Advances in Data Analysis and Classification
IS - 1
ER -