Sparsifying the least-squares approach to PCA: Comparison of lasso and cardinality constraint

Rosember Guerra-Urzola*, Niek C. de Schipper, Anya Tonne, Klaas Sijtsma, J. C. Vera, Katrijn Van Deun

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

1 Citation (Scopus)
107 Downloads (Pure)

Abstract

Sparse PCA methods are used to overcome the difficulty of interpreting the solution obtained from PCA. However, constraining PCA to obtain sparse solutions is an intractable problem, especially in a high-dimensional setting. Penalized methods are used to obtain sparse solutions due to their computational tractability. Nevertheless, recent developments permit efficiently obtaining good solutions of cardinality-constrained PCA problems allowing comparison between these approaches. Here, we conduct a comparison between a penalized PCA method with its cardinality-constrained counterpart for the least-squares formulation of PCA imposing sparseness on the component weights. We compare the penalized and cardinality-constrained methods through a simulation study that estimates the sparse structure’s recovery, mean absolute bias, mean variance, and mean squared error. Additionally, we use a high-dimensional data set to illustrate the methods in practice. Results suggest that using cardinality-constrained methods leads to better recovery of the sparse structure.
Original languageEnglish
Pages (from-to)269-286
JournalAdvances in Data Analysis and Classification
Volume17
Issue number1
DOIs
Publication statusPublished - 2023

Keywords

  • ALGORITHMS
  • Cardinality constraint
  • PRINCIPAL COMPONENT ANALYSIS
  • Penalized linear regression
  • REGRESSION SHRINKAGE
  • SPARSE
  • Sparse PCA
  • VARIABLE SELECTION

Fingerprint

Dive into the research topics of 'Sparsifying the least-squares approach to PCA: Comparison of lasso and cardinality constraint'. Together they form a unique fingerprint.

Cite this