Extending principal covariates regression for high-dimensional multi-block data

S. Park*

*Corresponding author for this work

Research output: ThesisDoctoral Thesis

102 Downloads (Pure)

Abstract

This dissertation addresses the challenge of deciphering extensive datasets collected from multiple sources, such as health habits and genetic information, in the context of studying complex issues like depression. A data analysis method known as Principal Covariate Regression (PCovR) provides a strong basis in this challenge.

Yet, analyzing these intricate datasets is far from straightforward. The data often contain redundant and irrelevant variables, making it difficult to extract meaningful insights. Furthermore, these data may involve different types of outcome variables (for instance, the variable pertaining to depression could manifest as a score from a depression scale or a binary diagnosis (yes/no) from a medical professional), adding another layer of complexity.

To overcome these obstacles, novel adaptations of PCovR are proposed in this dissertation. The methods automatically select important variables, categorize insights into those originating from a single source or multiple sources, and accommodate various outcome variable types. The effectiveness of these methods is demonstrated in predicting outcomes and revealing the subtle relationships within data from multiple sources.

Moreover, the dissertation offers a glimpse of future directions in enhancing PCovR. Implications of extending the method such that it selects important variables are critically examined. Also, an algorithm that has the potential to yield optimal results is suggested.

In conclusion, this dissertation proposes methods to tackle the complexity of large data from multiple sources, and points towards where opportunities may lie in the next line of research.
Original languageEnglish
QualificationDoctor of Philosophy
Supervisors/Advisors
  • Vermunt, Jeroen, Promotor
  • Ceulemans, E., Promotor, External person
  • Van Deun, Katrijn, Co-promotor
  • Strobl, Carolin, Member PhD commission, External person
  • Kiers, Henk, Member PhD commission, External person
  • Tenenhaus, A.T., Member PhD commission, External person
  • Kaptein, Maurits, Member PhD commission
Award date17 Nov 2023
Place of Publications.l.
Publisher
Publication statusPublished - 17 Nov 2023

Fingerprint

Dive into the research topics of 'Extending principal covariates regression for high-dimensional multi-block data'. Together they form a unique fingerprint.

Cite this