RegularizedSCA

Regularized simultaneous component analysis of multiblock data in R

Zhengguo Gu*, Katrijn Van Deun

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

2 Downloads (Pure)

Abstract

This article introduces a package developed for R (R Core Team, 2017) for performing an integrated analysis of multiple data blocks (i.e., linked data) coming from different sources. The methods in this package combine simultaneous component analysis (SCA) with structured selection of variables. The key feature of this package is that it allows to (1) identify joint variation that is shared across all the data sources and specific variation that is associated with one or a few of the data sources and (2) flexibly estimate component matrices with predefined structures. Linked data occur in many disciplines (e.g., biomedical research, bioinformatics, chemometrics, finance, genomics, psychology, and sociology) and especially in multidisciplinary research. Hence, we expect our package to be useful in various fields.
Original languageEnglish
Pages (from-to)2268-2289
JournalBehavior Research Methods
Volume51
Issue number5
DOIs
Publication statusPublished - 2019

Fingerprint

Information Storage and Retrieval
Computational Biology

Keywords

  • CHILDREN
  • COMMON
  • Common
  • Group Lasso
  • INFORMATION
  • Lasso
  • Linked data analysis
  • MODEL SELECTION
  • Multiblock analysis
  • PARENTS
  • SCA
  • Simultaneous component analysis
  • distinctive components

Cite this

@article{a326d0ed5b1b473cbffd331cfee42837,
title = "RegularizedSCA: Regularized simultaneous component analysis of multiblock data in R",
abstract = "This article introduces a package developed for R (R Core Team, 2017) for performing an integrated analysis of multiple data blocks (i.e., linked data) coming from different sources. The methods in this package combine simultaneous component analysis (SCA) with structured selection of variables. The key feature of this package is that it allows to (1) identify joint variation that is shared across all the data sources and specific variation that is associated with one or a few of the data sources and (2) flexibly estimate component matrices with predefined structures. Linked data occur in many disciplines (e.g., biomedical research, bioinformatics, chemometrics, finance, genomics, psychology, and sociology) and especially in multidisciplinary research. Hence, we expect our package to be useful in various fields.",
keywords = "CHILDREN, COMMON, Common, Group Lasso, INFORMATION, Lasso, Linked data analysis, MODEL SELECTION, Multiblock analysis, PARENTS, SCA, Simultaneous component analysis, distinctive components",
author = "Zhengguo Gu and {Van Deun}, Katrijn",
year = "2019",
doi = "10.3758/s13428-018-1163-z",
language = "English",
volume = "51",
pages = "2268--2289",
journal = "Behavior Research Methods",
issn = "1554-351X",
publisher = "Springer",
number = "5",

}

RegularizedSCA : Regularized simultaneous component analysis of multiblock data in R. / Gu, Zhengguo; Van Deun, Katrijn.

In: Behavior Research Methods, Vol. 51, No. 5, 2019, p. 2268-2289.

Research output: Contribution to journalArticleScientificpeer-review

TY - JOUR

T1 - RegularizedSCA

T2 - Regularized simultaneous component analysis of multiblock data in R

AU - Gu, Zhengguo

AU - Van Deun, Katrijn

PY - 2019

Y1 - 2019

N2 - This article introduces a package developed for R (R Core Team, 2017) for performing an integrated analysis of multiple data blocks (i.e., linked data) coming from different sources. The methods in this package combine simultaneous component analysis (SCA) with structured selection of variables. The key feature of this package is that it allows to (1) identify joint variation that is shared across all the data sources and specific variation that is associated with one or a few of the data sources and (2) flexibly estimate component matrices with predefined structures. Linked data occur in many disciplines (e.g., biomedical research, bioinformatics, chemometrics, finance, genomics, psychology, and sociology) and especially in multidisciplinary research. Hence, we expect our package to be useful in various fields.

AB - This article introduces a package developed for R (R Core Team, 2017) for performing an integrated analysis of multiple data blocks (i.e., linked data) coming from different sources. The methods in this package combine simultaneous component analysis (SCA) with structured selection of variables. The key feature of this package is that it allows to (1) identify joint variation that is shared across all the data sources and specific variation that is associated with one or a few of the data sources and (2) flexibly estimate component matrices with predefined structures. Linked data occur in many disciplines (e.g., biomedical research, bioinformatics, chemometrics, finance, genomics, psychology, and sociology) and especially in multidisciplinary research. Hence, we expect our package to be useful in various fields.

KW - CHILDREN

KW - COMMON

KW - Common

KW - Group Lasso

KW - INFORMATION

KW - Lasso

KW - Linked data analysis

KW - MODEL SELECTION

KW - Multiblock analysis

KW - PARENTS

KW - SCA

KW - Simultaneous component analysis

KW - distinctive components

U2 - 10.3758/s13428-018-1163-z

DO - 10.3758/s13428-018-1163-z

M3 - Article

VL - 51

SP - 2268

EP - 2289

JO - Behavior Research Methods

JF - Behavior Research Methods

SN - 1554-351X

IS - 5

ER -