Model-based approaches to synthesize microarray data: A unifying review using mixture of SEMs

F. Martella, J.K. Vermunt

Research output: Contribution to journalArticleScientificpeer-review


Several statistical methods are nowadays available for the analysis of gene expression data recorded through microarray technology. In this article, we take a closer look at several Gaussian mixture models which have recently been proposed to model gene expression data. It can be shown that these are special cases of a more general model, called the mixture of structural equation models (mixture of SEMs), which has been developed in psychometrics. This model combines mixture modelling and SEMs by assuming that component-specific means and variances are subject to a SEM. The connection with SEM is useful for at least two reasons: (1) it shows the basic assumptions of existing methods more explicitly and (2) it helps in straightforward development of alternative mixture models for gene expression data with alternative mean/covariance structures. Different specifications of mixture of SEMs for clustering gene expression data are illustrated using two benchmark datasets.
Keywords: biclustering, correlated data, microarray data, mixture of SEMs, simultaneous clustering and dimensional reduction
Original languageEnglish
Pages (from-to)567-582
JournalStatistical Methods in Medical Research
Issue number6
Publication statusPublished - 2013


Dive into the research topics of 'Model-based approaches to synthesize microarray data: A unifying review using mixture of SEMs'. Together they form a unique fingerprint.

Cite this