Bayesian generalized biclustering analysis via adaptive structured shrinkage

Ziyi Li, Changgee Chang, Suprateek Kundu, Qi Long

Research output: Contribution to journalArticlepeer-review

10 Scopus citations

Abstract

Biclustering techniques can identify local patterns of a data matrix by clustering feature space and sample space at the same time. Various biclustering methods have been proposed and successfully applied to analysis of gene expression data. While existing biclustering methods have many desirable features, most of them are developed for continuous data and few of them can efficiently handle -omics data of various types, for example, binomial data as in single nucleotide polymorphism data or negative binomial data as in RNA-seq data. In addition, none of existing methods can utilize biological information such as those from functional genomics or proteomics. Recent work has shown that incorporating biological information can improve variable selection and prediction performance in analyses such as linear regression and multivariate analysis. In this article, we propose a novel Bayesian biclustering method that can handle multiple data types including Gaussian, Binomial, and Negative Binomial. In addition, our method uses a Bayesian adaptive structured shrinkage prior that enables feature selection guided by existing biological information. Our simulation studies and application to multi-omics datasets demonstrate robust and superior performance of the proposed method, compared to other existing biclustering methods.

Original languageEnglish (US)
Pages (from-to)610-624
Number of pages15
JournalBiostatistics
Volume21
Issue number3
DOIs
StatePublished - 2021
Externally publishedYes

Keywords

  • Adaptive shrinkage prior
  • Bayesian
  • Biclustering
  • Biological information
  • Integrative analysis
  • Omics data

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

MD Anderson CCSG core facilities

  • Biostatistics Resource Group

Fingerprint

Dive into the research topics of 'Bayesian generalized biclustering analysis via adaptive structured shrinkage'. Together they form a unique fingerprint.

Cite this