Accounting for network noise in graph-guided Bayesian modeling of structured high-dimensional data

Wenrui Li, Changgee Chang, Suprateek Kundu, Qi Long

Research output: Contribution to journalArticlepeer-review

Abstract

There is a growing body of literature on knowledge-guided statistical learning methods for analysis of structured high-dimensional data (such as genomic and transcriptomic data) that can incorporate knowledge of underlying networks derived from functional genomics and functional proteomics. These methods have been shown to improve variable selection and prediction accuracy and yield more interpretable results. However, these methods typically use graphs extracted from existing databases or rely on subject matter expertise, which are known to be incomplete and may contain false edges. To address this gap, we propose a graph-guided Bayesian modeling framework to account for network noise in regression models involving structured high-dimensional predictors. Specifically, we use 2 sources of network information, including the noisy graph extracted from existing databases and the estimated graph from observed predictors in the dataset at hand, to inform the model for the true underlying network via a latent scale modeling framework. This model is coupled with the Bayesian regression model with structured high-dimensional predictors involving an adaptive structured shrinkage prior. We develop an efficient Markov chain Monte Carlo algorithm for posterior sampling. We demonstrate the advantages of our method over existing methods in simulations, and through analyses of a genomics dataset and another proteomics dataset for Alzheimer's disease.

Original languageEnglish (US)
JournalBiometrics
Volume80
Issue number1
DOIs
StatePublished - Jan 29 2024

Keywords

  • adaptive Bayesian shrinkage
  • latent scale network model
  • MCMC algorithm
  • noisy graph
  • structured high-dimensional prediction

ASJC Scopus subject areas

  • Statistics and Probability
  • General Biochemistry, Genetics and Molecular Biology
  • General Immunology and Microbiology
  • General Agricultural and Biological Sciences
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'Accounting for network noise in graph-guided Bayesian modeling of structured high-dimensional data'. Together they form a unique fingerprint.

Cite this