Bayesian compositional regression with structured priors for microbiome feature selection

Liangliang Zhang, Yushu Shi, Robert R. Jenq, Kim Anh Do, Christine B. Peterson

Research output: Contribution to journalArticlepeer-review

7 Scopus citations

Abstract

The microbiome plays a critical role in human health and disease, and there is a strong scientific interest in linking specific features of the microbiome to clinical outcomes. There are key aspects of microbiome data, however, that limit the applicability of standard variable selection methods. In particular, the observed data are compositional, as the counts within each sample have a fixed-sum constraint. In addition, microbiome features, typically quantified as operational taxonomic units, often reflect microorganisms that are similar in function, and may therefore have a similar influence on the response variable. To address the challenges posed by these aspects of the data structure, we propose a variable selection technique with the following novel features: a generalized transformation and z-prior to handle the compositional constraint, and an Ising prior that encourages the joint selection of microbiome features that are closely related in terms of their genetic sequence similarity. We demonstrate that our proposed method outperforms existing penalized approaches for microbiome variable selection in both simulation and the analysis of real data exploring the relationship of the gut microbiome to body mass index.

Original languageEnglish (US)
Pages (from-to)824-838
Number of pages15
JournalBiometrics
Volume77
Issue number3
DOIs
StatePublished - Sep 2021

Keywords

  • Bayesian variable selection
  • Ising prior
  • compositional covariates
  • linear regression
  • microbiome data
  • zero-constrained prior

ASJC Scopus subject areas

  • Statistics and Probability
  • General Biochemistry, Genetics and Molecular Biology
  • General Immunology and Microbiology
  • General Agricultural and Biological Sciences
  • Applied Mathematics

MD Anderson CCSG core facilities

  • Biostatistics Resource Group
  • Microbiome Facility

Fingerprint

Dive into the research topics of 'Bayesian compositional regression with structured priors for microbiome feature selection'. Together they form a unique fingerprint.

Cite this