Bayesian ensemble methods for survival prediction in gene expression data

Vinicius Bonato, Veerabhadran Baladandayuthapani, Bradley M. Broom, Erik P. Sulman, Kenneth D. Aldape, Kim Anh Do

Research output: Contribution to journal › Article › peer-review

56 Scopus citations

Abstract

Motivation: We propose a Bayesian ensemble method for survival prediction in high-dimensional gene expression data. We specify a fully Bayesian hierarchical approach based on an ensemble 'sum-of-trees' model and illustrate our method using three popular survival models. Our non-parametric method incorporates both additive and interaction effects between genes, which results in high predictive accuracy compared with other methods. In addition, our method provides model-free variable selection of important prognostic markers based on controlling the false discovery rate, thus providing a unified procedure to select relevant genes and predict survivor functions.

Results: We assess the performance of our method on several simulated and real microarray datasets. We show that our method selects genes potentially related to the development of the disease and yields predictive performance that is competitive with many existing methods.

Original language: English (US)
Article number: btq660
Pages (from-to): 359-367
Number of pages: 9
Journal: Bioinformatics
Volume: 27
Issue number: 3
State: Published - Feb 2011

ASJC Scopus subject areas

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics

MD Anderson CCSG core facilities

  • Bioinformatics Shared Resource
