Identification of Common Prognostic Gene Expression Signatures with Biological Meanings from Microarray Gene Expression Datasets

Jun Yao, Qi Zhao, Ying Yuan, Li Zhang, Xiaoming Liu, W. K.Alfred Yung, John N. Weinstein

Research output: Contribution to journalArticlepeer-review

17 Scopus citations

Abstract

Numerous prognostic gene expression signatures for breast cancer were generated previously with few overlap and limited insight into the biology of the disease. Here we introduce a novel algorithm named SCoR (Survival analysis using Cox proportional hazard regression and Random resampling) to apply random resampling and clustering methods in identifying gene features correlated with time to event data. This is shown to reduce overfitting noises involved in microarray data analysis and discover functional gene sets linked to patient survival. SCoR independently identified a common poor prognostic signature composed of cell proliferation genes from six out of eight breast cancer datasets. Furthermore, a sequential SCoR analysis on highly proliferative breast cancers repeatedly identified T/B cell markers as favorable prognosis factors. In glioblastoma, SCoR identified a common good prognostic signature of chromosome 10 genes from two gene expression datasets (TCGA and REMBRANDT), recapitulating the fact that loss of one copy of chromosome 10 (which harbors the tumor suppressor PTEN) is linked to poor survival in glioblastoma patients. SCoR also identified prognostic genes on sex chromosomes in lung adenocarcinomas, suggesting patient gender might be used to predict outcome in this disease. These results demonstrate the power of SCoR to identify common and biologically meaningful prognostic gene expression signatures.

Original languageEnglish (US)
Article numbere45894
JournalPloS one
Volume7
Issue number9
DOIs
StatePublished - Sep 21 2012

ASJC Scopus subject areas

  • General Biochemistry, Genetics and Molecular Biology
  • General Agricultural and Biological Sciences
  • General

MD Anderson CCSG core facilities

  • Bioinformatics Shared Resource
  • Biostatistics Resource Group

Fingerprint

Dive into the research topics of 'Identification of Common Prognostic Gene Expression Signatures with Biological Meanings from Microarray Gene Expression Datasets'. Together they form a unique fingerprint.

Cite this