Importance bootstrap resampling for proportional hazards regression

Research output: Contribution to journal › Article › peer-review

7 Scopus citations

Abstract

Importance resampling is an approach that uses exponential tilting to reduce the resampling necessary for the construction of nonparametric bootstrap confidence intervals. The properties of bootstrap importance confidence intervals are well established when the data is a smooth function of means and when there is no censoring. However, in the framework of survival or time-to-event data, the asymptotic properties of importance resampling have not been rigorously studied, mainly because of the unduly complicated theory incurred when data is censored. This paper uses extensive simulation to show that, for parameter estimates arising from fitting Cox proportional hazards models, importance bootstrap confidence intervals can be constructed if the importance resampling probabilities of the records for the n individuals in the study are determined by the empirical influence function for the parameter of interest. Our results show that, compared to uniform resampling, importance resampling improves the relative mean-squared-error (MSE) efficiency by a factor of nine (for n = 200). The efficiency increases significantly with sample size, is mildly associated with the amount of censoring, but decreases slightly as the number of bootstrap resamples increases. The extra CPU time requirement for calculating importance resamples is negligible when compared to the large improvement in MSE efficiency. The method is illustrated through an application to data on chronic lymphocytic leukemia, which highlights that the bootstrap confidence interval is the preferred alternative to large sample inferences when the distribution of a specific covariate deviates from normality. Our results imply that, because of its computational efficiency, importance resampling is recommended whenever bootstrap methodology is implemented in a survival framework. 
Its use is particularly important when complex covariates are involved or the survival problem to be solved is part of a larger problem; for instance, when determining confidence bounds for models linking survival time with clusters identified in gene expression microarray data.
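The tilting idea described in the abstract can be sketched as follows. This is a minimal illustration, not the paper's implementation: it uses the sample mean as a stand-in statistic (its empirical influence values are simply x_i minus the sample mean), whereas for a Cox model the influence values would come from the fitted model, which is not shown here. The function names and the tilt parameter `lam` are hypothetical; how the tilt is chosen in the paper is not stated in the abstract.

```python
import numpy as np


def tilted_probabilities(influence, lam):
    """Exponentially tilted resampling probabilities, p_i proportional to exp(lam * l_i).

    `lam` is a hypothetical tuning parameter; lam = 0 gives uniform resampling.
    """
    z = lam * np.asarray(influence, dtype=float)
    z -= z.max()  # subtract the maximum to keep the exponentials stable
    p = np.exp(z)
    return p / p.sum()


def importance_bootstrap_tail(x, threshold, lam, n_resamples, rng):
    """Estimate the bootstrap probability P*(resample mean <= threshold).

    Resamples are drawn with tilted probabilities and re-weighted so that
    the estimate still targets the uniform (ordinary) bootstrap distribution.
    """
    x = np.asarray(x, dtype=float)
    n = x.size
    influence = x - x.mean()  # empirical influence values of the mean (assumed statistic)
    p = tilted_probabilities(influence, lam)

    # Each row holds how often each observation appears in one resample.
    counts = rng.multinomial(n, p, size=n_resamples)
    stats = counts @ x / n

    # Importance weight: prod_i ((1/n) / p_i) ** count_i, computed on the log scale.
    log_w = -(counts @ np.log(n * p))
    weights = np.exp(log_w)

    return np.mean(weights * (stats <= threshold))


# Illustrative usage on simulated data (all values are assumptions).
rng = np.random.default_rng(1)
x = rng.exponential(scale=2.0, size=50)
# A negative tilt favours small observations, so lower-tail events occur more often.
est = importance_bootstrap_tail(x, threshold=np.quantile(x, 0.4), lam=-0.2,
                                n_resamples=500, rng=rng)
print(est)
```

The tilt concentrates resamples in the tail of interest, and the weights correct for the non-uniform sampling; with `lam = 0` every weight equals one and the estimator reduces to the ordinary bootstrap proportion.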

Original language: English (US)
Pages (from-to): 2173-2188
Number of pages: 16
Journal: Communications in Statistics - Theory and Methods
Volume: 30
Issue number: 10
State: Published - Oct 2001

Keywords

  • Bootstrap
  • Censored data
  • Importance resampling
  • Influence function
  • Proportional hazards regression
  • Survival data

ASJC Scopus subject areas

  • Statistics and Probability
