Survival estimation and testing via multiple imputation

Jeremy M.G. Taylor, Susan Murray, Chiu Hsieh Hsu

Research output: Contribution to journalArticle

33 Scopus citations

Abstract

Multiple imputation is a technique for handling data sets with missing values. The method fills in each missing value several times, creating many augmented data sets. Each augmented data set is analyzed separately and the results combined to give a final result consisting of an estimate and a measure of uncertainty. In this paper we consider nonparametric multiple-imputation methods to handle missing event times for censored observations in the context of nonparametric survival estimation and testing. Two nonparametric imputation schemes are considered. In risk set imputation the censored time is replaced by a random draw of the observed times amongst those at risk after the censoring time. In Kaplan-Meier (KM) imputation the imputed time is a draw from the estimated distribution of event times amongst those at risk after the censoring time. We show that with a large number of imputes the estimates from both methods reproduce the KM estimator. In a simulation study we show that the inclusion of a bootstrap stage in the multiple imputation algorithm gives coverage rates of confidence intervals that are comparable to that from Greenwood's formula. Connections to the redistribute to the right algorithm are discussed.

Original languageEnglish (US)
Pages (from-to)221-232
Number of pages12
JournalStatistics and Probability Letters
Volume58
Issue number3
DOIs
StatePublished - Jul 1 2002
Externally publishedYes

    Fingerprint

Keywords

  • Kaplan-Meier estimate
  • Multiple imputation
  • Redistribute to the right

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Cite this