Bootstrap Inference for Multiple Imputation under Uncongeniality and Misspecification

Jonathan Bartlett, Rachael Hughes

Research output: Contribution to journalArticlepeer-review

53 Citations (SciVal)
20 Downloads (Pure)

Abstract

Multiple imputation has become one of the most popular approaches for handling missing data in statistical analyses. Part of this success is due to Rubin’s simple combination rules. These give frequentist valid inferences when the imputation and analysis procedures are so-called congenial and the embedding model is correctly specified, but otherwise may not. Roughly speaking, congeniality corresponds to whether the imputation and analysis models make different assumptions about the data. In practice, imputation models and analysis procedures are often not congenial, such that tests may not have the correct size, and confidence interval coverage deviates from the advertised level. We examine a number of recent proposals which combine bootstrapping with multiple imputation and determine which are valid under uncongeniality and model misspecification. Imputation followed by bootstrapping generally does not result in valid variance estimates under uncongeniality or misspecification, whereas certain bootstrap followed by imputation methods do. We recommend a particular computationally efficient variant of bootstrapping followed by imputation.

Original languageEnglish
Pages (from-to)3533-3546
JournalStatistical Methods in Medical Research
Volume29
Issue number12
Early online date30 Jun 2020
DOIs
Publication statusPublished - 1 Dec 2020

Bibliographical note

The paper has been accepted but has not been published online yet, so I don't know the embargo release date yet.

Keywords

  • Multiple imputation
  • bootstrap
  • congeniality

ASJC Scopus subject areas

  • Epidemiology
  • Statistics and Probability
  • Health Information Management

Fingerprint

Dive into the research topics of 'Bootstrap Inference for Multiple Imputation under Uncongeniality and Misspecification'. Together they form a unique fingerprint.

Cite this