Academic Journals Database
Disseminating quality controlled scientific knowledge

Comparing the performance of different multiple imputation strategies for missing binary outcomes in cluster randomized trials: a simulation study

Author(s): Ma J | Raina P | Beyene J | Thabane L

Journal: Open Access Medical Statistics
ISSN 2230-3251

Volume: 2012;
Issue: default;
Start page: 93;
Date: 2012;
Original page

Jinhui Ma,1–3 Parminder Raina,1,2 Joseph Beyene,1 Lehana Thabane1,3–51Department of Clinical Epidemiology and Biostatistics, McMaster University, Hamilton, ON, Canada; 2McMaster University Evidence-based Practice Center, Hamilton, ON, Canada; 3Biostatistics Unit, St Joseph's Healthcare Hamilton, Hamilton, ON, Canada; 4Centre for Evaluation of Medicines, St Joseph's Healthcare Hamilton, Hamilton, ON, Canada; 5Population Health Research Institute, Hamilton Health Sciences, Hamilton, ON, CanadaIntroduction: Although researchers have proposed various strategies to handle missing outcomes in cluster randomized trials (CRTs), limited attention has been paid to the performance of these strategies. Under the assumption of covariate-dependent missingness, the objective of this simulation study is to compare the performance of various strategies in handling missing binary outcomes in CRTs under different design settings.Methods: There are six missing data strategies investigated in this paper, which include complete case analysis, standard multiple imputation (MI) strategies using either logistic regression or Markov chain Monte Carlo (MCMC) method, within-cluster MI strategies using either logistic regression or MCMC method, and MI using logistic regression with cluster as a fixed effect. The performance of these strategies is evaluated through bias, empirical standard error, root mean squared error, and coverage probability.Results: Under the assumption of covariate-dependent missingness and applying the generalized estimating equations approach for fitting the logistic regression, it was shown that complete case analysis yields valid inferences when the percentage of missing outcomes is not large (50) and the design effect is large (VIF > 3). In contrast, within-cluster MI strategy using MCMC method may yield biased estimates of treatment effect for CRTs with small cluster size (≤50). MI using logistic regression with cluster as a fixed effect may substantially overestimate the standard error of the estimated treatment effect when the intracluster correlation coefficient is small. It may also lead to biased estimated treatment effect.Conclusion: Findings from this simulation study provide researchers with quantitative evidence to guide selection of an appropriate strategy to deal with missing binary outcomes.Keywords: missing data, design effect, variance inflation factor
RPA Switzerland

RPA Switzerland

Robotic process automation


Tango Jona
Tangokurs Rapperswil-Jona