1 School of Public Health, University of Adelaide, Australia.
2 MRC Biostatistics Unit, Cambridge Institute of Public Health, UK.
Stat Methods Med Res. 2018 Sep;27(9):2610-2626. doi: 10.1177/0962280216683570. Epub 2016 Dec 19.
The use of multiple imputation has increased markedly in recent years, and journal reviewers may expect to see multiple imputation used to handle missing data. However in randomized trials, where treatment group is always observed and independent of baseline covariates, other approaches may be preferable. Using data simulation we evaluated multiple imputation, performed both overall and separately by randomized group, across a range of commonly encountered scenarios. We considered both missing outcome and missing baseline data, with missing outcome data induced under missing at random mechanisms. Provided the analysis model was correctly specified, multiple imputation produced unbiased treatment effect estimates, but alternative unbiased approaches were often more efficient. When the analysis model overlooked an interaction effect involving randomized group, multiple imputation produced biased estimates of the average treatment effect when applied to missing outcome data, unless imputation was performed separately by randomized group. Based on these results, we conclude that multiple imputation should not be seen as the only acceptable way to handle missing data in randomized trials. In settings where multiple imputation is adopted, we recommend that imputation is carried out separately by randomized group.
近年来,多重插补的使用显著增加,期刊审稿人可能期望看到使用多重插补来处理缺失数据。然而,在随机试验中,处理组始终是观察到的,且与基线协变量无关,因此其他方法可能更可取。我们使用数据模拟评估了多重插补,在一系列常见情况下,分别对整个数据集和按随机分组进行了插补。我们考虑了缺失结局和缺失基线数据,结局数据的缺失是在随机缺失机制下产生的。只要分析模型正确指定,多重插补就会产生无偏的治疗效果估计,但替代的无偏方法通常更有效。当分析模型忽略了涉及随机分组的交互效应时,对于缺失结局数据,应用多重插补会产生平均治疗效果的有偏估计,除非按随机分组分别进行插补。基于这些结果,我们得出结论,不应将多重插补视为处理随机试验中缺失数据的唯一可接受方法。在采用多重插补的情况下,我们建议按随机分组分别进行插补。