Epidemiology. 2018 Jan;29(1):87-95. doi: 10.1097/EDE.0000000000000755.
Most longitudinal studies do not address potential selection biases due to selective attrition. Using empirical data and simulating additional attrition, we investigated the effectiveness of common approaches to handle missing outcome data from attrition in the association between individual education level and change in body mass index (BMI).
Using data from the two waves of the French RECORD Cohort Study (N = 7,172), we first examined how inverse probability weighting (IPW) and multiple imputation handled missing outcome data from attrition in the observed data (stage 1). Second, simulating additional missing data in BMI at follow-up under various missing-at-random scenarios, we quantified the impact of attrition and assessed how multiple imputation performed compared to complete case analysis and to a perfectly specified IPW model as a gold standard (stage 2).
With the observed data in stage 1, we found an inverse association between individual education and change in BMI, with complete case analysis, as well as with IPW and multiple imputation. When we simulated additional attrition under a missing-at-random pattern (stage 2), the bias increased with the magnitude of selective attrition, and multiple imputation was useless to address it.
Our simulations revealed that selective attrition in the outcome heavily biased the association of interest. The present article contributes to raising awareness that for missing outcome data, multiple imputation does not do better than complete case analysis. More effort is thus needed during the design phase to understand attrition mechanisms by collecting information on the reasons for dropout.
大多数纵向研究都没有解决由于选择性流失而导致的潜在选择偏差问题。本研究使用实证数据和模拟额外的流失数据,调查了在个体教育水平与体重指数(BMI)变化之间的关联中,处理因流失而导致的缺失结局数据的常见方法对于处理因流失而导致的缺失结局数据的有效性。
利用法国 RECORD 队列研究(N=7172)的两波数据,我们首先检查了逆概率加权(IPW)和多重插补在观察数据中(第 1 阶段)如何处理因流失而导致的缺失结局数据。其次,在各种随机缺失情况下,在随访时模拟 BMI 的额外缺失数据,我们量化了流失的影响,并评估了多重插补与完全案例分析和作为黄金标准的完全指定 IPW 模型相比的表现。
在第 1 阶段使用观察数据,我们发现个体教育与 BMI 变化之间呈负相关,无论是完全案例分析,还是 IPW 和多重插补。当我们在随机缺失模式下模拟额外的流失(第 2 阶段)时,选择性流失的程度越大,偏倚就越大,而多重插补对于解决这个问题是无效的。
我们的模拟结果表明,结局数据的选择性流失严重影响了感兴趣的关联。本研究有助于提高认识,即对于缺失结局数据,多重插补并不比完全案例分析更好。因此,在设计阶段需要付出更多努力,通过收集有关辍学原因的信息来了解流失机制。