Koopman Laura, van der Heijden Geert J M G, Grobbee Diederick E, Rovers Maroeska M
Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, the Netherlands.
Am J Epidemiol. 2008 Mar 1;167(5):540-5. doi: 10.1093/aje/kwm341. Epub 2008 Jan 9.
What is the influence of various methods of handling missing data (complete case analyses, single imputation within and over trials, and multiple imputations within and over trials) on the subgroup effects of individual patient data meta-analyses? An empirical data set was used to compare these five methods regarding the subgroup results. Logistic regression analyses were used to determine interaction effects (regression coefficients, standard errors, and p values) between subgrouping variables and treatment. Stratified analyses were performed to determine the effects in subgroups (rate ratios, rate differences, and their 95% confidence intervals). Imputation over trials resulted in different regression coefficients and standard errors of the interaction term as compared with imputation within trials and complete case analyses. Significant interaction effects were found for complete case analyses and imputation within trials, whereas imputation over trials often showed no significant interaction effect. Imputation of missing data over trials might lead to bias, because association of covariates might differ across the included studies. Therefore, despite the gain in statistical power, imputation over trials is not recommended. In the authors' empirical example, imputation within trials appears to be the most appropriate approach of handling missing data in individual patient data meta-analyses.
各种处理缺失数据的方法(完全病例分析、试验内和试验间的单一填补以及试验内和试验间的多重填补)对个体患者数据荟萃分析的亚组效应有何影响?使用一个经验数据集就亚组结果对这五种方法进行比较。采用逻辑回归分析来确定亚组变量与治疗之间的交互效应(回归系数、标准误和p值)。进行分层分析以确定亚组中的效应(率比、率差及其95%置信区间)。与试验内填补和完全病例分析相比,试验间填补导致交互项的回归系数和标准误不同。完全病例分析和试验内填补发现有显著的交互效应,而试验间填补通常未显示出显著的交互效应。试验间对缺失数据的填补可能会导致偏差,因为纳入研究中协变量的关联可能不同。因此,尽管统计效能有所提高,但不建议进行试验间填补。在作者的经验示例中,试验内填补似乎是个体患者数据荟萃分析中处理缺失数据的最合适方法。