Mitra Robin, Reiter Jerome P
School of Mathematics, University of Southampton, Southampton, UK
Department of Statistical Science, Duke University, Durham, NC, USA.
Stat Methods Med Res. 2016 Feb;25(1):188-204. doi: 10.1177/0962280212445945. Epub 2012 Jun 11.
In many observational studies, analysts estimate treatment effects using propensity scores, e.g. by matching or sub-classifying on the scores. When some values of the covariates are missing, analysts can use multiple imputation to fill in the missing data, estimate propensity scores based on the m completed datasets, and use the propensity scores to estimate treatment effects. We compare two approaches to implement this process. In the first, the analyst estimates the treatment effect using propensity score matching within each completed data set, and averages the m treatment effect estimates. In the second approach, the analyst averages the m propensity scores for each record across the completed datasets, and performs propensity score matching with these averaged scores to estimate the treatment effect. We compare properties of both methods via simulation studies using artificial and real data. The simulations suggest that the second method has greater potential to produce substantial bias reductions than the first, particularly when the missing values are predictive of treatment assignment.
在许多观察性研究中,分析人员使用倾向得分来估计治疗效果,例如通过在得分上进行匹配或亚分类。当协变量的某些值缺失时,分析人员可以使用多重填补法来填补缺失数据,基于m个完整数据集估计倾向得分,并使用倾向得分来估计治疗效果。我们比较了实现这一过程的两种方法。在第一种方法中,分析人员在每个完整数据集中使用倾向得分匹配来估计治疗效果,并对m个治疗效果估计值求平均值。在第二种方法中,分析人员对完整数据集中每条记录的m个倾向得分求平均值,并使用这些平均得分进行倾向得分匹配以估计治疗效果。我们通过使用人工数据和真实数据的模拟研究来比较这两种方法的特性。模拟结果表明,第二种方法比第一种方法更有可能大幅减少偏差,特别是当缺失值可预测治疗分配时。