The University of Texas MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences, Houston, TX, USA.
Department of Biostatistics, University of Texas MD Anderson Cancer Center, Houston, TX, USA.
Stat Methods Med Res. 2021 Apr;30(4):1119-1142. doi: 10.1177/0962280220983512. Epub 2021 Feb 1.
The inverse probability weighting is an important propensity score weighting method to estimate the average treatment effect. Recent literature shows that it can be easily combined with covariate balancing constraints to reduce the detrimental effects of excessively large weights and improve balance. Other methods are available to derive weights that balance covariate distributions between the treatment groups without the involvement of propensity scores. We conducted comprehensive Monte Carlo experiments to study whether the use of covariate balancing constraints circumvent the need for correct propensity score model specification, and whether the use of a propensity score model further improves the estimation performance among methods that use similar covariate balancing constraints. We compared simple inverse probability weighting, two propensity score weighting methods with balancing constraints (covariate balancing propensity score, covariate balancing scoring rule), and two weighting methods with balancing constraints but without using the propensity scores (entropy balancing and kernel balancing). We observed that correct specification of the propensity score model remains important even when the constraints effectively balance the covariates. We also observed evidence suggesting that, with similar covariate balance constraints, the use of a propensity score model improves the estimation performance when the dimension of covariates is large. These findings suggest that it is important to develop flexible data-driven propensity score models that satisfy covariate balancing conditions.
逆概率加权是一种重要的倾向评分加权方法,用于估计平均治疗效果。最近的文献表明,它可以很容易地与协变量平衡约束结合使用,以减少过大权重的不利影响,并改善平衡。还有其他方法可用于在不涉及倾向评分的情况下,在治疗组之间平衡协变量分布的权重。我们进行了全面的蒙特卡罗实验,以研究使用协变量平衡约束是否可以避免正确的倾向评分模型规范的需要,以及在使用类似协变量平衡约束的方法中,使用倾向评分模型是否可以进一步提高估计性能。我们比较了简单的逆概率加权、两种具有平衡约束的倾向评分加权方法(协变量平衡倾向评分、协变量平衡评分规则),以及两种具有平衡约束但不使用倾向评分的加权方法(熵平衡和核平衡)。我们观察到,即使约束有效地平衡了协变量,正确指定倾向评分模型仍然很重要。我们还观察到一些证据表明,在具有相似协变量平衡约束的情况下,当协变量的维度较大时,使用倾向评分模型可以提高估计性能。这些发现表明,开发满足协变量平衡条件的灵活数据驱动倾向评分模型非常重要。