The University of Texas MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences, Houston, Texas, USA.
Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA.
Stat Med. 2023 Jul 10;42(15):2637-2660. doi: 10.1002/sim.9741. Epub 2023 Apr 3.
Most propensity score (PS) analysis methods rely on a correctly specified parametric PS model, which may result in biased estimation of the average treatment effect (ATE) when the model is misspecified. More flexible nonparametric models for treatment assignment alleviate this issue, but they do not always guarantee covariate balance. Methods that force balance in the means of covariates and their transformations between the treatment groups, termed global balance in this article, do not always lead to unbiased estimation of ATE. Their estimated propensity scores only ensure global balance but not the balancing property, which is defined as the conditional independence between treatment assignment and covariates given the propensity score. The balancing property implies not only global balance but also local balance-the mean balance of covariates in propensity score stratified sub-populations. Local balance implies global balance, but the reverse is false. We propose the propensity score with local balance (PSLB) methodology, which incorporates nonparametric propensity score models and optimizes local balance. Extensive numerical studies showed that the proposed method can substantially outperform existing methods that estimate the propensity score by optimizing global balance, when the model is misspecified. The proposed method is implemented in the R package PSLB.
大多数倾向评分(PS)分析方法都依赖于正确指定的参数 PS 模型,当模型指定错误时,可能会导致治疗效果的平均估计(ATE)存在偏差。用于治疗分配的更灵活的非参数模型可以缓解此问题,但它们并不总是保证协变量平衡。在处理组之间强制协变量及其变换的均值平衡的方法,本文中称为全局平衡,并不总是保证 ATE 的无偏估计。它们估计的倾向分数仅确保全局平衡,但不保证平衡属性,即给定倾向分数时治疗分配与协变量之间的条件独立性。平衡属性不仅意味着全局平衡,还意味着局部平衡-倾向分数分层子群体中协变量的均值平衡。局部平衡意味着全局平衡,但反之则不然。我们提出了具有局部平衡的倾向评分(PSLB)方法,该方法结合了非参数倾向评分模型并优化了局部平衡。广泛的数值研究表明,当模型指定错误时,与通过优化全局平衡来估计倾向分数的现有方法相比,所提出的方法可以大大提高性能。该方法在 R 包 PSLB 中实现。