Woo Mi-Ja, Reiter Jerome P, Karr Alan F
National Institute of Statistical Sciences, Research Triangle Park, NC, USA.
Stat Med. 2008 Aug 30;27(19):3805-16. doi: 10.1002/sim.3278.
Propensity score matching is often used in observational studies to create treatment and control groups with similar distributions of observed covariates. Typically, propensity scores are estimated using logistic regressions that assume linearity between the logistic link and the predictors. We evaluate the use of generalized additive models (GAMs) for estimating propensity scores. We compare logistic regressions and GAMs in terms of balancing covariates using simulation studies with artificial and genuine data. We find that, when the distributions of covariates in the treatment and control groups overlap sufficiently, using GAMs can improve overall covariate balance, especially for higher-order moments of distributions. When the distributions in the two groups overlap insufficiently, GAM more clearly reveals this fact than logistic regression does. We also demonstrate via simulation that matching with GAMs can result in larger reductions in bias when estimating treatment effects than matching with logistic regression.
倾向得分匹配常用于观察性研究,以创建具有相似观察协变量分布的治疗组和对照组。通常,倾向得分使用逻辑回归进行估计,逻辑回归假设逻辑链接与预测变量之间呈线性关系。我们评估使用广义相加模型(GAM)来估计倾向得分。我们通过使用人工数据和真实数据的模拟研究,在平衡协变量方面比较逻辑回归和GAM。我们发现,当治疗组和对照组中协变量的分布充分重叠时,使用GAM可以改善整体协变量平衡,特别是对于分布的高阶矩。当两组中的分布重叠不足时,GAM比逻辑回归更清楚地揭示这一事实。我们还通过模拟证明,与逻辑回归匹配相比,使用GAM进行匹配在估计治疗效果时可以更大程度地减少偏差。