Austin Peter C, Grootendorst Paul, Normand Sharon-Lise T, Anderson Geoffrey M
Institute for Clinical Evaluative Sciences, Toronto, Ont., Canada.
Stat Med. 2007 Feb 20;26(4):754-68. doi: 10.1002/sim.2618.
Propensity score methods are increasingly being used to estimate causal treatment effects in the medical literature. Conditioning on the propensity score results in unbiased estimation of the expected difference in observed responses to two treatments. The degree to which conditioning on the propensity score introduces bias into the estimation of the conditional odds ratio or conditional hazard ratio, which are frequently used as measures of treatment effect in observational studies, has not been extensively studied. We conducted Monte Carlo simulations to determine the degree to which propensity score matching, stratification on the quintiles of the propensity score, and covariate adjustment using the propensity score result in biased estimation of conditional odds ratios, hazard ratios, and rate ratios. We found that conditioning on the propensity score resulted in biased estimation of the true conditional odds ratio and the true conditional hazard ratio. In all scenarios examined, treatment effects were biased towards the null treatment effect. However, conditioning on the propensity score did not result in biased estimation of the true conditional rate ratio. In contrast, conventional regression methods allowed unbiased estimation of the true conditional treatment effect when all variables associated with the outcome were included in the regression model. The observed bias in propensity score methods is due to the fact that regression models allow one to estimate conditional treatment effects, whereas propensity score methods allow one to estimate marginal treatment effects. In several settings with non-linear treatment effects, marginal and conditional treatment effects do not coincide.
倾向得分方法在医学文献中越来越多地被用于估计因果治疗效果。基于倾向得分进行条件设定可对两种治疗的观察反应的预期差异进行无偏估计。在观察性研究中,基于倾向得分进行条件设定会在多大程度上给常用作治疗效果衡量指标的条件优势比或条件风险比的估计带来偏差,这方面尚未得到广泛研究。我们进行了蒙特卡洛模拟,以确定倾向得分匹配、按倾向得分五分位数分层以及使用倾向得分进行协变量调整在多大程度上会导致对条件优势比、风险比和率比的估计出现偏差。我们发现,基于倾向得分进行条件设定会导致对真实条件优势比和真实条件风险比的估计出现偏差。在所考察的所有情形中,治疗效果都偏向于无效治疗效果。然而,基于倾向得分进行条件设定并未导致对真实条件率比的估计出现偏差。相比之下,当回归模型中纳入了所有与结局相关的变量时,传统回归方法可对真实条件治疗效果进行无偏估计。倾向得分方法中观察到的偏差是由于回归模型允许估计条件治疗效果,而倾向得分方法允许估计边际治疗效果这一事实。在几种具有非线性治疗效果的情形中,边际治疗效果和条件治疗效果并不一致。