Wan Fei
Am J Epidemiol. 2024 Feb 5;193(2):360-369. doi: 10.1093/aje/kwad189.
Conventional propensity score methods encounter challenges when unmeasured confounding is present, as it becomes impossible to accurately estimate the gold-standard propensity score when data on certain confounders are unavailable. Propensity score calibration (PSC) addresses this issue by constructing a surrogate for the gold-standard propensity score under the surrogacy assumption. This assumption posits that the error-prone propensity score, based on observed confounders, is independent of the outcome when conditioned on the gold-standard propensity score and the exposure. However, this assumption implies that confounders cannot directly impact the outcome and that their effects on the outcome are solely mediated through the propensity score. This raises concerns regarding the applicability of PSC in practical settings where confounders can directly affect the outcome. While PSC aims to target a conditional treatment effect by conditioning on a subject's unobservable propensity score, the causal interest in the latter case lies in a conditional treatment effect conditioned on a subject's baseline characteristics. Our analysis reveals that PSC is generally biased unless the effects of confounders on the outcome and treatment are proportional to each other. Furthermore, we identify 2 sources of bias: 1) the noncollapsibility of effect measures, such as the odds ratio or hazard ratio and 2) residual confounding, as the calibrated propensity score may not possess the properties of a valid propensity score.
当存在未测量的混杂因素时,传统的倾向评分方法会遇到挑战,因为当某些混杂因素的数据不可用时,就无法准确估计金标准倾向评分。倾向评分校准(PSC)通过在替代假设下构建金标准倾向评分的替代物来解决这个问题。该假设假定,基于观察到的混杂因素的容易出错的倾向评分,在以金标准倾向评分和暴露为条件时与结果独立。然而,这个假设意味着混杂因素不能直接影响结果,并且它们对结果的影响仅通过倾向评分来介导。这引发了对PSC在混杂因素可直接影响结果的实际环境中的适用性的担忧。虽然PSC旨在通过以受试者不可观察的倾向评分为条件来针对条件治疗效果,但在后一种情况下的因果兴趣在于以受试者的基线特征为条件的条件治疗效果。我们的分析表明,除非混杂因素对结果和治疗的影响相互成比例,否则PSC通常存在偏差。此外,我们确定了2个偏差来源:1)效应测量指标(如比值比或风险比)的不可折叠性,以及2)残余混杂,因为校准后的倾向评分可能不具备有效倾向评分的属性。