Robins J M, Mark S D, Newey W K
Harvard School of Public Health, Boston, Massachusetts 02115.
Biometrics. 1992 Jun;48(2):479-95.
In order to estimate the causal effects of one or more exposures or treatments on an outcome of interest, one has to account for the effect of "confounding factors" which both covary with the exposures or treatments and are independent predictors of the outcome. In this paper we present regression methods which, in contrast to standard methods, adjust for the confounding effect of multiple continuous or discrete covariates by modelling the conditional expectation of the exposures or treatments given the confounders. In the special case of a univariate dichotomous exposure or treatment, this conditional expectation is identical to what Rosenbaum and Rubin have called the propensity score. They have also proposed methods to estimate causal effects by modelling the propensity score. Our methods generalize those of Rosenbaum and Rubin in several ways. First, our approach straightforwardly allows for multivariate exposures or treatments, each of which may be continuous, ordinal, or discrete. Second, even in the case of a single dichotomous exposure, our approach does not require subclassification or matching on the propensity score so that the potential for "residual confounding," i.e., bias, due to incomplete matching is avoided. Third, our approach allows a rather general formalization of the idea that it is better to use the "estimated propensity score" than the true propensity score even when the true score is known. The additional power of our approach derives from the fact that we assume the causal effects of the exposures or treatments can be described by the parametric component of a semiparametric regression model. To illustrate our methods, we reanalyze the effect of current cigarette smoking on the level of forced expiratory volume in one second in a cohort of 2,713 adult white males. We compare the results with those obtained using standard methods.
为了估计一种或多种暴露因素或治疗方法对感兴趣的结局的因果效应,必须考虑“混杂因素”的影响,这些因素既与暴露因素或治疗方法共同变化,又是结局的独立预测因素。在本文中,我们提出了一些回归方法,与标准方法不同的是,这些方法通过对给定混杂因素情况下暴露因素或治疗方法的条件期望进行建模,来调整多个连续或离散协变量的混杂效应。在单变量二分暴露或治疗的特殊情况下,这种条件期望与罗森鲍姆和鲁宾所称的倾向得分相同。他们还提出了通过对倾向得分进行建模来估计因果效应的方法。我们的方法在几个方面对罗森鲍姆和鲁宾的方法进行了推广。首先,我们的方法直接允许多变量暴露或治疗,其中每一个都可以是连续的、有序的或离散的。其次,即使在单一二分暴露的情况下,我们的方法也不需要根据倾向得分进行亚分类或匹配,从而避免了由于不完全匹配导致的“残余混杂”,即偏差的可能性。第三,我们的方法允许对这样一种观点进行相当一般的形式化表述,即即使已知真实得分,使用 “估计的倾向得分” 也比使用真实倾向得分更好。我们方法的额外优势源于这样一个事实,即我们假设暴露因素或治疗方法的因果效应可以用半参数回归模型的参数部分来描述。为了说明我们的方法,我们重新分析了2713名成年白人男性队列中当前吸烟对一秒用力呼气量水平的影响。我们将结果与使用标准方法获得的结果进行比较。