Bembom Oliver, van der Laan Mark J
University of California at Berkeley, Division of Biostatistics, School of Public Health, 101 Haviland Hall #7358, Berkeley, California 94720-7358.
Electron J Stat. 2007;1:574-596. doi: 10.1214/07-EJS105.
The effect of vigorous physical activity on mortality in the elderly is difficult to estimate using conventional approaches to causal inference that define this effect by comparing the mortality risks corresponding to hypothetical scenarios in which all subjects in the target population engage in a given level of vigorous physical activity. A causal effect defined on the basis of such a static treatment intervention can only be identified from observed data if all subjects in the target population have a positive probability of selecting each of the candidate treatment options, an assumption that is highly unrealistic in this case since subjects with serious health problems will not be able to engage in higher levels of vigorous physical activity. This problem can be addressed by focusing instead on causal effects that are defined on the basis of realistic individualized treatment rules and intention-to-treat rules that explicitly take into account the set of treatment options that are available to each subject. We present a data analysis to illustrate that estimators of static causal effects in fact tend to overestimate the beneficial impact of high levels of vigorous physical activity while corresponding estimators based on realistic individualized treatment rules and intention-to-treat rules can yield unbiased estimates. We emphasize that the problems encountered in estimating static causal effects are not restricted to the IPTW estimator, but are also observed with the G-computation estimator, the DR-IPTW estimator, and the targeted MLE. Our analyses based on realistic individualized treatment rules and intention-to-treat rules suggest that high levels of vigorous physical activity may confer reductions in mortality risk on the order of 15-30%, although in most cases the evidence for such an effect does not quite reach the 0.05 level of significance.
使用传统的因果推断方法来估计剧烈体育活动对老年人死亡率的影响是困难的,传统方法通过比较目标人群中所有受试者都进行给定水平的剧烈体育活动这一假设情景下对应的死亡风险来定义这种影响。基于这种静态治疗干预定义的因果效应,只有在目标人群中的所有受试者都有选择每个候选治疗方案的正概率时,才能从观察数据中识别出来,而在这种情况下这个假设是极不现实的,因为有严重健康问题的受试者将无法进行更高水平的剧烈体育活动。这个问题可以通过转而关注基于现实的个体化治疗规则和意向性治疗规则定义的因果效应来解决,这些规则明确考虑了每个受试者可用的治疗方案集。我们进行了一项数据分析,以说明静态因果效应的估计实际上往往高估了高水平剧烈体育活动的有益影响,而基于现实的个体化治疗规则和意向性治疗规则的相应估计可以产生无偏估计。我们强调,在估计静态因果效应时遇到的问题不仅限于逆概率加权(IPTW)估计器,在G计算估计器、双重稳健逆概率加权(DR-IPTW)估计器和靶向极大似然估计(targeted MLE)中也能观察到。我们基于现实的个体化治疗规则和意向性治疗规则的分析表明,高水平的剧烈体育活动可能使死亡风险降低15%-30%左右,尽管在大多数情况下,这种效应的证据并未达到0.05的显著性水平。