The RAND Corporation, 4570 Fifth Avenue, Pittsburgh, PA 15213, USA.
Stat Med. 2013 Aug 30;32(19):3388-414. doi: 10.1002/sim.5753. Epub 2013 Mar 18.
The use of propensity scores to control for pretreatment imbalances on observed variables in non-randomized or observational studies examining the causal effects of treatments or interventions has become widespread over the past decade. For settings with two conditions of interest such as a treatment and a control, inverse probability of treatment weighted estimation with propensity scores estimated via boosted models has been shown in simulation studies to yield causal effect estimates with desirable properties. There are tools (e.g., the twang package in R) and guidance for implementing this method with two treatments. However, there is not such guidance for analyses of three or more treatments. The goals of this paper are twofold: (1) to provide step-by-step guidance for researchers who want to implement propensity score weighting for multiple treatments and (2) to propose the use of generalized boosted models (GBM) for estimation of the necessary propensity score weights. We define the causal quantities that may be of interest to studies of multiple treatments and derive weighted estimators of those quantities. We present a detailed plan for using GBM to estimate propensity scores and using those scores to estimate weights and causal effects. We also provide tools for assessing balance and overlap of pretreatment variables among treatment groups in the context of multiple treatments. A case study examining the effects of three treatment programs for adolescent substance abuse demonstrates the methods.
在过去十年中,使用倾向评分来控制非随机或观察性研究中观察变量的预处理不平衡,以检验治疗或干预措施的因果效应,已经得到了广泛的应用。对于有两种感兴趣条件的情况,例如治疗和对照,可以通过增强模型估计的倾向评分进行治疗的逆概率加权估计,在模拟研究中已证明可以产生具有理想特性的因果效应估计。有一些工具(例如 R 中的 twang 包)和指南可用于对两种治疗方法实施此方法。但是,对于三种或更多种治疗方法的分析,尚无此类指导。本文的目标有两个:(1)为希望对多种治疗方法实施倾向评分加权的研究人员提供逐步指导;(2)提出使用广义增强模型(GBM)来估计必要的倾向评分权重。我们定义了可能对多种治疗方法研究感兴趣的因果数量,并推导出这些数量的加权估计量。我们提出了一个详细的计划,用于使用 GBM 来估计倾向评分,并使用这些评分来估计权重和因果效应。我们还提供了用于评估多种治疗情况下治疗组之间预处理变量的平衡和重叠的工具。一项针对青少年药物滥用的三种治疗方案效果的案例研究展示了这些方法。