From the aDivision of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA; and bDepartment of Epidemiology, Harvard School of Public Health, Boston, MA.
Epidemiology. 2014 Mar;25(2):268-78. doi: 10.1097/EDE.0000000000000069.
Propensity scores are useful for confounding adjustment in the commonly observed setting of many potential confounders, frequent exposure, and rare events. However, with few exposed outcomes to inform covariate selection and many candidate confounders, optimal approaches to construct and implement propensity-score-based confounding adjustment remain unclear.
In a cohort study on the effect of anticonvulsant drugs on cardiovascular risk among adult patients from the HealthCore Integrated Research Database, we compared the performance for confounding control of various covariate-selection strategies for propensity-score estimation (expert knowledge only, expert knowledge informed by empirical covariate selection via high-dimensional propensity-score, and high-dimensional propensity-score empirical specification only) and propensity-score-based adjustment methods (propensity-score-matching and propensity-score-decile stratification). This article focuses on the first 90 days of follow-up because any treatment effect identified in this temporal window almost certainty originates from residual confounding rather than pharmacologic action.
We identified 166,031 new users and 564 ischemic cardiovascular events. Among those, 12,580 patients initiated anticonvulsants that strongly induce cytochrome P450 enzymes and experienced 68 events. The unadjusted hazard ratio was 1.72 (95% confidence interval = 1.34-2.22). Adjustment for investigator-identified covariates led to 41% to 59% reductions in the hazard ratio; adjustment for both investigator-identified and high-dimensional propensity-score empirically identified covariates led to larger reductions (54% to 72%). A selection strategy based on high-dimensional propensity-score empirical specification alone produced less-attenuated and more-volatile hazard ratio estimates. This volatility seemed to be slightly attenuated in a trimmed propensity-score-stratified analysis.
The high-dimensional propensity-score algorithm complements expert knowledge for confounding adjustment, but in settings with few exposed outcomes, its performance without investigator-specified covariates is less clear and may be associated with an increased likelihood of bias. In our example, investigator specification of variables combined with high-dimensional propensity-score empirical selection and the use of trimmed propensity-score-stratified analysis seem to improve effect estimation. Plotting the relation of effect estimates to the increasing number of empirical covariates is a useful diagnostic.
在存在大量潜在混杂因素、频繁暴露和罕见结局的常见观察性研究中,倾向评分可用于混杂因素调整。然而,当暴露结局较少而候选混杂因素较多时,构建和实施基于倾向评分的混杂因素调整的最佳方法仍不明确。
在一项来自 HealthCore 综合研究数据库的成年患者抗癫痫药物对心血管风险影响的队列研究中,我们比较了不同倾向评分估计的协变量选择策略(仅专家知识、基于高维倾向评分的专家知识和经验性协变量选择、仅高维倾向评分经验性指定)和倾向评分调整方法(倾向评分匹配和倾向评分十分位数分层)对混杂控制的效果。本文重点关注随访的前 90 天,因为在此时间窗口内识别出的任何治疗效果几乎肯定源于残余混杂因素而非药物作用。
我们确定了 166031 名新使用者和 564 例缺血性心血管事件。其中,12580 名患者开始使用强烈诱导细胞色素 P450 酶的抗癫痫药物,发生 68 例事件。未调整的风险比为 1.72(95%置信区间为 1.34-2.22)。调整研究者确定的协变量后,风险比降低了 41%至 59%;同时调整研究者确定的和高维倾向评分经验性确定的协变量后,风险比降低更多(54%至 72%)。单纯基于高维倾向评分经验性指定的选择策略产生的风险比估计值衰减较小且波动较大。在修剪后的倾向评分分层分析中,这种波动似乎略有减弱。
高维倾向评分算法可补充专家知识进行混杂调整,但在暴露结局较少的情况下,其在没有研究者指定协变量的情况下的性能尚不清楚,并且可能与偏倚的可能性增加相关。在我们的示例中,研究者指定变量与高维倾向评分经验性选择相结合,并使用修剪后的倾向评分分层分析,似乎可以改善效果估计。绘制效果估计与经验性协变量数量增加的关系是一种有用的诊断方法。