Duan Tony, Rajpurkar Pranav, Laird Dillon, Ng Andrew Y, Basu Sanjay
Department of Computer Science (T.D., P.R., D.L., A.Y.N.), Stanford University, Stanford, CA.
Center for Primary Care and Outcomes Research and Center for Population Health Sciences, Departments of Medicine and of Health Research and Policy (S.B.), Stanford University, Stanford, CA.
Circ Cardiovasc Qual Outcomes. 2019 Mar;12(3):e005010. doi: 10.1161/CIRCOUTCOMES.118.005010.
The absolute risk reduction (ARR) in cardiovascular events from therapy is generally assumed to be proportional to baseline risk-such that high-risk patients benefit most. Yet newer analyses have proposed using randomized trial data to develop models that estimate individual treatment effects. We tested 2 hypotheses: first, that models of individual treatment effects would reveal that benefit from intensive blood pressure therapy is proportional to baseline risk; and second, that a machine learning approach designed to predict heterogeneous treatment effects-the X-learner meta-algorithm-is equivalent to a conventional logistic regression approach.
We compared conventional logistic regression to the X-learner approach for prediction of 3-year cardiovascular disease event risk reduction from intensive (target systolic blood pressure <120 mm Hg) versus standard (target <140 mm Hg) blood pressure treatment, using individual participant data from the SPRINT (Systolic Blood Pressure Intervention Trial; N=9361) and ACCORD BP (Action to Control Cardiovascular Risk in Diabetes Blood Pressure; N=4733) trials. Each model incorporated 17 covariates, an indicator for treatment arm, and interaction terms between covariates and treatment. Logistic regression had lower C statistic for benefit than the X-learner (0.51 [95% CI, 0.49-0.53] versus 0.60 [95% CI, 0.58-0.63], respectively). Following the logistic regression's recommendation for individualized therapy produced restricted mean time until cardiovascular disease event of 1065.47 days (95% CI, 1061.04-1069.35), while following the X-learner's recommendation improved mean time until cardiovascular disease event to 1068.71 days (95% CI, 1065.42-1072.08). Calibration was worse for logistic regression; it over-estimated ARR attributable to intensive treatment (slope between predicted and observed ARR of 0.73 [95% CI, 0.30-1.14] versus 1.06 [95% CI, 0.74-1.32] for the X-learner, compared with the ideal of 1). Predicted ARRs using logistic regression were generally proportional to baseline pretreatment cardiovascular risk, whereas the X-learner observed-correctly-that individual treatment effects were often not proportional to baseline risk.
Predictions for individual treatment effects from trial data reveal that patients may experience ARRs not simply proportional to baseline cardiovascular disease risk. Machine learning methods may improve discrimination and calibration of individualized treatment effect estimates from clinical trial data.
URL: https://www.clinicaltrials.gov . Unique identifiers: NCT01206062; NCT00000620.
通常认为治疗带来的心血管事件绝对风险降低(ARR)与基线风险成正比,即高危患者获益最大。然而,最新分析建议使用随机试验数据来建立估计个体治疗效果的模型。我们检验了两个假设:第一,个体治疗效果模型将显示强化血压治疗的获益与基线风险成正比;第二,一种旨在预测异质性治疗效果的机器学习方法——X学习者元算法——等同于传统的逻辑回归方法。
我们使用收缩压干预试验(SPRINT;N = 9361)和糖尿病血压心血管风险控制行动(ACCORD BP;N = 4733)试验的个体参与者数据,比较了传统逻辑回归与X学习者方法对强化(目标收缩压<120 mmHg)与标准(目标<140 mmHg)血压治疗3年心血管疾病事件风险降低的预测情况。每个模型纳入了17个协变量、一个治疗组指标以及协变量与治疗之间的交互项。逻辑回归在预测获益方面的C统计量低于X学习者(分别为0.51 [95%CI,0.49 - 0.53]和0.60 [95%CI,0.58 - 0.63])。遵循逻辑回归关于个体化治疗的建议得出,直至心血管疾病事件的受限平均时间为1065.47天(95%CI,1061.04 - 1069.35),而遵循X学习者的建议可将直至心血管疾病事件的平均时间改善至1068.71天(95%CI,1065.42 - 1072.08)。逻辑回归的校准情况更差;它高估了强化治疗所致的ARR(预测与观察到的ARR之间的斜率为0.73 [95%CI,0.30 - 1.14],而X学习者为1.06 [95%CI,0.74 - 1.32],理想情况为1)。使用逻辑回归预测的ARR通常与基线治疗前心血管风险成正比,而X学习者正确观察到个体治疗效果往往与基线风险不成正比。
根据试验数据对个体治疗效果的预测表明,患者经历的ARR可能并非简单地与基线心血管疾病风险成正比。机器学习方法可能会改善从临床试验数据估计个体治疗效果的辨别力和校准情况。
网址:https://www.clinicaltrials.gov 。唯一标识符:NCT01206062;NCT00000620。