Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia.
Abramson Cancer Center, University of Pennsylvania, Philadelphia.
JAMA Netw Open. 2019 Oct 2;2(10):e1915997. doi: 10.1001/jamanetworkopen.2019.15997.
Machine learning algorithms could identify patients with cancer who are at risk of short-term mortality. However, it is unclear how different machine learning algorithms compare and whether they could prompt clinicians to have timely conversations about treatment and end-of-life preferences.
To develop, validate, and compare machine learning algorithms that use structured electronic health record data before a clinic visit to predict mortality among patients with cancer.
DESIGN, SETTING, AND PARTICIPANTS: Cohort study of 26 525 adult patients who had outpatient oncology or hematology/oncology encounters at a large academic cancer center and 10 affiliated community practices between February 1, 2016, and July 1, 2016. Patients were not required to receive cancer-directed treatment. Patients were observed for up to 500 days after the encounter. Data analysis took place between October 1, 2018, and September 1, 2019.
Logistic regression, gradient boosting, and random forest algorithms.
Primary outcome was 180-day mortality from the index encounter; secondary outcome was 500-day mortality from the index encounter.
Among 26 525 patients in the analysis, 1065 (4.0%) died within 180 days of the index encounter. Among those who died, the mean age was 67.3 (95% CI, 66.5-68.0) years, and 500 (47.0%) were women. Among those who were alive at 180 days, the mean age was 61.3 (95% CI, 61.1-61.5) years, and 15 922 (62.5%) were women. The population was randomly partitioned into training (18 567 [70.0%]) and validation (7958 [30.0%]) cohorts at the patient level, and a randomly selected encounter was included in either the training or validation set. At a prespecified alert rate of 0.02, positive predictive values were higher for the random forest (51.3%) and gradient boosting (49.4%) algorithms compared with the logistic regression algorithm (44.7%). There was no significant difference in discrimination among the random forest (area under the receiver operating characteristic curve [AUC], 0.88; 95% CI, 0.86-0.89), gradient boosting (AUC, 0.87; 95% CI, 0.85-0.89), and logistic regression (AUC, 0.86; 95% CI, 0.84-0.88) models (P for comparison = .02). In the random forest model, observed 180-day mortality was 51.3% (95% CI, 43.6%-58.8%) in the high-risk group vs 3.4% (95% CI, 3.0%-3.8%) in the low-risk group; at 500 days, observed mortality was 64.4% (95% CI, 56.7%-71.4%) in the high-risk group and 7.6% (7.0%-8.2%) in the low-risk group. In a survey of 15 oncology clinicians with a 52.1% response rate, 100 of 171 patients (58.8%) who had been flagged as having high risk by the gradient boosting algorithm were deemed appropriate for a conversation about treatment and end-of-life preferences in the upcoming week.
In this cohort study, machine learning algorithms based on structured electronic health record data accurately identified patients with cancer at risk of short-term mortality. When the gradient boosting algorithm was applied in real time, clinicians believed that most patients who had been identified as having high risk were appropriate for a timely conversation about treatment and end-of-life preferences.
机器学习算法可以识别出癌症患者中短期死亡率高的患者。然而,不同的机器学习算法之间的比较以及它们是否能促使临床医生及时就治疗和临终关怀偏好进行对话尚不清楚。
开发、验证和比较使用在门诊就诊前的结构化电子健康记录数据的机器学习算法,以预测癌症患者的死亡率。
设计、设置和参与者:这是一项在 2016 年 2 月 1 日至 2016 年 7 月 1 日期间在一家大型学术癌症中心和 10 家附属社区诊所进行的门诊肿瘤学或血液学/肿瘤学就诊的 26525 例成年患者的队列研究。患者无需接受癌症靶向治疗。在就诊后,对患者进行了长达 500 天的观察。数据分析于 2018 年 10 月 1 日至 2019 年 9 月 1 日进行。
逻辑回归、梯度提升和随机森林算法。
主要结果是从就诊开始的 180 天死亡率;次要结果是从就诊开始的 500 天死亡率。
在分析的 26525 例患者中,有 1065 例(4.0%)在就诊后 180 天内死亡。在死亡的患者中,平均年龄为 67.3 岁(95%CI,66.5-68.0),500 例(47.0%)为女性。在 180 天存活的患者中,平均年龄为 61.3 岁(95%CI,61.1-61.5),15922 例(62.5%)为女性。人群按患者水平随机分为训练集(18567 例[70.0%])和验证集(7958 例[30.0%]),随机选择的就诊既包含在训练集中也包含在验证集中。在预设的 0.02 警报率下,随机森林(51.3%)和梯度提升(49.4%)算法的阳性预测值高于逻辑回归算法(44.7%)。随机森林(曲线下接收者操作特征面积[AUC],0.88;95%CI,0.86-0.89)、梯度提升(AUC,0.87;95%CI,0.85-0.89)和逻辑回归(AUC,0.86;95%CI,0.84-0.88)模型之间的区分度没有显著差异(比较的 P 值=0.02)。在随机森林模型中,观察到的 180 天死亡率在高危组为 51.3%(95%CI,43.6%-58.8%),在低危组为 3.4%(95%CI,3.0%-3.8%);在 500 天时,高危组观察到的死亡率为 64.4%(95%CI,56.7%-71.4%),低危组为 7.6%(7.0%-8.2%)。在对 15 名肿瘤学临床医生进行的一项调查中,有 171 名患者中的 100 名(58.8%)被梯度提升算法标记为有高风险,临床医生认为,在接下来的一周内,大多数被认为有高风险的患者都适合进行关于治疗和临终关怀偏好的对话。
在这项队列研究中,基于结构化电子健康记录数据的机器学习算法准确识别出了癌症患者中短期死亡率高的患者。当梯度提升算法实时应用时,临床医生认为,大多数被识别为高风险的患者都适合及时进行关于治疗和临终关怀偏好的对话。