机器学习方法预测癌症患者 6 个月死亡率。

Machine Learning Approaches to Predict 6-Month Mortality Among Patients With Cancer.

机构信息

Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia.

Abramson Cancer Center, University of Pennsylvania, Philadelphia.

出版信息

JAMA Netw Open. 2019 Oct 2;2(10):e1915997. doi: 10.1001/jamanetworkopen.2019.15997.

DOI:10.1001/jamanetworkopen.2019.15997

PMID:31651973

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6822091/

Abstract

IMPORTANCE

Machine learning algorithms could identify patients with cancer who are at risk of short-term mortality. However, it is unclear how different machine learning algorithms compare and whether they could prompt clinicians to have timely conversations about treatment and end-of-life preferences.

OBJECTIVES

To develop, validate, and compare machine learning algorithms that use structured electronic health record data before a clinic visit to predict mortality among patients with cancer.

DESIGN, SETTING, AND PARTICIPANTS: Cohort study of 26 525 adult patients who had outpatient oncology or hematology/oncology encounters at a large academic cancer center and 10 affiliated community practices between February 1, 2016, and July 1, 2016. Patients were not required to receive cancer-directed treatment. Patients were observed for up to 500 days after the encounter. Data analysis took place between October 1, 2018, and September 1, 2019.

EXPOSURES

Logistic regression, gradient boosting, and random forest algorithms.

MAIN OUTCOMES AND MEASURES

Primary outcome was 180-day mortality from the index encounter; secondary outcome was 500-day mortality from the index encounter.

RESULTS

Among 26 525 patients in the analysis, 1065 (4.0%) died within 180 days of the index encounter. Among those who died, the mean age was 67.3 (95% CI, 66.5-68.0) years, and 500 (47.0%) were women. Among those who were alive at 180 days, the mean age was 61.3 (95% CI, 61.1-61.5) years, and 15 922 (62.5%) were women. The population was randomly partitioned into training (18 567 [70.0%]) and validation (7958 [30.0%]) cohorts at the patient level, and a randomly selected encounter was included in either the training or validation set. At a prespecified alert rate of 0.02, positive predictive values were higher for the random forest (51.3%) and gradient boosting (49.4%) algorithms compared with the logistic regression algorithm (44.7%). There was no significant difference in discrimination among the random forest (area under the receiver operating characteristic curve [AUC], 0.88; 95% CI, 0.86-0.89), gradient boosting (AUC, 0.87; 95% CI, 0.85-0.89), and logistic regression (AUC, 0.86; 95% CI, 0.84-0.88) models (P for comparison = .02). In the random forest model, observed 180-day mortality was 51.3% (95% CI, 43.6%-58.8%) in the high-risk group vs 3.4% (95% CI, 3.0%-3.8%) in the low-risk group; at 500 days, observed mortality was 64.4% (95% CI, 56.7%-71.4%) in the high-risk group and 7.6% (7.0%-8.2%) in the low-risk group. In a survey of 15 oncology clinicians with a 52.1% response rate, 100 of 171 patients (58.8%) who had been flagged as having high risk by the gradient boosting algorithm were deemed appropriate for a conversation about treatment and end-of-life preferences in the upcoming week.

CONCLUSIONS AND RELEVANCE

In this cohort study, machine learning algorithms based on structured electronic health record data accurately identified patients with cancer at risk of short-term mortality. When the gradient boosting algorithm was applied in real time, clinicians believed that most patients who had been identified as having high risk were appropriate for a timely conversation about treatment and end-of-life preferences.

摘要

重要性

机器学习算法可以识别出癌症患者中短期死亡率高的患者。然而，不同的机器学习算法之间的比较以及它们是否能促使临床医生及时就治疗和临终关怀偏好进行对话尚不清楚。

目的

开发、验证和比较使用在门诊就诊前的结构化电子健康记录数据的机器学习算法，以预测癌症患者的死亡率。

设计、设置和参与者：这是一项在 2016 年 2 月 1 日至 2016 年 7 月 1 日期间在一家大型学术癌症中心和 10 家附属社区诊所进行的门诊肿瘤学或血液学/肿瘤学就诊的 26525 例成年患者的队列研究。患者无需接受癌症靶向治疗。在就诊后，对患者进行了长达 500 天的观察。数据分析于 2018 年 10 月 1 日至 2019 年 9 月 1 日进行。

暴露因素

逻辑回归、梯度提升和随机森林算法。

主要结果和测量

主要结果是从就诊开始的 180 天死亡率；次要结果是从就诊开始的 500 天死亡率。

结果

在分析的 26525 例患者中，有 1065 例（4.0%）在就诊后 180 天内死亡。在死亡的患者中，平均年龄为 67.3 岁（95%CI，66.5-68.0），500 例（47.0%）为女性。在 180 天存活的患者中，平均年龄为 61.3 岁（95%CI，61.1-61.5），15922 例（62.5%）为女性。人群按患者水平随机分为训练集（18567 例[70.0%]）和验证集（7958 例[30.0%]），随机选择的就诊既包含在训练集中也包含在验证集中。在预设的 0.02 警报率下，随机森林（51.3%）和梯度提升（49.4%）算法的阳性预测值高于逻辑回归算法（44.7%）。随机森林（曲线下接收者操作特征面积[AUC]，0.88；95%CI，0.86-0.89）、梯度提升（AUC，0.87；95%CI，0.85-0.89）和逻辑回归（AUC，0.86；95%CI，0.84-0.88）模型之间的区分度没有显著差异（比较的 P 值=0.02）。在随机森林模型中，观察到的 180 天死亡率在高危组为 51.3%（95%CI，43.6%-58.8%），在低危组为 3.4%（95%CI，3.0%-3.8%）；在 500 天时，高危组观察到的死亡率为 64.4%（95%CI，56.7%-71.4%），低危组为 7.6%（7.0%-8.2%）。在对 15 名肿瘤学临床医生进行的一项调查中，有 171 名患者中的 100 名（58.8%）被梯度提升算法标记为有高风险，临床医生认为，在接下来的一周内，大多数被认为有高风险的患者都适合进行关于治疗和临终关怀偏好的对话。

结论和相关性

在这项队列研究中，基于结构化电子健康记录数据的机器学习算法准确识别出了癌症患者中短期死亡率高的患者。当梯度提升算法实时应用时，临床医生认为，大多数被识别为高风险的患者都适合及时进行关于治疗和临终关怀偏好的对话。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d37/6822091/a5e59f8c2ce6/jamanetwopen-2-e1915997-g001.jpg

相似文献

Machine Learning Approaches to Predict 6-Month Mortality Among Patients With Cancer.机器学习方法预测癌症患者 6 个月死亡率。

JAMA Netw Open. 2019 Oct 2;2(10):e1915997. doi: 10.1001/jamanetworkopen.2019.15997.

Validation of a Machine Learning Algorithm to Predict 180-Day Mortality for Outpatients With Cancer.机器学习算法预测癌症门诊患者 180 天死亡率的验证。

JAMA Oncol. 2020 Nov 1;6(11):1723-1730. doi: 10.1001/jamaoncol.2020.4331.

Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者？

Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.

Development, Validation, and Evaluation of a Simple Machine Learning Model to Predict Cirrhosis Mortality.开发、验证和评估一种简单的机器学习模型以预测肝硬化死亡率。

JAMA Netw Open. 2020 Nov 2;3(11):e2023780. doi: 10.1001/jamanetworkopen.2020.23780.

Development and Validation of an Electronic Health Record-Based Machine Learning Model to Estimate Delirium Risk in Newly Hospitalized Patients Without Known Cognitive Impairment.基于电子病历的机器学习模型开发与验证：用于预测无已知认知障碍的新入院患者发生谵妄的风险。

JAMA Netw Open. 2018 Aug 3;1(4):e181018. doi: 10.1001/jamanetworkopen.2018.1018.

Development and Validation of a Machine Learning Algorithm Predicting Emergency Department Use and Unplanned Hospitalization in Patients With Head and Neck Cancer.开发和验证一种机器学习算法，用于预测头颈部癌症患者在急诊科的使用情况和非计划性住院。

JAMA Otolaryngol Head Neck Surg. 2022 Aug 1;148(8):764-772. doi: 10.1001/jamaoto.2022.1629.

Development and Application of a Machine Learning Approach to Assess Short-term Mortality Risk Among Patients With Cancer Starting Chemotherapy.开发和应用机器学习方法评估开始化疗的癌症患者的短期死亡风险。

JAMA Netw Open. 2018 Jul 6;1(3):e180926. doi: 10.1001/jamanetworkopen.2018.0926.

Evaluation of Machine-Learning Algorithms for Predicting Opioid Overdose Risk Among Medicare Beneficiaries With Opioid Prescriptions.评估机器学习算法在预测有阿片类药物处方的医疗保险受益人群中阿片类药物过量风险中的应用。

JAMA Netw Open. 2019 Mar 1;2(3):e190968. doi: 10.1001/jamanetworkopen.2019.0968.

Predictors of 30-Day Mortality Among Dutch Patients Undergoing Colorectal Cancer Surgery, 2011-2016.2011-2016 年荷兰结直肠癌手术患者 30 天死亡率的预测因素。

JAMA Netw Open. 2021 Apr 1;4(4):e217737. doi: 10.1001/jamanetworkopen.2021.7737.

Effect of Integrating Machine Learning Mortality Estimates With Behavioral Nudges to Clinicians on Serious Illness Conversations Among Patients With Cancer: A Stepped-Wedge Cluster Randomized Clinical Trial.将机器学习死亡率估计与行为提示相结合，为临床医生提供指导，以改善癌症患者的严重疾病沟通：一项 stepped-wedge 聚类随机临床试验。

JAMA Oncol. 2020 Dec 1;6(12):e204759. doi: 10.1001/jamaoncol.2020.4759. Epub 2020 Dec 10.

引用本文的文献

Performance Drift in a Nationally Deployed Population Health Risk Algorithm in the US Veterans Health Administration.美国退伍军人健康管理局全国部署的人口健康风险算法中的性能漂移

JAMA Health Forum. 2025 Aug 1;6(8):e252717. doi: 10.1001/jamahealthforum.2025.2717.

Doing More with Less: Predicting Primary Care Provider Effectiveness.以更少投入实现更多成果：预测初级保健提供者的效能。

Rev Econ Stat. 2025 Mar;107(2):289-305. doi: 10.1162/rest_a_01290. Epub 2025 Mar 12.

Economic analysis of an AI-enabled ECG alert system: impact on mortality outcomes from a pragmatic randomized trial.人工智能心电图警报系统的经济分析：一项实用随机试验对死亡率结果的影响。

NPJ Digit Med. 2025 Jun 11;8(1):348. doi: 10.1038/s41746-025-01735-7.

Machine learning model for prediction of palliative care phases in patients with advanced cancer: a retrospective study.用于预测晚期癌症患者姑息治疗阶段的机器学习模型：一项回顾性研究。

BMC Palliat Care. 2025 May 24;24(1):148. doi: 10.1186/s12904-025-01785-4.

Perceptions, Attitudes, and Concerns on Artificial Intelligence Applications in Patients with Cancer.癌症患者对人工智能应用的认知、态度及担忧

Cancer Control. 2025 Jan-Dec;32:10732748251343245. doi: 10.1177/10732748251343245. Epub 2025 May 23.

Construction and evaluation of machine learning-based prediction model for live birth following fresh embryo transfer in IVF/ICSI patients with polycystic ovary syndrome.基于机器学习的预测模型在多囊卵巢综合征体外受精/卵胞浆内单精子注射患者新鲜胚胎移植后活产率的构建与评估

J Ovarian Res. 2025 Apr 4;18(1):70. doi: 10.1186/s13048-025-01654-x.

A SEMIPARAMETRIC METHOD FOR RISK PREDICTION USING INTEGRATED ELECTRONIC HEALTH RECORD DATA.一种使用综合电子健康记录数据进行风险预测的半参数方法。

Ann Appl Stat. 2024 Dec;18(4):3318-3337. doi: 10.1214/24-AOAS1938. Epub 2024 Oct 31.

Hallmarks of artificial intelligence contributions to precision oncology.人工智能对精准肿瘤学贡献的标志。

Nat Cancer. 2025 Mar;6(3):417-431. doi: 10.1038/s43018-025-00917-2. Epub 2025 Mar 7.

Multidisciplinary clinician perceptions on utility of a machine learning tool (ALERT) to predict 6-month mortality and improve end-of-life outcomes for advanced cancer patients.多学科临床医生对一种机器学习工具（ALERT）在预测晚期癌症患者6个月死亡率及改善临终结局方面的效用的看法。

Cancer Med. 2025 Mar;14(5):e70137. doi: 10.1002/cam4.70137.

Quality Improvement Study Using a Machine Learning Mortality Risk Prediction Model Notification System on Advance Care Planning in High-Risk Patients.使用机器学习死亡率风险预测模型通知系统对高危患者进行预先护理计划的质量改进研究。

J Brown Hosp Med. 2024 Jul 2;3(3):120907. doi: 10.56305/001c.120907. eCollection 2024.

本文引用的文献

Electronic Health Record Mortality Prediction Model for Targeted Palliative Care Among Hospitalized Medical Patients: a Pilot Quasi-experimental Study.电子健康记录在院医疗患者目标性姑息治疗死亡率预测模型：一项试点类实验研究。

J Gen Intern Med. 2019 Sep;34(9):1841-1847. doi: 10.1007/s11606-019-05169-2. Epub 2019 Jul 16.

Evaluation of Mortality Data From the Social Security Administration Death Master File for Clinical Research.利用社会安全管理局死亡主档案评估临床研究中的死亡率数据。

JAMA Cardiol. 2019 Apr 1;4(4):375-379. doi: 10.1001/jamacardio.2019.0198.

Applied Informatics Decision Support Tool for Mortality Predictions in Patients With Cancer.用于癌症患者死亡率预测的应用信息学决策支持工具

JCO Clin Cancer Inform. 2018 Dec;2:1-11. doi: 10.1200/CCI.18.00003.

JAMA Netw Open. 2018 Jul 6;1(3):e180926. doi: 10.1001/jamanetworkopen.2018.0926.

Advance Care Planning Among Patients With Advanced Cancer.晚期癌症患者的预先医疗指示计划。

J Oncol Pract. 2019 Jan;15(1):e65-e73. doi: 10.1200/JOP.18.00044. Epub 2018 Dec 13.

Development and Validation of Machine Learning Models for Prediction of 1-Year Mortality Utilizing Electronic Medical Record Data Available at the End of Hospitalization in Multicondition Patients: a Proof-of-Concept Study.利用多病种患者住院结束时可获取的电子病历数据开发和验证机器学习模型预测 1 年死亡率：概念验证研究。

J Gen Intern Med. 2018 Jun;33(6):921-928. doi: 10.1007/s11606-018-4316-y. Epub 2018 Jan 30.

Lung cancer prognostic index: a risk score to predict overall survival after the diagnosis of non-small-cell lung cancer.肺癌预后指数：一种预测非小细胞肺癌诊断后总生存期的风险评分。

Br J Cancer. 2017 Aug 22;117(5):744-751. doi: 10.1038/bjc.2017.232. Epub 2017 Jul 20.

Can machine-learning improve cardiovascular risk prediction using routine clinical data?机器学习能否利用常规临床数据改善心血管疾病风险预测？

PLoS One. 2017 Apr 4;12(4):e0174944. doi: 10.1371/journal.pone.0174944. eCollection 2017.

Clinicians' Expectations of the Benefits and Harms of Treatments, Screening, and Tests: A Systematic Review.临床医生对治疗、筛查和检测的获益和危害的期望：系统评价。

JAMA Intern Med. 2017 Mar 1;177(3):407-419. doi: 10.1001/jamainternmed.2016.8254.

Integration of Palliative Care Into Standard Oncology Care: American Society of Clinical Oncology Clinical Practice Guideline Update.姑息治疗融入标准肿瘤学治疗中：美国临床肿瘤学会临床实践指南更新。

J Clin Oncol. 2017 Jan;35(1):96-112. doi: 10.1200/JCO.2016.70.1474. Epub 2016 Oct 28.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

机器学习方法预测癌症患者 6 个月死亡率。

Machine Learning Approaches to Predict 6-Month Mortality Among Patients With Cancer.

机构信息

出版信息

IMPORTANCE

OBJECTIVES

EXPOSURES

MAIN OUTCOMES AND MEASURES

RESULTS

CONCLUSIONS AND RELEVANCE

重要性

目的

暴露因素

主要结果和测量

结果

结论和相关性

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献