机器学习用于预测老年癌症患者术后功能残疾和死亡率：回顾性队列研究

Machine Learning for Predicting Postoperative Functional Disability and Mortality Among Older Patients With Cancer: Retrospective Cohort Study.

作者信息

Hashimoto Yuki, Inoue Norihiko, Tani Takuaki, Imai Shinobu

机构信息

Department of Clinical Data Management and Research, Clinical Research Center, National Hospital Organization Headquarters, 2-5-21 Higashigaoka, Meguroku, 152-8621, Japan, 81 3-5712-5133, 81 3-5712-5088.

Department of Pharmacoepidemiology, Showa University Graduate School of Pharmacy, Shinagawaku, Japan.

出版信息

JMIR Aging. 2025 May 14;8:e65898. doi: 10.2196/65898.

DOI:10.2196/65898

PMID:40369796

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12094529/

Abstract

BACKGROUND

The global cancer burden is rapidly increasing, with 20 million new cases estimated in 2022. The world population aged ≥65 years is also increasing, projected to reach 15.9% by 2050, making cancer control for older patients urgent. Surgical resection is important for cancer treatment; however, predicting postoperative disability and mortality in older patients is crucial for surgical decision-making, considering the quality of life and care burden. Currently, no model directly predicts postoperative functional disability in this population.

OBJECTIVE

We aimed to develop and validate machine-learning models to predict postoperative functional disability (≥5-point decrease in the Barthel Index) or in-hospital death in patients with cancer aged ≥ 65 years.

METHODS

This retrospective cohort study included patients aged ≥65 years who underwent surgery for major cancers (lung, stomach, colorectal, liver, pancreatic, breast, or prostate cancer) between April 2016 and March 2023 in 70 Japanese hospitals across 6 regional groups. One group was randomly selected for external validation, while the remaining 5 groups were randomly divided into training (70%) and internal validation (30%) sets. Predictor variables were selected from 37 routinely available preoperative factors through electronic medical records (age, sex, income, comorbidities, laboratory values, and vital signs) using crude odds ratios (P<.1) and the least absolute shrinkage and selection operator method. We developed 6 machine-learning models, including category boosting (CatBoost), extreme gradient boosting (XGBoost), logistic regression, neural networks, random forest, and support vector machine. Model predictive performance was evaluated using the area under the receiver operating characteristic curve (AUC) with 95% CI. We used the Shapley additive explanations (SHAP) method to evaluate contribution to the predictive performance for each predictor variable.

RESULTS

This study included 33,355 patients in the training, 14,294 in the internal validation, and 6711 in the external validation sets. In the training set, 1406/33,355 (4.2%) patients experienced worse discharge. A total of 24 predictor variables were selected for the final models. CatBoost and XGBoost achieved the largest AUCs among the 6 models: 0.81 (95% CI 0.80-0.82) and 0.81 (95% CI 0.80-0.82), respectively. In the top 15 influential factors based on the mean absolute SHAP value, both models shared the same 14 factors such as dementia, age ≥85 years, and gastrointestinal cancer. The CatBoost model showed the largest AUCs in both internal (0.77, 95% CI 0.75-0.79) and external validation (0.72, 95% CI 0.68-0.75).

CONCLUSIONS

The CatBoost model demonstrated good performance in predicting postoperative outcomes for older patients with cancer using routinely available preoperative factors. The robustness of these findings was supported by the identical top influential factors between the CatBoost and XGBoost models. This model could support surgical decision-making while considering postoperative quality of life and care burden, with potential for implementation through electronic health records.

摘要

背景

全球癌症负担正在迅速增加，2022年估计有2000万新病例。全球65岁及以上的人口也在增加，预计到2050年将达到15.9%，这使得老年患者的癌症控制变得紧迫。手术切除对癌症治疗很重要；然而，考虑到生活质量和护理负担，预测老年患者术后的残疾和死亡率对于手术决策至关重要。目前，尚无模型可直接预测该人群术后的功能残疾情况。

目的

我们旨在开发并验证机器学习模型，以预测65岁及以上癌症患者术后的功能残疾（Barthel指数下降≥5分）或院内死亡情况。

方法

这项回顾性队列研究纳入了2016年4月至2023年3月期间在日本6个地区组的70家医院接受主要癌症（肺癌、胃癌、结直肠癌、肝癌、胰腺癌、乳腺癌或前列腺癌）手术的65岁及以上患者。随机选择一组进行外部验证，其余5组随机分为训练集（7０%）和内部验证集（3０%）。通过电子病历从37个常规可得的术前因素（年龄、性别、收入、合并症、实验室检查值和生命体征）中，使用粗比值比（P<0.1）和最小绝对收缩和选择算子方法选择预测变量。我们开发了6种机器学习模型，包括类别提升（CatBoost）、极端梯度提升（XGBoost）、逻辑回归、神经网络、随机森林和支持向量机。使用受试者工作特征曲线下面积（AUC）及95%置信区间评估模型预测性能。我们使用Shapley加性解释（SHAP）方法评估每个预测变量对预测性能的贡献。

结果

本研究纳入训练集患者33355例、内部验证集患者14294例和外部验证集患者6711例。在训练集中，1406/33355（4.2%）例患者出院时情况较差。最终模型共选择了24个预测变量。CatBoost和XGBoost在6种模型中AUC最大，分别为0.81（95%CI 0.80-0.82）和0.81（95%CI 0.80-0.82）。在基于平均绝对SHAP值的前15个影响因素中，两个模型共有14个相同因素，如痴呆、年龄≥85岁和胃肠道癌。CatBoost模型在内部验证（0.77，95%CI 0.75-0.79）和外部验证（0.72，95%CI 0.68-0.75）中AUC均最大。

结论

CatBoost模型在使用常规可得的术前因素预测老年癌症患者术后结局方面表现良好。CatBoost和XGBoost模型中相同的顶级影响因素支持了这些发现的稳健性。该模型可以在考虑术后生活质量和护理负担的同时支持手术决策，并且有可能通过电子健康记录来实施。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e2f/12094529/093a1ea254c3/aging-v8-e65898-g001.jpg

相似文献

Machine Learning for Predicting Postoperative Functional Disability and Mortality Among Older Patients With Cancer: Retrospective Cohort Study.机器学习用于预测老年癌症患者术后功能残疾和死亡率：回顾性队列研究

JMIR Aging. 2025 May 14;8:e65898. doi: 10.2196/65898.

Development and Validation of an Explainable Machine Learning Model for Predicting Myocardial Injury After Noncardiac Surgery in Two Centers in China: Retrospective Study.中国两个中心用于预测非心脏手术后心肌损伤的可解释机器学习模型的开发与验证：一项回顾性研究

JMIR Aging. 2024 Jul 26;7:e54872. doi: 10.2196/54872.

Prediction of STAS in lung adenocarcinoma with nodules ≤ 2 cm using machine learning: a multicenter retrospective study.使用机器学习预测直径≤2 cm的肺腺癌中的STAS：一项多中心回顾性研究

BMC Cancer. 2025 Mar 7;25(1):417. doi: 10.1186/s12885-025-13783-z.

Machine learning-based predictive models for perioperative major adverse cardiovascular events in patients with stable coronary artery disease undergoing noncardiac surgery.基于机器学习的预测模型用于接受非心脏手术的稳定冠状动脉疾病患者围手术期主要不良心血管事件的预测

Comput Methods Programs Biomed. 2025 Mar;260:108561. doi: 10.1016/j.cmpb.2024.108561. Epub 2024 Dec 13.

Development of a 5-Year Risk Prediction Model for Transition From Prediabetes to Diabetes Using Machine Learning: Retrospective Cohort Study.使用机器学习开发一个用于预测糖尿病前期转变为糖尿病的5年风险预测模型：回顾性队列研究。

J Med Internet Res. 2025 May 9;27:e73190. doi: 10.2196/73190.

Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者？

Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.

Machine-learning Models Predict 30-Day Mortality, Cardiovascular Complications, and Respiratory Complications After Aseptic Revision Total Joint Arthroplasty.机器学习模型预测无菌翻修全关节置换术后 30 天死亡率、心血管并发症和呼吸系统并发症。

Clin Orthop Relat Res. 2022 Nov 1;480(11):2137-2145. doi: 10.1097/CORR.0000000000002276. Epub 2022 Jun 20.

Clinical decision support systems for 3-month mortality in elderly patients admitted to ICU with ischemic stroke using interpretable machine learning.使用可解释机器学习的针对入住重症监护病房的老年缺血性中风患者3个月死亡率的临床决策支持系统

Digit Health. 2024 Sep 17;10:20552076241280126. doi: 10.1177/20552076241280126. eCollection 2024 Jan-Dec.

Application of machine learning model in predicting the likelihood of blood transfusion after hip fracture surgery.机器学习模型在预测髋部骨折手术后输血可能性中的应用。

Aging Clin Exp Res. 2023 Nov;35(11):2643-2656. doi: 10.1007/s40520-023-02550-4. Epub 2023 Sep 21.

Prognosis prediction and risk stratification of transarterial chemoembolization or intraarterial chemotherapy for unresectable hepatocellular carcinoma based on machine learning.基于机器学习的经动脉化疗栓塞术或动脉内化疗对不可切除肝细胞癌的预后预测及风险分层

Eur Radiol. 2024 Aug;34(8):5094-5107. doi: 10.1007/s00330-024-10581-2. Epub 2024 Jan 30.

本文引用的文献

JMIR Aging. 2024 Jul 26;7:e54872. doi: 10.2196/54872.

Global cancer burden growing, amidst mounting need for services.全球癌症负担不断增加，对服务的需求也日益迫切。

Saudi Med J. 2024 Mar;45(3):326-327.

Interpretable machine learning models for predicting venous thromboembolism in the intensive care unit: an analysis based on data from 207 centers.用于预测重症监护病房静脉血栓栓塞症的可解释机器学习模型：基于来自 207 个中心的数据的分析。

Crit Care. 2023 Oct 24;27(1):406. doi: 10.1186/s13054-023-04683-4.

Long-Term Risk of Being Bedridden in Elderly Patients Who Underwent Oncologic Surgery: A Retrospective Study Using a Japanese Claims Database.老年肿瘤手术患者长期卧床的风险：使用日本理赔数据库的回顾性研究。

Ann Surg Oncol. 2023 Aug;30(8):4604-4612. doi: 10.1245/s10434-023-13566-5. Epub 2023 May 6.

External beam radiotherapy combination is a risk factor for bladder cancer in patients with prostate cancer treated with brachytherapy.外照射放疗联合是接受近距离放疗的前列腺癌患者发生膀胱癌的一个危险因素。

World J Urol. 2023 May;41(5):1317-1321. doi: 10.1007/s00345-023-04380-5. Epub 2023 Apr 6.

Patterns of staging, treatment, and mortality in gastric, colorectal, and lung cancer among older adults with and without preexisting dementia: a Japanese multicentre cohort study.老年人胃癌、结直肠癌和肺癌的分期、治疗和死亡率模式：有无预先存在痴呆的日本多中心队列研究。

BMC Cancer. 2023 Jan 19;23(1):67. doi: 10.1186/s12885-022-10411-y.

Hospital-associated disability and hospitalization costs for acute heart failure stratified by body mass index- insight from the JROAD/JROAD-DPC database.根据体重指数分层的急性心力衰竭的医院相关残疾和住院费用：来自 JROAD/JROAD-DPC 数据库的见解。

Int J Cardiol. 2022 Nov 15;367:38-44. doi: 10.1016/j.ijcard.2022.08.044. Epub 2022 Aug 24.

Survival and long-term surgical outcomes after colorectal surgery: are there any gender-related differences?结直肠手术后的生存和长期手术结果：是否存在与性别相关的差异？

Updates Surg. 2022 Aug;74(4):1337-1343. doi: 10.1007/s13304-022-01323-4. Epub 2022 Jul 9.

Existing Data Sources for Clinical Epidemiology: Database of the National Hospital Organization in Japan.临床流行病学的现有数据来源：日本国立医院组织数据库

Clin Epidemiol. 2022 May 19;14:689-698. doi: 10.2147/CLEP.S359072. eCollection 2022.

Machine learning-based diagnosis and risk factor analysis of cardiocerebrovascular disease based on KNHANES.基于 KNHANES 的基于机器学习的心血脑管疾病诊断和风险因素分析。

Sci Rep. 2022 Feb 10;12(1):2250. doi: 10.1038/s41598-022-06333-1.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

机器学习用于预测老年癌症患者术后功能残疾和死亡率：回顾性队列研究

Machine Learning for Predicting Postoperative Functional Disability and Mortality Among Older Patients With Cancer: Retrospective Cohort Study.

作者信息

机构信息

出版信息

BACKGROUND

OBJECTIVE

METHODS

RESULTS

CONCLUSIONS

背景

目的

方法

结果

结论

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献