Hashimoto Yuki, Inoue Norihiko, Tani Takuaki, Imai Shinobu
Department of Clinical Data Management and Research, Clinical Research Center, National Hospital Organization Headquarters, 2-5-21 Higashigaoka, Meguroku, 152-8621, Japan, 81 3-5712-5133, 81 3-5712-5088.
Department of Pharmacoepidemiology, Showa University Graduate School of Pharmacy, Shinagawaku, Japan.
JMIR Aging. 2025 May 14;8:e65898. doi: 10.2196/65898.
The global cancer burden is rapidly increasing, with 20 million new cases estimated in 2022. The world population aged ≥65 years is also increasing, projected to reach 15.9% by 2050, making cancer control for older patients urgent. Surgical resection is important for cancer treatment; however, predicting postoperative disability and mortality in older patients is crucial for surgical decision-making, considering the quality of life and care burden. Currently, no model directly predicts postoperative functional disability in this population.
We aimed to develop and validate machine-learning models to predict postoperative functional disability (≥5-point decrease in the Barthel Index) or in-hospital death in patients with cancer aged ≥ 65 years.
This retrospective cohort study included patients aged ≥65 years who underwent surgery for major cancers (lung, stomach, colorectal, liver, pancreatic, breast, or prostate cancer) between April 2016 and March 2023 in 70 Japanese hospitals across 6 regional groups. One group was randomly selected for external validation, while the remaining 5 groups were randomly divided into training (70%) and internal validation (30%) sets. Predictor variables were selected from 37 routinely available preoperative factors through electronic medical records (age, sex, income, comorbidities, laboratory values, and vital signs) using crude odds ratios (P<.1) and the least absolute shrinkage and selection operator method. We developed 6 machine-learning models, including category boosting (CatBoost), extreme gradient boosting (XGBoost), logistic regression, neural networks, random forest, and support vector machine. Model predictive performance was evaluated using the area under the receiver operating characteristic curve (AUC) with 95% CI. We used the Shapley additive explanations (SHAP) method to evaluate contribution to the predictive performance for each predictor variable.
This study included 33,355 patients in the training, 14,294 in the internal validation, and 6711 in the external validation sets. In the training set, 1406/33,355 (4.2%) patients experienced worse discharge. A total of 24 predictor variables were selected for the final models. CatBoost and XGBoost achieved the largest AUCs among the 6 models: 0.81 (95% CI 0.80-0.82) and 0.81 (95% CI 0.80-0.82), respectively. In the top 15 influential factors based on the mean absolute SHAP value, both models shared the same 14 factors such as dementia, age ≥85 years, and gastrointestinal cancer. The CatBoost model showed the largest AUCs in both internal (0.77, 95% CI 0.75-0.79) and external validation (0.72, 95% CI 0.68-0.75).
The CatBoost model demonstrated good performance in predicting postoperative outcomes for older patients with cancer using routinely available preoperative factors. The robustness of these findings was supported by the identical top influential factors between the CatBoost and XGBoost models. This model could support surgical decision-making while considering postoperative quality of life and care burden, with potential for implementation through electronic health records.
全球癌症负担正在迅速增加,2022年估计有2000万新病例。全球65岁及以上的人口也在增加,预计到2050年将达到15.9%,这使得老年患者的癌症控制变得紧迫。手术切除对癌症治疗很重要;然而,考虑到生活质量和护理负担,预测老年患者术后的残疾和死亡率对于手术决策至关重要。目前,尚无模型可直接预测该人群术后的功能残疾情况。
我们旨在开发并验证机器学习模型,以预测65岁及以上癌症患者术后的功能残疾(Barthel指数下降≥5分)或院内死亡情况。
这项回顾性队列研究纳入了2016年4月至2023年3月期间在日本6个地区组的70家医院接受主要癌症(肺癌、胃癌、结直肠癌、肝癌、胰腺癌、乳腺癌或前列腺癌)手术的65岁及以上患者。随机选择一组进行外部验证,其余5组随机分为训练集(70%)和内部验证集(30%)。通过电子病历从37个常规可得的术前因素(年龄、性别、收入、合并症、实验室检查值和生命体征)中,使用粗比值比(P<0.1)和最小绝对收缩和选择算子方法选择预测变量。我们开发了6种机器学习模型,包括类别提升(CatBoost)、极端梯度提升(XGBoost)、逻辑回归、神经网络、随机森林和支持向量机。使用受试者工作特征曲线下面积(AUC)及95%置信区间评估模型预测性能。我们使用Shapley加性解释(SHAP)方法评估每个预测变量对预测性能的贡献。
本研究纳入训练集患者33355例、内部验证集患者14294例和外部验证集患者6711例。在训练集中,1406/33355(4.2%)例患者出院时情况较差。最终模型共选择了24个预测变量。CatBoost和XGBoost在6种模型中AUC最大,分别为0.81(95%CI 0.80-0.82)和0.81(95%CI 0.80-0.82)。在基于平均绝对SHAP值的前15个影响因素中,两个模型共有14个相同因素,如痴呆、年龄≥85岁和胃肠道癌。CatBoost模型在内部验证(0.77,95%CI 0.75-0.79)和外部验证(0.72,95%CI 0.68-0.75)中AUC均最大。
CatBoost模型在使用常规可得的术前因素预测老年癌症患者术后结局方面表现良好。CatBoost和XGBoost模型中相同的顶级影响因素支持了这些发现的稳健性。该模型可以在考虑术后生活质量和护理负担的同时支持手术决策,并且有可能通过电子健康记录来实施。