Suppr超能文献

基于机器学习预测新辅助化疗后非病理完全缓解的乳腺癌患者无病生存期:一项回顾性多中心队列研究

Machine learning-based prediction of disease-free survival in breast cancer patients with non-pathological complete response after neoadjuvant chemotherapy: a retrospective multicenter cohort study.

作者信息

Zhang Zi-Ran, Wang Chao-Xian, Wang Huan, Jin Si-Li

机构信息

Department of Breast Diseases, Jiaxing Women and Children's Hospital, Wenzhou Medical University Jiaxing 314000, Zhejiang, P. R. China.

出版信息

Am J Cancer Res. 2025 Jun 15;15(6):2482-2499. doi: 10.62347/MHSV3723. eCollection 2025.

Abstract

This study aimed to construct a robust machine learning (ML) model for predicting the disease-free survival (DFS) and risk stratification of breast cancer (BC) patients with non-pathological complete response (non-PCR) after neoadjuvant chemotherapy (NAC). The model will facilitate the initiation of early interventions for high-risk patients. This retrospective multicenter cohort study included BC patients from two hospitals in China who received NAC but did not achieve PCR. Four ML algorithms were utilized to construct models based on patients' clinicopathological data, followed by a performance evaluation of these models. To improve the interpretability of the model, the shapley additive explanation (SHAP) method was employed to analyze the contribution of each feature to the predictive outcomes. A total of 463 non-PCR patients were included in the study. Of these, 385 patients were from Ruijin Hospital, affiliated with Shanghai Jiao Tong University, and were randomly split into a training cohort and an internal validation cohort in a 3:1 ratio for model development and preliminary performance evaluation. In addition, 78 patients enrolled from Jiaxing Women and Children's Hospital were assigned to the external validation cohort to evaluate the model's generalizability. Univariate and multivariate Cox regression analyses demonstrated that age, residual tumor size, Ki67 change, molecular subtype, and axillary lymph node metastasis were independent factors influencing DFS. Among the four ML models, the random survival forest (RSF) model showed the best performance, with a concordance index of 0.820 in the training cohort, 0.642 in the internal validation cohort, and 0.689 in the external validation cohort. Further analysis revealed that the RSF model had excellent discriminative ability with a high area under curve value, while its low Brier score indicated excellent calibration. Decision curve analysis indicated that the RSF model offered a higher clinical net benefit at various time points and effectively stratified risk, successfully identifying high-risk patients. SHAP analysis underscored residual tumor size as the most influential predictive feature. The RSF model can effectively predict DFS and risk of BC patients with non-PCR following NAC, offering a critical reference for developing individualized treatment strategies.

摘要

本研究旨在构建一个强大的机器学习(ML)模型,用于预测接受新辅助化疗(NAC)后未达到病理完全缓解(非PCR)的乳腺癌(BC)患者的无病生存期(DFS)和风险分层。该模型将有助于为高危患者启动早期干预措施。这项回顾性多中心队列研究纳入了来自中国两家医院的接受NAC但未达到PCR的BC患者。利用四种ML算法基于患者的临床病理数据构建模型,随后对这些模型进行性能评估。为了提高模型的可解释性,采用Shapley加性解释(SHAP)方法分析每个特征对预测结果的贡献。该研究共纳入463例非PCR患者。其中,385例患者来自上海交通大学附属瑞金医院,并以3:1的比例随机分为训练队列和内部验证队列,用于模型开发和初步性能评估。此外,从嘉兴市妇幼保健院招募的78例患者被分配到外部验证队列,以评估模型的泛化能力。单因素和多因素Cox回归分析表明,年龄、残余肿瘤大小、Ki67变化、分子亚型和腋窝淋巴结转移是影响DFS的独立因素。在四个ML模型中,随机生存森林(RSF)模型表现最佳,训练队列中的一致性指数为0.820,内部验证队列中为0.642,外部验证队列中为0.689。进一步分析显示,RSF模型具有出色的判别能力,曲线下面积值较高,而其较低的Brier评分表明校准良好。决策曲线分析表明,RSF模型在各个时间点都提供了更高的临床净效益,并有效地进行了风险分层,成功识别出高危患者。SHAP分析强调残余肿瘤大小是最具影响力的预测特征。RSF模型可以有效预测NAC后非PCR的BC患者的DFS和风险,为制定个体化治疗策略提供重要参考。

相似文献

3
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
8
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

本文引用的文献

1
Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries.
CA Cancer J Clin. 2024 May-Jun;74(3):229-263. doi: 10.3322/caac.21834. Epub 2024 Apr 4.
2
Machine learning‑based prediction of survival prognosis in esophageal squamous cell carcinoma.
Sci Rep. 2023 Aug 19;13(1):13532. doi: 10.1038/s41598-023-40780-8.
3
Novel models by machine learning to predict prognosis of breast cancer brain metastases.
J Transl Med. 2023 Jun 21;21(1):404. doi: 10.1186/s12967-023-04277-2.
4
From patterns to patients: Advances in clinical machine learning for cancer diagnosis, prognosis, and treatment.
Cell. 2023 Apr 13;186(8):1772-1791. doi: 10.1016/j.cell.2023.01.035. Epub 2023 Mar 10.
7
Does Residual Cancer Burden Predict Local Recurrence After Neoadjuvant Chemotherapy?
Ann Surg Oncol. 2022 Nov;29(12):7716-7724. doi: 10.1245/s10434-022-12038-6. Epub 2022 Jul 9.
9
Analysis of the tumor characteristics in young age breast cancer patients using collaborative stage data of the Korea Central Cancer Registry.
Breast Cancer Res Treat. 2021 Jun;187(3):785-792. doi: 10.1007/s10549-021-06107-9. Epub 2021 Feb 18.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验