• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用深度学习、网格搜索和贝叶斯网络预测乳腺癌远处复发

Leveraging Deep Learning, Grid Search, and Bayesian Networks to Predict Distant Recurrence of Breast Cancer.

作者信息

Jiang Xia, Zhou Yijun, Wells Alan, Brufsky Adam

机构信息

Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15206, USA.

Department of Pathology, University of Pittsburgh and Pittsburgh VA Health System, Pittsburgh, PA 15261, USA.

出版信息

Cancers (Basel). 2025 Jul 30;17(15):2515. doi: 10.3390/cancers17152515.

DOI:10.3390/cancers17152515
PMID:40805211
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12346417/
Abstract

: Unlike most cancers, breast cancer poses a persistent risk of distant recurrence-often years after initial treatment-making long-term risk stratification uniquely challenging. Current tools fall short in predicting late metastatic events, particularly for early-stage patients. : We present an interpretable machine learning (ML) pipeline to predict distant recurrence-free survival at 5, 10, and 15 years, integrating Bayesian network-based causal feature selection, deep feed-forward neural network models (DNMs), and SHAP-based interpretation. Using electronic health record (EHR)-based clinical data from over 6000 patients, we first applied the Markov blanket and interactive risk factor learner (MBIL) to identify minimally sufficient predictor subsets. These were then used to train optimized DNM classifiers, with hyperparameters tuned via grid search and benchmarked against models from 10 traditional ML methods and models trained using all predictors. : Our best models achieved area under the curve (AUC) scores of 0.79, 0.83, and 0.89 for 5-, 10-, and 15-year predictions, respectively-substantially outperforming baselines. MBIL reduced input dimensionality by over 80% without sacrificing accuracy. Importantly, MBIL-selected features (e.g., nodal status, hormone receptor expression, tumor size) overlapped strongly with top SHAP contributors, reinforcing interpretability. Calibration plots further demonstrated close agreement between predicted probabilities and observed recurrence rates. The percentage performance improvement due to grid search ranged from 25.3% to 60%. : This study demonstrates that combining causal selection, deep learning, and grid search improves prediction accuracy, transparency, and calibration for long-horizon breast cancer recurrence risk. The proposed framework is well-positioned for clinical use, especially to guide long-term follow-up and therapy decisions in early-stage patients.

摘要

与大多数癌症不同,乳腺癌存在远处复发的持续风险——通常在初始治疗数年之后——这使得长期风险分层极具挑战性。目前的工具在预测晚期转移事件方面存在不足,尤其是对于早期患者。我们提出了一种可解释的机器学习(ML)流程,用于预测5年、10年和15年的无远处复发生存率,该流程整合了基于贝叶斯网络的因果特征选择、深度前馈神经网络模型(DNM)和基于SHAP的解释。利用来自6000多名患者的基于电子健康记录(EHR)的临床数据,我们首先应用马尔可夫毯和交互式风险因素学习器(MBIL)来识别最小充分预测子集。然后将这些子集用于训练优化的DNM分类器,通过网格搜索调整超参数,并与10种传统ML方法的模型以及使用所有预测因子训练的模型进行基准测试。我们的最佳模型在5年、10年和15年预测中的曲线下面积(AUC)得分分别为0.79、0.83和0.89——显著优于基线。MBIL在不牺牲准确性的情况下将输入维度降低了80%以上。重要的是,MBIL选择的特征(如淋巴结状态、激素受体表达、肿瘤大小)与SHAP贡献最大的因素高度重叠,增强了可解释性。校准图进一步证明了预测概率与观察到的复发率之间的密切一致性。由于网格搜索导致的性能提升百分比在25.3%至60%之间。这项研究表明,结合因果选择、深度学习和网格搜索可以提高对长期乳腺癌复发风险的预测准确性、透明度和校准。所提出的框架非常适合临床应用,特别是用于指导早期患者的长期随访和治疗决策。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf80/12346417/3f8089c16f52/cancers-17-02515-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf80/12346417/5e76526c6c32/cancers-17-02515-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf80/12346417/eef60ae41945/cancers-17-02515-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf80/12346417/128fd571b5c3/cancers-17-02515-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf80/12346417/06067a52073d/cancers-17-02515-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf80/12346417/45c129778bd6/cancers-17-02515-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf80/12346417/c3d3b40cb020/cancers-17-02515-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf80/12346417/e74306d181f5/cancers-17-02515-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf80/12346417/3f8089c16f52/cancers-17-02515-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf80/12346417/5e76526c6c32/cancers-17-02515-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf80/12346417/eef60ae41945/cancers-17-02515-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf80/12346417/128fd571b5c3/cancers-17-02515-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf80/12346417/06067a52073d/cancers-17-02515-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf80/12346417/45c129778bd6/cancers-17-02515-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf80/12346417/c3d3b40cb020/cancers-17-02515-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf80/12346417/e74306d181f5/cancers-17-02515-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf80/12346417/3f8089c16f52/cancers-17-02515-g008.jpg

相似文献

1
Leveraging Deep Learning, Grid Search, and Bayesian Networks to Predict Distant Recurrence of Breast Cancer.利用深度学习、网格搜索和贝叶斯网络预测乳腺癌远处复发
Cancers (Basel). 2025 Jul 30;17(15):2515. doi: 10.3390/cancers17152515.
2
Deep Learning: A Heuristic Three-Stage Mechanism for Grid Searches to Optimize the Future Risk Prediction of Breast Cancer Metastasis Using EHR-Based Clinical Data.深度学习:一种用于网格搜索的启发式三阶段机制,利用基于电子健康记录的临床数据优化乳腺癌转移的未来风险预测。
Cancers (Basel). 2025 Mar 25;17(7):1092. doi: 10.3390/cancers17071092.
3
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
4
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
5
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
6
Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm.缺失数据的存在是否会影响 SORG 机器学习算法在脊柱转移瘤患者中的性能?开发一种互联网应用算法。
Clin Orthop Relat Res. 2024 Jan 1;482(1):143-157. doi: 10.1097/CORR.0000000000002706. Epub 2023 Jun 12.
7
A Responsible Framework for Assessing, Selecting, and Explaining Machine Learning Models in Cardiovascular Disease Outcomes Among People With Type 2 Diabetes: Methodology and Validation Study.用于评估、选择和解释2型糖尿病患者心血管疾病结局机器学习模型的责任框架:方法与验证研究
JMIR Med Inform. 2025 Jun 27;13:e66200. doi: 10.2196/66200.
8
Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究
Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.
9
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
10
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

本文引用的文献

1
Empirical Study of Overfitting in Deep Learning for Predicting Breast Cancer Metastasis.深度学习预测乳腺癌转移中过拟合的实证研究
Cancers (Basel). 2023 Mar 25;15(7):1969. doi: 10.3390/cancers15071969.
2
Deep Learning and Machine Learning with Grid Search to Predict Later Occurrence of Breast Cancer Metastasis Using Clinical Data.利用临床数据,通过深度学习和带网格搜索的机器学习预测乳腺癌转移的后期发生情况。
J Clin Med. 2022 Sep 29;11(19):5772. doi: 10.3390/jcm11195772.
3
Text Data Augmentation for Deep Learning.用于深度学习的文本数据增强
J Big Data. 2021;8(1):101. doi: 10.1186/s40537-021-00492-0. Epub 2021 Jul 19.
4
DeepOmix: A scalable and interpretable multi-omics deep learning framework and application in cancer survival analysis.深度混合模型(DeepOmix):一种可扩展且可解释的多组学深度学习框架及其在癌症生存分析中的应用。
Comput Struct Biotechnol J. 2021 May 1;19:2719-2725. doi: 10.1016/j.csbj.2021.04.067. eCollection 2021.
5
Automatic detect lung node with deep learning in segmentation and imbalance data labeling.深度学习在分割和不平衡数据标注中自动检测肺结节。
Sci Rep. 2021 May 27;11(1):11174. doi: 10.1038/s41598-021-90599-4.
6
Deep learning-enabled breast cancer hormonal receptor status determination from base-level H&E stains.基于基础水平的 H&E 染色的深度学习辅助乳腺癌激素受体状态判定。
Nat Commun. 2020 Nov 16;11(1):5727. doi: 10.1038/s41467-020-19334-3.
7
Leveraging Bayesian networks and information theory to learn risk factors for breast cancer metastasis.利用贝叶斯网络和信息论学习乳腺癌转移的风险因素。
BMC Bioinformatics. 2020 Jul 10;21(1):298. doi: 10.1186/s12859-020-03638-8.
8
Lymph Node Metastasis Prediction from Primary Breast Cancer US Images Using Deep Learning.基于深度学习的原发性乳腺癌 US 图像淋巴结转移预测。
Radiology. 2020 Jan;294(1):19-28. doi: 10.1148/radiol.2019190372. Epub 2019 Nov 19.
9
A clinical decision support system learned from data to personalize treatment recommendations towards preventing breast cancer metastasis.一个从数据中学习的临床决策支持系统,旨在针对预防乳腺癌转移的个性化治疗建议。
PLoS One. 2019 Mar 8;14(3):e0213292. doi: 10.1371/journal.pone.0213292. eCollection 2019.
10
An overview of deep learning in medical imaging focusing on MRI.深度学习在医学影像中的概述,重点是 MRI。
Z Med Phys. 2019 May;29(2):102-127. doi: 10.1016/j.zemedi.2018.11.002. Epub 2018 Dec 13.