• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用机器学习预测乳腺癌 5 年生存率:系统评价。

Predicting breast cancer 5-year survival using machine learning: A systematic review.

机构信息

School of Nursing, Jilin University, Jilin, China.

Breast Surgery, Jilin Province Tumor Hospital, Jilin, China.

出版信息

PLoS One. 2021 Apr 16;16(4):e0250370. doi: 10.1371/journal.pone.0250370. eCollection 2021.

DOI:10.1371/journal.pone.0250370
PMID:33861809
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8051758/
Abstract

BACKGROUND

Accurately predicting the survival rate of breast cancer patients is a major issue for cancer researchers. Machine learning (ML) has attracted much attention with the hope that it could provide accurate results, but its modeling methods and prediction performance remain controversial. The aim of this systematic review is to identify and critically appraise current studies regarding the application of ML in predicting the 5-year survival rate of breast cancer.

METHODS

In accordance with the PRISMA guidelines, two researchers independently searched the PubMed (including MEDLINE), Embase, and Web of Science Core databases from inception to November 30, 2020. The search terms included breast neoplasms, survival, machine learning, and specific algorithm names. The included studies related to the use of ML to build a breast cancer survival prediction model and model performance that can be measured with the value of said verification results. The excluded studies in which the modeling process were not explained clearly and had incomplete information. The extracted information included literature information, database information, data preparation and modeling process information, model construction and performance evaluation information, and candidate predictor information.

RESULTS

Thirty-one studies that met the inclusion criteria were included, most of which were published after 2013. The most frequently used ML methods were decision trees (19 studies, 61.3%), artificial neural networks (18 studies, 58.1%), support vector machines (16 studies, 51.6%), and ensemble learning (10 studies, 32.3%). The median sample size was 37256 (range 200 to 659820) patients, and the median predictor was 16 (range 3 to 625). The accuracy of 29 studies ranged from 0.510 to 0.971. The sensitivity of 25 studies ranged from 0.037 to 1. The specificity of 24 studies ranged from 0.008 to 0.993. The AUC of 20 studies ranged from 0.500 to 0.972. The precision of 6 studies ranged from 0.549 to 1. All of the models were internally validated, and only one was externally validated.

CONCLUSIONS

Overall, compared with traditional statistical methods, the performance of ML models does not necessarily show any improvement, and this area of research still faces limitations related to a lack of data preprocessing steps, the excessive differences of sample feature selection, and issues related to validation. Further optimization of the performance of the proposed model is also needed in the future, which requires more standardization and subsequent validation.

摘要

背景

准确预测乳腺癌患者的生存率是癌症研究人员面临的主要问题。机器学习(ML)受到了广泛关注,希望它能够提供准确的结果,但它的建模方法和预测性能仍存在争议。本系统评价的目的是确定并批判性地评价当前关于 ML 在预测乳腺癌 5 年生存率方面应用的研究。

方法

根据 PRISMA 指南,两名研究人员独立检索了从成立到 2020 年 11 月 30 日的 PubMed(包括 MEDLINE)、Embase 和 Web of Science Core 数据库。检索词包括乳腺肿瘤、生存、机器学习和特定算法名称。纳入的研究涉及使用 ML 构建乳腺癌生存预测模型以及可以通过验证结果值来衡量的模型性能。排除建模过程未被清楚解释且信息不完整的研究。提取的信息包括文献信息、数据库信息、数据准备和建模过程信息、模型构建和性能评估信息以及候选预测因子信息。

结果

符合纳入标准的 31 项研究被纳入,其中大多数发表于 2013 年以后。最常使用的 ML 方法是决策树(19 项研究,61.3%)、人工神经网络(18 项研究,58.1%)、支持向量机(16 项研究,51.6%)和集成学习(10 项研究,32.3%)。中位样本量为 37256(范围 200-659820)例患者,中位预测因子为 16(范围 3-625)。29 项研究的准确性范围为 0.510-0.971。25 项研究的敏感性范围为 0.037-1。24 项研究的特异性范围为 0.008-0.993。20 项研究的 AUC 范围为 0.500-0.972。6 项研究的精确度范围为 0.549-1。所有模型均进行了内部验证,仅有一项进行了外部验证。

结论

总体而言,与传统统计方法相比,ML 模型的性能不一定有所提高,该领域的研究仍然存在与数据预处理步骤缺乏、样本特征选择差异过大以及验证相关问题等局限性。未来还需要进一步优化所提出模型的性能,这需要更加标准化和后续验证。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db96/8051758/1050f7d6160c/pone.0250370.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db96/8051758/84ad6077a93b/pone.0250370.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db96/8051758/1050f7d6160c/pone.0250370.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db96/8051758/84ad6077a93b/pone.0250370.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db96/8051758/1050f7d6160c/pone.0250370.g002.jpg

相似文献

1
Predicting breast cancer 5-year survival using machine learning: A systematic review.使用机器学习预测乳腺癌 5 年生存率:系统评价。
PLoS One. 2021 Apr 16;16(4):e0250370. doi: 10.1371/journal.pone.0250370. eCollection 2021.
2
Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.基于数据驱动的血糖动力学建模与预测:机器学习在 1 型糖尿病中的应用。
Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.
3
Machine learning in predicting graft failure following kidney transplantation: A systematic review of published predictive models.机器学习在预测肾移植后移植物失败中的应用:已发表预测模型的系统评价。
Int J Med Inform. 2019 Oct;130:103957. doi: 10.1016/j.ijmedinf.2019.103957. Epub 2019 Aug 24.
4
Predictive value of machine learning for breast cancer recurrence: a systematic review and meta-analysis.机器学习对乳腺癌复发的预测价值:系统评价和荟萃分析。
J Cancer Res Clin Oncol. 2023 Sep;149(12):10659-10674. doi: 10.1007/s00432-023-04967-w. Epub 2023 Jun 11.
5
A systematic review of machine learning models for predicting outcomes of stroke with structured data.基于结构化数据的机器学习模型预测脑卒中结局的系统评价。
PLoS One. 2020 Jun 12;15(6):e0234722. doi: 10.1371/journal.pone.0234722. eCollection 2020.
6
Use of machine learning to analyse routinely collected intensive care unit data: a systematic review.运用机器学习分析常规收集的重症监护病房数据:系统评价。
Crit Care. 2019 Aug 22;23(1):284. doi: 10.1186/s13054-019-2564-9.
7
Predicting factors for survival of breast cancer patients using machine learning techniques.运用机器学习技术预测乳腺癌患者的生存因素。
BMC Med Inform Decis Mak. 2019 Mar 22;19(1):48. doi: 10.1186/s12911-019-0801-4.
8
Reviewing ensemble classification methods in breast cancer.综述乳腺癌中的集成分类方法。
Comput Methods Programs Biomed. 2019 Aug;177:89-112. doi: 10.1016/j.cmpb.2019.05.019. Epub 2019 May 20.
9
Machine learning in oral squamous cell carcinoma: Current status, clinical concerns and prospects for future-A systematic review.机器学习在口腔鳞状细胞癌中的应用:现状、临床关注点及未来展望——系统综述。
Artif Intell Med. 2021 May;115:102060. doi: 10.1016/j.artmed.2021.102060. Epub 2021 Mar 26.
10
Cervical cancer survival prediction by machine learning algorithms: a systematic review.基于机器学习算法的宫颈癌生存预测:系统综述。
BMC Cancer. 2023 Apr 13;23(1):341. doi: 10.1186/s12885-023-10808-3.

引用本文的文献

1
SEER-based machine learning prediction of bone metastasis in breast cancer: model development and validation.基于监测、流行病学和最终结果(SEER)数据库的乳腺癌骨转移机器学习预测:模型开发与验证
Gland Surg. 2025 Jul 31;14(7):1366-1378. doi: 10.21037/gs-2025-168. Epub 2025 Jul 28.
2
Machine Learning Framework for Ovarian Cancer Diagnostics Using Plasma Lipidomics and Metabolomics.基于血浆脂质组学和代谢组学的卵巢癌诊断机器学习框架
Int J Mol Sci. 2025 Jul 10;26(14):6630. doi: 10.3390/ijms26146630.
3
Machine learning to evaluate the effects of non-clinical social determinant features in predicting colorectal Cancer mortality in a medically underserved Appalachian population.

本文引用的文献

1
Prediction of breast cancer distant recurrence using natural language processing and knowledge-guided convolutional neural network.使用自然语言处理和知识引导卷积神经网络预测乳腺癌远处复发。
Artif Intell Med. 2020 Nov;110:101977. doi: 10.1016/j.artmed.2020.101977. Epub 2020 Nov 1.
2
Breast cancer risk factors and their effects on survival: a Mendelian randomisation study.乳腺癌风险因素及其对生存的影响:一项孟德尔随机化研究。
BMC Med. 2020 Nov 17;18(1):327. doi: 10.1186/s12916-020-01797-2.
3
Use and performance of machine learning models for type 2 diabetes prediction in community settings: A systematic review and meta-analysis.
机器学习用于评估非临床社会决定因素特征在预测医疗服务不足的阿巴拉契亚人群结直肠癌死亡率中的作用。
Sci Rep. 2025 Jul 16;15(1):25781. doi: 10.1038/s41598-025-11074-y.
4
Development and validation of novel machine learning-based prognostic models and propensity score matching for comparison of surgical approaches in mucinous breast cancer.基于新型机器学习的预后模型的开发与验证以及倾向评分匹配用于黏液性乳腺癌手术方法比较
Front Endocrinol (Lausanne). 2025 Jun 3;16:1557858. doi: 10.3389/fendo.2025.1557858. eCollection 2025.
5
Cancer incidence data at the ZIP Code Tabulation Area level in the United States interpolated by Monte Carlo simulation with multiple constraints.美国邮政编码分区层面的癌症发病率数据,通过具有多重约束的蒙特卡洛模拟进行插值。
Sci Data. 2025 May 30;12(1):909. doi: 10.1038/s41597-025-05254-8.
6
Bayesian Model Prediction for Breast Cancer Survival: A Retrospective Analysis.乳腺癌生存的贝叶斯模型预测:一项回顾性分析。
Eur J Breast Health. 2025 Jun 20;21(3):255-264. doi: 10.4274/ejbh.galenos.2025.2025-2-14. Epub 2025 May 27.
7
Breast cancer survival in Ethiopia: a systematic review and meta-analysis of rates and predictors.埃塞俄比亚的乳腺癌生存率:一项关于生存率及预测因素的系统评价与荟萃分析
Cancer Causes Control. 2025 Jul;36(7):653-662. doi: 10.1007/s10552-025-01982-9. Epub 2025 Mar 2.
8
Machine learning-based prediction of distant metastasis risk in invasive ductal carcinoma of the breast.基于机器学习的乳腺浸润性导管癌远处转移风险预测
PLoS One. 2025 Feb 26;20(2):e0310410. doi: 10.1371/journal.pone.0310410. eCollection 2025.
9
Comparison of Recurrence Risk Prediction by the CanAssist Breast Test and the PREDICT Online Tool in Early Breast Cancer Patients.CanAssist乳腺检测与PREDICT在线工具对早期乳腺癌患者复发风险预测的比较
Cureus. 2025 Jan 12;17(1):e77356. doi: 10.7759/cureus.77356. eCollection 2025 Jan.
10
Predicting Factors Affecting Survival Rate in Patients Undergoing On-Pump Coronary Artery Bypass Graft Surgery Using Machine Learning Methods: A Systematic Review.使用机器学习方法预测接受体外循环冠状动脉旁路移植术患者生存率的影响因素:一项系统评价
Health Sci Rep. 2025 Jan 22;8(1):e70336. doi: 10.1002/hsr2.70336. eCollection 2025 Jan.
机器学习模型在社区环境中用于 2 型糖尿病预测的使用和性能:系统评价和荟萃分析。
Int J Med Inform. 2020 Nov;143:104268. doi: 10.1016/j.ijmedinf.2020.104268. Epub 2020 Sep 7.
4
Prognostic value of depression and anxiety on breast cancer recurrence and mortality: a systematic review and meta-analysis of 282,203 patients.抑郁和焦虑对乳腺癌复发及死亡率的预后价值:对282,203例患者的系统评价和荟萃分析
Mol Psychiatry. 2020 Dec;25(12):3186-3197. doi: 10.1038/s41380-020-00865-6. Epub 2020 Aug 20.
5
Empirical assessment of bias in machine learning diagnostic test accuracy studies.机器学习诊断试验准确性研究中偏倚的实证评估。
J Am Med Inform Assoc. 2020 Jul 1;27(7):1092-1101. doi: 10.1093/jamia/ocaa075.
6
Identifying predictive factors for neuropathic pain after breast cancer surgery using machine learning.利用机器学习识别乳腺癌手术后发生神经性疼痛的预测因素。
Int J Med Inform. 2020 Sep;141:104170. doi: 10.1016/j.ijmedinf.2020.104170. Epub 2020 Jun 13.
7
A machine learning model that classifies breast cancer pathologic complete response on MRI post-neoadjuvant chemotherapy.一种基于机器学习的模型,用于对新辅助化疗后 MRI 上的乳腺癌病理完全缓解进行分类。
Breast Cancer Res. 2020 May 28;22(1):57. doi: 10.1186/s13058-020-01291-w.
8
Machine Learning and Deep Learning Approaches in Breast Cancer Survival Prediction Using Clinical Data.使用临床数据进行乳腺癌生存预测的机器学习和深度学习方法
Folia Biol (Praha). 2019;65(5-6):212-220. doi: 10.14712/fb2019065050212.
9
Prediction model of the response to neoadjuvant chemotherapy in breast cancers by a Naive Bayes algorithm.基于朴素贝叶斯算法的乳腺癌新辅助化疗反应预测模型。
Comput Methods Programs Biomed. 2020 Aug;192:105458. doi: 10.1016/j.cmpb.2020.105458. Epub 2020 Mar 19.
10
A Systematic Review of Estimating Breast Cancer Recurrence at the Population Level With Administrative Data.基于行政数据评估人群乳腺癌复发的系统评价。
J Natl Cancer Inst. 2020 Oct 1;112(10):979-988. doi: 10.1093/jnci/djaa050.