• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

应用多因素的不同机器学习方法进行乳腺癌预测。

Breast cancer prediction using different machine learning methods applying multi factors.

机构信息

Faculty of Medicine, Department of Medical Informatics, Mashhad University of Medical Sciences, Mashhad, Iran.

Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran.

出版信息

J Cancer Res Clin Oncol. 2023 Dec;149(19):17133-17146. doi: 10.1007/s00432-023-05388-5. Epub 2023 Sep 29.

DOI:10.1007/s00432-023-05388-5
PMID:37773467
Abstract

OBJECTIVE

Breast cancer (BC) is a multifactorial disease and is one of the most common cancers globally. This study aimed to compare different machine learning (ML) techniques to develop a comprehensive breast cancer risk prediction model based on features of various factors.

METHODS

The population sample contained 810 records (115 cancer patients and 695 healthy individuals). 45 attributes out of 85 were selected based on the opinion of experts. These selected attributes are in genetic, biochemical, biomarker, gender, demographic and pathological factors. 13 Machine learning models were trained with proposed attributes and coefficient of attributes and internal relationships were calculated.

RESULT

Compared to other methods random forest (RF) has higher performance (accuracy 99.26%, precision 99%, and area under the curve (AUC) 99%). The results of assessing the impact and correlation of variables using the RF method based on PCA indicated that pathology, biomarker, biochemistry, gene, and demographic factors with a coefficient of 0.35, 0.23, 0.15, 0.14, and 0.13 respectively, affected the risk of BC (r = 0.54).

CONCLUSION

Breast cancer has several risk factors. Medical experts use these risk factors for early diagnosis. Therefore, identifying related risk factors and their effect can increase the accuracy of diagnosis. Considering the broad features for predicting breast cancer leads to the development of a comprehensive prediction model. In this study, using RF technique a breast cancer prediction model with 99.3% accuracy was developed based on multifactorial features.

摘要

目的

乳腺癌(BC)是一种多因素疾病,也是全球最常见的癌症之一。本研究旨在比较不同的机器学习(ML)技术,以基于各种因素的特征开发综合的乳腺癌风险预测模型。

方法

人群样本包含 810 条记录(115 名癌症患者和 695 名健康个体)。根据专家意见,从 85 个特征中选择了 45 个特征。这些选定的特征涉及遗传、生化、生物标志物、性别、人口统计学和病理学因素。使用提出的属性和属性系数训练了 13 种机器学习模型,并计算了属性系数和内部关系。

结果

与其他方法相比,随机森林(RF)具有更高的性能(准确率为 99.26%,精度为 99%,曲线下面积(AUC)为 99%)。使用基于 PCA 的 RF 方法评估变量的影响和相关性的结果表明,病理学、生物标志物、生化、基因和人口统计学因素的系数分别为 0.35、0.23、0.15、0.14 和 0.13,这些因素对乳腺癌的风险有影响(r=0.54)。

结论

乳腺癌有多种危险因素。医学专家使用这些危险因素进行早期诊断。因此,确定相关的危险因素及其影响可以提高诊断的准确性。考虑到预测乳腺癌的广泛特征,可以开发综合的预测模型。在本研究中,使用 RF 技术,基于多因素特征开发了一种准确率为 99.3%的乳腺癌预测模型。

相似文献

1
Breast cancer prediction using different machine learning methods applying multi factors.应用多因素的不同机器学习方法进行乳腺癌预测。
J Cancer Res Clin Oncol. 2023 Dec;149(19):17133-17146. doi: 10.1007/s00432-023-05388-5. Epub 2023 Sep 29.
2
Predicting Pathological Complete Response Following Neoadjuvant Therapy in Patients With Breast Cancer: Development of Machine Learning-Based Prediction Models in a Retrospective Study.预测乳腺癌患者新辅助治疗后的病理完全缓解:一项回顾性研究中基于机器学习的预测模型的开发
JMIR Cancer. 2025 Jul 18;11:e64685. doi: 10.2196/64685.
3
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
4
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
5
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
6
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
7
An Assessment of the Predictive Performance of Current Machine Learning-Based Breast Cancer Risk Prediction Models: Systematic Review.基于当前机器学习的乳腺癌风险预测模型的预测性能评估:系统评价。
JMIR Public Health Surveill. 2022 Dec 29;8(12):e35750. doi: 10.2196/35750.
8
Approaches for predicting dairy cattle methane emissions: from traditional methods to machine learning.预测奶牛甲烷排放的方法:从传统方法到机器学习。
J Anim Sci. 2024 Jan 3;102. doi: 10.1093/jas/skae219.
9
Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm.缺失数据的存在是否会影响 SORG 机器学习算法在脊柱转移瘤患者中的性能?开发一种互联网应用算法。
Clin Orthop Relat Res. 2024 Jan 1;482(1):143-157. doi: 10.1097/CORR.0000000000002706. Epub 2023 Jun 12.
10
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

引用本文的文献

1
Identification of Prognostic and Diagnostic Biomarkers for Glioma Utilizing Immune System Gene Profiling.利用免疫系统基因谱鉴定胶质瘤的预后和诊断生物标志物
Med J Islam Repub Iran. 2025 Apr 1;39:49. doi: 10.47176/mjiri.39.49. eCollection 2025.
2
Identification of M1 macrophage infiltration-related genes for immunotherapy in Her2-positive breast cancer based on bioinformatics analysis and machine learning.基于生物信息学分析和机器学习鉴定Her2阳性乳腺癌免疫治疗中M1巨噬细胞浸润相关基因
Sci Rep. 2025 Apr 11;15(1):12525. doi: 10.1038/s41598-025-96917-4.
3
Identification of BCL11A, NTN5, and OGN as Diagnosis Biomarker of Papillary Renal Cell Carcinomas by Bioinformatic Analysis.

本文引用的文献

1
Deep Learning for Acute Myeloid Leukemia Diagnosis.用于急性髓系白血病诊断的深度学习
J Med Life. 2020 Jul-Sep;13(3):382-387. doi: 10.25122/jml-2019-0090.
2
Local extraction and detection of early stage breast cancers through a microneedle and nano-Ag/MBL film based painless and blood-free strategy.通过基于微针和纳米 Ag/MBL 膜的无痛、无血策略进行早期乳腺癌的局部提取和检测。
Mater Sci Eng C Mater Biol Appl. 2020 Apr;109:110402. doi: 10.1016/j.msec.2019.110402. Epub 2019 Nov 18.
3
Machine Learning techniques in breast cancer prognosis prediction: A primary evaluation.
通过生物信息学分析鉴定BCL11A、NTN5和OGN作为乳头状肾细胞癌的诊断生物标志物
J Kidney Cancer VHL. 2025 Feb 28;12(1):12-22. doi: 10.15586/jkc.v12i1.366. eCollection 2025.
4
CASCADE: Context-Aware Data-Driven AI for Streamlined Multidisciplinary Tumor Board Recommendations in Oncology.CASCADE:用于肿瘤学中简化多学科肿瘤委员会建议的上下文感知数据驱动人工智能
Cancers (Basel). 2024 May 23;16(11):1975. doi: 10.3390/cancers16111975.
5
PFAS and their association with the increased risk of cardiovascular disease in postmenopausal women.全氟和多氟烷基物质及其与绝经后妇女心血管疾病风险增加的关联。
Toxicol Sci. 2024 Aug 1;200(2):312-323. doi: 10.1093/toxsci/kfae065.
乳腺癌预后预测中的机器学习技术:初步评估。
Cancer Med. 2020 May;9(9):3234-3243. doi: 10.1002/cam4.2811. Epub 2020 Mar 10.
4
Prediagnostic Circulating Levels of Sex Steroid Hormones and SHBG in Relation to Risk of Ductal Carcinoma of the Breast among UK Women.英国女性中,与乳腺癌(导管癌)风险相关的预诊断循环性激素和 SHBG 水平。
Cancer Epidemiol Biomarkers Prev. 2020 May;29(5):1058-1066. doi: 10.1158/1055-9965.EPI-19-1302. Epub 2020 Mar 3.
5
Perceived barriers to early diagnosis of breast Cancer in south and southwestern Ethiopia: a qualitative study.埃塞俄比亚南部和西南部乳腺癌早期诊断的认知障碍:一项定性研究。
BMC Womens Health. 2020 Feb 27;20(1):38. doi: 10.1186/s12905-020-00909-7.
6
A Hierarchical Age-Period-Cohort Analysis of Breast Cancer Mortality and Disability Adjusted Life Years (1990-2015) Attributable to Modified Risk Factors among Chinese Women.基于调整后的风险因素,对中国女性乳腺癌死亡率和伤残调整生命年(1990-2015 年)进行的分层年龄-时期-队列分析。
Int J Environ Res Public Health. 2020 Feb 20;17(4):1367. doi: 10.3390/ijerph17041367.
7
Association between ABCB1, ABCG2 carrier protein and COX-2 enzyme gene polymorphisms and breast cancer risk in a Turkish population.ABCB1、ABCG2载体蛋白和COX-2酶基因多态性与土耳其人群乳腺癌风险之间的关联。
Saudi Pharm J. 2020 Feb;28(2):215-219. doi: 10.1016/j.jsps.2019.11.024. Epub 2019 Dec 7.
8
The association of single nucleotide polymorphisms (SNPs) with breast density and breast cancer survival: the Malmö Diet and Cancer Study.单核苷酸多态性(SNP)与乳腺密度及乳腺癌生存率的关联:马尔默饮食与癌症研究
Acta Radiol. 2020 Oct;61(10):1326-1334. doi: 10.1177/0284185119900436. Epub 2020 Feb 9.
9
Incidence of male breast cancer in Scotland over a twenty-five-year period (1992-2017).苏格兰二十五年间(1992-2017 年)男性乳腺癌发病率。
Eur J Surg Oncol. 2020 Aug;46(8):1546-1550. doi: 10.1016/j.ejso.2020.01.009. Epub 2020 Jan 8.
10
Incidence of breast cancer attributable to breast density, modifiable and non-modifiable breast cancer risk factors in Singapore.新加坡归因于乳腺密度的乳腺癌发病率、可改变和不可改变的乳腺癌危险因素。
Sci Rep. 2020 Jan 16;10(1):503. doi: 10.1038/s41598-019-57341-7.