• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的乳腺癌诊断预测模型的改进。

Improved Machine Learning-Based Predictive Models for Breast Cancer Diagnosis.

机构信息

University of Chinese Academy of Sciences, Beijing 101408, China.

Shenzhen Key Lab for High Performance Data Mining, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China.

出版信息

Int J Environ Res Public Health. 2022 Mar 9;19(6):3211. doi: 10.3390/ijerph19063211.

DOI:10.3390/ijerph19063211
PMID:35328897
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8949437/
Abstract

Breast cancer death rates are higher than any other cancer in American women. Machine learning-based predictive models promise earlier detection techniques for breast cancer diagnosis. However, making an evaluation for models that efficiently diagnose cancer is still challenging. In this work, we proposed data exploratory techniques (DET) and developed four different predictive models to improve breast cancer diagnostic accuracy. Prior to models, four-layered essential DET, e.g., feature distribution, correlation, elimination, and hyperparameter optimization, were deep-dived to identify the robust feature classification into malignant and benign classes. These proposed techniques and classifiers were implemented on the Wisconsin Diagnostic Breast Cancer (WDBC) and Breast Cancer Coimbra Dataset (BCCD) datasets. Standard performance metrics, including confusion matrices and K-fold cross-validation techniques, were applied to assess each classifier's efficiency and training time. The models' diagnostic capability improved with our DET, i.e., polynomial SVM gained 99.3%, LR with 98.06%, KNN acquired 97.35%, and EC achieved 97.61% accuracy with the WDBC dataset. We also compared our significant results with previous studies in terms of accuracy. The implementation procedure and findings can guide physicians to adopt an effective model for a practical understanding and prognosis of breast cancer tumors.

摘要

乳腺癌死亡率在美国女性中高于其他任何癌症。基于机器学习的预测模型有望为乳腺癌诊断提供更早的检测技术。然而,对于能够有效诊断癌症的模型进行评估仍然具有挑战性。在这项工作中,我们提出了数据探索技术(DET)并开发了四个不同的预测模型来提高乳腺癌诊断的准确性。在建立模型之前,我们深入研究了四层基本的 DET,例如特征分布、相关性、消除和超参数优化,以确定稳健的特征分类,将恶性和良性分类。这些提出的技术和分类器在威斯康星州诊断乳腺癌(WDBC)和科英布拉乳腺癌数据集(BCCD)上实现。应用标准性能指标,包括混淆矩阵和 K 折交叉验证技术,评估每个分类器的效率和训练时间。我们的 DET 提高了模型的诊断能力,例如多项式 SVM 获得了 99.3%,LR 获得了 98.06%,KNN 获得了 97.35%,EC 获得了 97.61%的准确率,WDBC 数据集。我们还根据准确性与之前的研究进行了比较。实施过程和结果可以指导医生采用有效的模型,以对乳腺癌肿瘤进行实际的了解和预后。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f0b/8949437/aa0530ed5467/ijerph-19-03211-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f0b/8949437/de88471d097a/ijerph-19-03211-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f0b/8949437/59f7f8ed084f/ijerph-19-03211-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f0b/8949437/329b3c1cd36d/ijerph-19-03211-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f0b/8949437/a88af14b4fe5/ijerph-19-03211-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f0b/8949437/1baac44adcc7/ijerph-19-03211-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f0b/8949437/b4ae5bb4118a/ijerph-19-03211-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f0b/8949437/ac445b136754/ijerph-19-03211-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f0b/8949437/aa0530ed5467/ijerph-19-03211-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f0b/8949437/de88471d097a/ijerph-19-03211-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f0b/8949437/59f7f8ed084f/ijerph-19-03211-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f0b/8949437/329b3c1cd36d/ijerph-19-03211-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f0b/8949437/a88af14b4fe5/ijerph-19-03211-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f0b/8949437/1baac44adcc7/ijerph-19-03211-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f0b/8949437/b4ae5bb4118a/ijerph-19-03211-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f0b/8949437/ac445b136754/ijerph-19-03211-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f0b/8949437/aa0530ed5467/ijerph-19-03211-g008.jpg

相似文献

1
Improved Machine Learning-Based Predictive Models for Breast Cancer Diagnosis.基于机器学习的乳腺癌诊断预测模型的改进。
Int J Environ Res Public Health. 2022 Mar 9;19(6):3211. doi: 10.3390/ijerph19063211.
2
Prediction and Diagnosis of Breast Cancer Using Machine and Modern Deep Learning Models.使用机器和现代深度学习模型预测和诊断乳腺癌。
Asian Pac J Cancer Prev. 2024 Mar 1;25(3):1077-1085. doi: 10.31557/APJCP.2024.25.3.1077.
3
Parametric optimization and comparative study of machine learning and deep learning algorithms for breast cancer diagnosis.基于机器学习和深度学习算法的乳腺癌诊断的参数优化及对比研究。
Breast Dis. 2024;43(1):257-270. doi: 10.3233/BD-240018.
4
Breast Cancer Subtypes Classification with Hybrid Machine Learning Model.基于混合机器学习模型的乳腺癌亚型分类。
Methods Inf Med. 2022 Sep;61(3-04):68-83. doi: 10.1055/s-0042-1751043. Epub 2022 Sep 12.
5
Comparison of Classification Success Rates of Different Machine Learning Algorithms in the Diagnosis of Breast Cancer.不同机器学习算法在乳腺癌诊断中的分类成功率比较。
Asian Pac J Cancer Prev. 2022 Oct 1;23(10):3287-3297. doi: 10.31557/APJCP.2022.23.10.3287.
6
Breast Cancer Prediction Based on Multiple Machine Learning Algorithms.基于多种机器学习算法的乳腺癌预测。
Technol Cancer Res Treat. 2024 Jan-Dec;23:15330338241234791. doi: 10.1177/15330338241234791.
7
Analysis of Decision Tree and K-Nearest Neighbor Algorithm in the Classification of Breast Cancer.决策树和K近邻算法在乳腺癌分类中的分析
Asian Pac J Cancer Prev. 2019 Dec 1;20(12):3777-3781. doi: 10.31557/APJCP.2019.20.12.3777.
8
Machine learning in medicine: a practical introduction.医学中的机器学习:实用入门
BMC Med Res Methodol. 2019 Mar 19;19(1):64. doi: 10.1186/s12874-019-0681-4.
9
Performance assessment of hybrid machine learning approaches for breast cancer and recurrence prediction.用于乳腺癌及复发预测的混合机器学习方法的性能评估
PLoS One. 2024 Aug 1;19(8):e0304768. doi: 10.1371/journal.pone.0304768. eCollection 2024.
10
Support vector machine based diagnostic system for breast cancer using swarm intelligence.基于群智能的支持向量机乳腺癌诊断系统。
J Med Syst. 2012 Aug;36(4):2505-19. doi: 10.1007/s10916-011-9723-0. Epub 2011 May 3.

引用本文的文献

1
18F-FDG PET/CT-based deep radiomic models for enhancing chemotherapy response prediction in breast cancer.基于18F-FDG PET/CT的深度放射组学模型用于增强乳腺癌化疗反应预测
Med Oncol. 2025 Aug 11;42(9):425. doi: 10.1007/s12032-025-02982-0.
2
Predictive Modeling of Acute Respiratory Distress Syndrome Using Machine Learning: Systematic Review and Meta-Analysis.使用机器学习对急性呼吸窘迫综合征进行预测建模:系统评价与荟萃分析
J Med Internet Res. 2025 May 13;27:e66615. doi: 10.2196/66615.
3
Advanced machine learning framework for enhancing breast cancer diagnostics through transcriptomic profiling.

本文引用的文献

1
Pan-cancer computational histopathology reveals mutations, tumor composition and prognosis.泛癌计算组织病理学揭示了突变、肿瘤组成和预后。
Nat Cancer. 2020 Aug;1(8):800-810. doi: 10.1038/s43018-020-0085-8. Epub 2020 Jul 27.
2
Cancer statistics, 2022.癌症统计数据,2022 年。
CA Cancer J Clin. 2022 Jan;72(1):7-33. doi: 10.3322/caac.21708. Epub 2022 Jan 12.
3
The Importance of Spirituality for Women Facing Breast Cancer Diagnosis: A Qualitative Study.女性乳腺癌诊断中灵性关怀的重要性:一项定性研究。
通过转录组分析增强乳腺癌诊断的先进机器学习框架。
Discov Oncol. 2025 Mar 17;16(1):334. doi: 10.1007/s12672-025-02111-3.
4
Enhancing breast cancer prediction through stacking ensemble and deep learning integration.通过堆叠集成和深度学习集成增强乳腺癌预测
PeerJ Comput Sci. 2025 Feb 3;11:e2461. doi: 10.7717/peerj-cs.2461. eCollection 2025.
5
Transfer Learning and Neural Network-Based Approach on Structural MRI Data for Prediction and Classification of Alzheimer's Disease.基于迁移学习和神经网络的结构磁共振成像数据方法用于阿尔茨海默病的预测和分类
Diagnostics (Basel). 2025 Feb 4;15(3):360. doi: 10.3390/diagnostics15030360.
6
A novel aggregated coefficient ranking based feature selection strategy for enhancing the diagnosis of breast cancer classification using machine learning.一种基于新型聚集系数排序的特征选择策略,用于利用机器学习增强乳腺癌分类诊断。
Sci Rep. 2025 Feb 4;15(1):4171. doi: 10.1038/s41598-025-87826-7.
7
Synthetic Boosted Resampling Using Deep Generative Adversarial Networks: A Novel Approach to Improve Cancer Prediction from Imbalanced Datasets.使用深度生成对抗网络的合成增强重采样:一种从不平衡数据集中改善癌症预测的新方法。
Cancers (Basel). 2024 Dec 2;16(23):4046. doi: 10.3390/cancers16234046.
8
Prediction model for ocular metastasis of breast cancer: machine learning model development and interpretation study.乳腺癌眼部转移预测模型:机器学习模型的开发和解释研究。
BMC Cancer. 2024 Nov 29;24(1):1472. doi: 10.1186/s12885-024-12928-w.
9
Advancements in Exosome Proteins for Breast Cancer Diagnosis and Detection: With a Focus on Nanotechnology.外泌体蛋白在乳腺癌诊断和检测中的进展:聚焦纳米技术。
AAPS PharmSciTech. 2024 Nov 27;25(8):276. doi: 10.1208/s12249-024-02983-8.
10
Learning from Imbalanced Data: Integration of Advanced Resampling Techniques and Machine Learning Models for Enhanced Cancer Diagnosis and Prognosis.从不平衡数据中学习:先进重采样技术与机器学习模型的整合用于增强癌症诊断与预后
Cancers (Basel). 2024 Oct 8;16(19):3417. doi: 10.3390/cancers16193417.
Int J Environ Res Public Health. 2021 Jun 13;18(12):6415. doi: 10.3390/ijerph18126415.
4
A Review of Breast Imaging for Timely Diagnosis of Disease.乳腺影像学检查在疾病及时诊断中的应用评价。
Int J Environ Res Public Health. 2021 May 21;18(11):5509. doi: 10.3390/ijerph18115509.
5
A Decision Tree Model for Breast Reconstruction of Women with Breast Cancer: A Mixed Method Approach.基于混合方法的乳腺癌女性乳房重建决策树模型
Int J Environ Res Public Health. 2021 Mar 30;18(7):3579. doi: 10.3390/ijerph18073579.
6
Deep Learning Feature Extraction Approach for Hematopoietic Cancer Subtype Classification.深度学习特征提取方法在血液肿瘤亚型分类中的应用。
Int J Environ Res Public Health. 2021 Feb 23;18(4):2197. doi: 10.3390/ijerph18042197.
7
Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries.《全球癌症统计数据 2020:全球 185 个国家和地区 36 种癌症的发病率和死亡率估计》。
CA Cancer J Clin. 2021 May;71(3):209-249. doi: 10.3322/caac.21660. Epub 2021 Feb 4.
8
Comparing supervised and semi-supervised Machine Learning Models on Diagnosing Breast Cancer.比较监督式和半监督式机器学习模型在乳腺癌诊断中的应用
Ann Med Surg (Lond). 2021 Jan 8;62:53-64. doi: 10.1016/j.amsu.2020.12.043. eCollection 2021 Feb.
9
Detection and classification of breast cancer using logistic regression feature selection and GMDH classifier.使用逻辑回归特征选择和 GMDH 分类器进行乳腺癌检测和分类。
J Biomed Inform. 2020 Nov;111:103591. doi: 10.1016/j.jbi.2020.103591. Epub 2020 Oct 8.
10
Cloud-Based Breast Cancer Prediction Empowered with Soft Computing Approaches.基于云计算的乳腺癌预测方法研究
J Healthc Eng. 2020 May 18;2020:8017496. doi: 10.1155/2020/8017496. eCollection 2020.