• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于乳腺癌及复发预测的混合机器学习方法的性能评估

Performance assessment of hybrid machine learning approaches for breast cancer and recurrence prediction.

作者信息

Pati Abhilash, Panigrahi Amrutanshu, Parhi Manoranjan, Giri Jayant, Qin Hong, Mallik Saurav, Pattanayak Sambit Ranjan, Agrawal Umang Kumar

机构信息

Department of Computer Science and Engineering, Siksha 'O' Anusandhan (Deemed to be University), Bhubaneswar, Odisha, India.

Centre for Data Science, Siksha 'O' Anusandhan (Deemed to be University), Bhubaneswar, Odisha, India.

出版信息

PLoS One. 2024 Aug 1;19(8):e0304768. doi: 10.1371/journal.pone.0304768. eCollection 2024.

DOI:10.1371/journal.pone.0304768
PMID:39088457
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11293701/
Abstract

Breast cancer is a major health concern for women everywhere and a major killer of women. Malignant tumors may be distinguished from benign ones, allowing for early diagnosis of this disease. Therefore, doctors need an accurate method of diagnosing tumors as either malignant or benign. Even if therapy begins immediately after diagnosis, some cancer cells may persist in the body, increasing the risk of a recurrence. Metastasis and recurrence are the leading causes of death from breast cancer. Therefore, detecting a return of breast cancer early has become a pressing medical issue. Evaluating and contrasting various Machine Learning (ML) techniques for breast cancer and recurrence prediction is crucial to choosing the best successful method. Inaccurate forecasts are common when using datasets with a large number of attributes. This study addresses the need for effective feature selection and optimization methods by introducing Recursive Feature Elimination (RFE) and Grey Wolf Optimizer (GWO), in response to the limitations observed in existing approaches. In this research, the performance evaluation of methods is enhanced by employing the RFE and GWO, considering the Wisconsin Diagnostic Breast Cancer (WDBC) and Wisconsin Prognostic Breast Cancer (WPBC) datasets taken from the UCI-ML repository. Various preprocessing techniques are applied to raw data, including imputation, scaling, and others. In the second step, relevant feature correlations are used with RFE to narrow down candidate discriminative features. The GWO chooses the best possible combination of attributes for the most accurate result in the next step. We use seven ML classifiers in both datasets to make a binary decision. On the WDBC and WPBC datasets, several experiments have shown accuracies of 98.25% and 93.27%, precisions of 98.13% and 95.56%, sensitivities of 99.06% and 96.63%, specificities of 96.92% and 73.33%, F1-scores of 98.59% and 96.09% and AUCs of 0.982 and 0.936, respectively. The hybrid approach's superior feature selection improved the accuracy of breast cancer performance indicators and recurrence classification.

摘要

乳腺癌是世界各地女性主要的健康问题,也是女性的主要杀手。恶性肿瘤可与良性肿瘤区分开来,有助于对这种疾病进行早期诊断。因此,医生需要一种准确的方法来诊断肿瘤是恶性还是良性。即使在诊断后立即开始治疗,体内仍可能存在一些癌细胞,从而增加复发风险。转移和复发是乳腺癌死亡的主要原因。因此,早期检测乳腺癌复发已成为一个紧迫的医学问题。评估和对比用于乳腺癌及复发预测的各种机器学习(ML)技术对于选择最佳成功方法至关重要。使用具有大量属性的数据集时,不准确的预测很常见。本研究针对现有方法中观察到的局限性,通过引入递归特征消除(RFE)和灰狼优化器(GWO),满足了对有效特征选择和优化方法的需求。在本研究中,考虑从UCI-ML库获取的威斯康星诊断乳腺癌(WDBC)和威斯康星预后乳腺癌(WPBC)数据集,通过采用RFE和GWO提高了方法的性能评估。对原始数据应用了各种预处理技术,包括插补、缩放等。第二步,使用RFE进行相关特征关联,以缩小候选判别特征范围。GWO在下一步中选择最佳的属性组合以获得最准确的结果。我们在两个数据集中使用七个ML分类器进行二元决策。在WDBC和WPBC数据集上,多项实验分别显示准确率为98.25%和93.27%,精确率为98.13%和95.56%,灵敏度为99.06%和96.63%,特异性为96.92%和73.33%,F1分数为98.59%和96.09%,AUC分别为0.982和0.936。混合方法卓越的特征选择提高了乳腺癌性能指标和复发分类的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/915cb2ecc046/pone.0304768.g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/adadd222a60d/pone.0304768.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/e745f9a38e28/pone.0304768.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/2a8fca9c7062/pone.0304768.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/08d2f092ee7e/pone.0304768.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/b93ebb58cae7/pone.0304768.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/50819f9c9459/pone.0304768.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/38f39827173e/pone.0304768.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/d4a67bfc34e8/pone.0304768.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/3a9d5e35b509/pone.0304768.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/8a45deef74ca/pone.0304768.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/b808cba573e3/pone.0304768.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/11dda546eed1/pone.0304768.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/9e062c7be6b7/pone.0304768.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/ff74d0fcc9b3/pone.0304768.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/d89cf92adeaf/pone.0304768.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/e7b4d6901206/pone.0304768.g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/09ff39f24aa8/pone.0304768.g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/967d653a2d58/pone.0304768.g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/915cb2ecc046/pone.0304768.g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/adadd222a60d/pone.0304768.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/e745f9a38e28/pone.0304768.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/2a8fca9c7062/pone.0304768.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/08d2f092ee7e/pone.0304768.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/b93ebb58cae7/pone.0304768.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/50819f9c9459/pone.0304768.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/38f39827173e/pone.0304768.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/d4a67bfc34e8/pone.0304768.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/3a9d5e35b509/pone.0304768.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/8a45deef74ca/pone.0304768.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/b808cba573e3/pone.0304768.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/11dda546eed1/pone.0304768.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/9e062c7be6b7/pone.0304768.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/ff74d0fcc9b3/pone.0304768.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/d89cf92adeaf/pone.0304768.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/e7b4d6901206/pone.0304768.g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/09ff39f24aa8/pone.0304768.g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/967d653a2d58/pone.0304768.g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1ac/11293701/915cb2ecc046/pone.0304768.g019.jpg

相似文献

1
Performance assessment of hybrid machine learning approaches for breast cancer and recurrence prediction.用于乳腺癌及复发预测的混合机器学习方法的性能评估
PLoS One. 2024 Aug 1;19(8):e0304768. doi: 10.1371/journal.pone.0304768. eCollection 2024.
2
Improved Machine Learning-Based Predictive Models for Breast Cancer Diagnosis.基于机器学习的乳腺癌诊断预测模型的改进。
Int J Environ Res Public Health. 2022 Mar 9;19(6):3211. doi: 10.3390/ijerph19063211.
3
Detection and classification of breast cancer using logistic regression feature selection and GMDH classifier.使用逻辑回归特征选择和 GMDH 分类器进行乳腺癌检测和分类。
J Biomed Inform. 2020 Nov;111:103591. doi: 10.1016/j.jbi.2020.103591. Epub 2020 Oct 8.
4
Enhancing the prediction of IDC breast cancer staging from gene expression profiles using hybrid feature selection methods and deep learning architecture.使用混合特征选择方法和深度学习架构增强从基因表达谱预测浸润性导管癌乳腺癌分期的能力。
Med Biol Eng Comput. 2023 Nov;61(11):2895-2919. doi: 10.1007/s11517-023-02892-1. Epub 2023 Aug 2.
5
A structured combination of ensemble classifier and filter-based feature selection to improve breast cancer diagnosis.基于集成分类器和基于过滤器的特征选择的结构化组合,以提高乳腺癌诊断。
J Cancer Res Clin Oncol. 2023 Nov;149(16):14519-14534. doi: 10.1007/s00432-023-05238-4. Epub 2023 Aug 12.
6
Predicting Breast Cancer Leveraging Supervised Machine Learning Techniques.利用监督机器学习技术预测乳腺癌。
Comput Math Methods Med. 2022 Aug 16;2022:5869529. doi: 10.1155/2022/5869529. eCollection 2022.
7
Feature selection using genetic algorithm for breast cancer diagnosis: experiment on three different datasets.基于遗传算法的乳腺癌诊断特征选择:在三个不同数据集上的实验
Iran J Basic Med Sci. 2016 May;19(5):476-82.
8
Breast Cancer Subtypes Classification with Hybrid Machine Learning Model.基于混合机器学习模型的乳腺癌亚型分类。
Methods Inf Med. 2022 Sep;61(3-04):68-83. doi: 10.1055/s-0042-1751043. Epub 2022 Sep 12.
9
Effective hybrid feature selection using different bootstrap enhances cancers classification performance.使用不同的自助法进行有效的混合特征选择可提高癌症分类性能。
BioData Min. 2022 Sep 30;15(1):24. doi: 10.1186/s13040-022-00304-y.
10
An enhanced and efficient approach for feature selection for chronic human disease prediction: A breast cancer study.一种用于慢性人类疾病预测的特征选择的增强型高效方法:一项乳腺癌研究。
Heliyon. 2024 Feb 28;10(5):e26799. doi: 10.1016/j.heliyon.2024.e26799. eCollection 2024 Mar 15.

引用本文的文献

1
Development and validation of hybrid machine learning approach for predicting survival in patients with cervical cancer: a SEER-based population study.用于预测宫颈癌患者生存率的混合机器学习方法的开发与验证:一项基于监测、流行病学和最终结果(SEER)的人群研究
Front Oncol. 2025 Jun 18;15:1605378. doi: 10.3389/fonc.2025.1605378. eCollection 2025.
2
A population based optimization of convolutional neural networks for chronic kidney disease prediction.基于群体的卷积神经网络优化用于慢性肾脏病预测
Sci Rep. 2025 Apr 25;15(1):14500. doi: 10.1038/s41598-025-99270-8.

本文引用的文献

1
Breast Cancer Diagnosis Based on IoT and Deep Transfer Learning Enabled by Fog Computing.基于物联网和雾计算实现的深度迁移学习的乳腺癌诊断
Diagnostics (Basel). 2023 Jun 27;13(13):2191. doi: 10.3390/diagnostics13132191.
2
Current and future burden of breast cancer: Global statistics for 2020 and 2040.乳腺癌的现状和未来负担:2020 年和 2040 年全球统计数据。
Breast. 2022 Dec;66:15-23. doi: 10.1016/j.breast.2022.08.010. Epub 2022 Sep 2.
3
Machine Learning Algorithms to Predict Recurrence within 10 Years after Breast Cancer Surgery: A Prospective Cohort Study.
预测乳腺癌手术后10年内复发的机器学习算法:一项前瞻性队列研究。
Cancers (Basel). 2020 Dec 17;12(12):3817. doi: 10.3390/cancers12123817.
4
Identifying Breast Cancer Distant Recurrences from Electronic Health Records Using Machine Learning.使用机器学习从电子健康记录中识别乳腺癌远处复发
J Healthc Inform Res. 2019;3(3):283-299. doi: 10.1007/s41666-019-00046-3. Epub 2019 Apr 8.
5
A case-based ensemble learning system for explainable breast cancer recurrence prediction.一种用于可解释性乳腺癌复发预测的基于案例的集成学习系统。
Artif Intell Med. 2020 Jul;107:101858. doi: 10.1016/j.artmed.2020.101858. Epub 2020 Jun 5.
6
Predicting breast cancer risk using personal health data and machine learning models.利用个人健康数据和机器学习模型预测乳腺癌风险。
PLoS One. 2019 Dec 27;14(12):e0226765. doi: 10.1371/journal.pone.0226765. eCollection 2019.
7
Breast Cancer Prognosis Using a Machine Learning Approach.基于机器学习方法的乳腺癌预后分析
Cancers (Basel). 2019 Mar 7;11(3):328. doi: 10.3390/cancers11030328.
8
Application of artificial neural network-based survival analysis on two breast cancer datasets.基于人工神经网络的生存分析在两个乳腺癌数据集上的应用。
AMIA Annu Symp Proc. 2007 Oct 11;2007:130-4.
9
Computerized breast cancer diagnosis and prognosis from fine-needle aspirates.基于细针穿刺抽吸物的计算机辅助乳腺癌诊断与预后评估
Arch Surg. 1995 May;130(5):511-6. doi: 10.1001/archsurg.1995.01430050061010.