Department of Mathematics, The Women University Multan, Multan, 61000, Pakistan.
Comput Biol Med. 2024 Dec;183:109215. doi: 10.1016/j.compbiomed.2024.109215. Epub 2024 Oct 4.
Breast cancer (BC) is a catastrophic global health concern that causes numerous fatalities worldwide. Early detection of breast cancer may mitigate death rates; however, the prevailing diagnostic procedure for the malignancy necessitates numerous multifaceted laboratory tests that must be performed by medical professionals. In this article machine learning, a branch of Artificial Intelligence (AI), has been employed to improve cancer diagnosis, prognoses and survival rates while reducing the vulnerability of humans. Support Vector Machine (SVM), K-nearest Neighbors (KNN), Decision Tree (DT), Random Forest (RF) and Grey Wolf Optimizer (GWO) are implemented to prognosticate breast cancer. Comprehensive insights into the efficacy of these approaches for breast cancer prognosis are provided by the performance assessment that is accomplished using the confusion matrix, Receiver Operating Characteristic (ROC) curves and parallel coordinate plots. Both UCI (University of California Irvine) and SEER (Surveillance, Epidemiology and End Results) datasets have been utilized to confirm the investigation's findings and ensure their generalizability across diverse data sources. The results conclusively demonstrate that SVM is the cohort's most accurate classifier. With a stupendous accuracy rate of 99.1 %, the GWO-SVM compares favorably to all other algorithms. Furthermore, feature reduction approaches such as Minimum Redundancy Maximum Relevance (mRMR), ReliefF and Principal Component Analysis (PCA) are utilized. ReliefF has demonstrated exceptional effectiveness with a maximum accuracy of 98.2 %.
乳腺癌(BC)是一个灾难性的全球健康问题,在全球范围内导致了许多死亡。早期发现乳腺癌可以降低死亡率;然而,目前用于诊断这种恶性肿瘤的方法需要医学专业人员进行大量多方面的实验室测试。在本文中,机器学习,人工智能(AI)的一个分支,已被用于提高癌症诊断、预后和生存率,同时降低人类的脆弱性。支持向量机(SVM)、K 最近邻(KNN)、决策树(DT)、随机森林(RF)和灰狼优化器(GWO)被用于预测乳腺癌。通过使用混淆矩阵、接收器工作特征(ROC)曲线和并行坐标图进行性能评估,提供了对这些方法在乳腺癌预后中的功效的全面了解。加州大学欧文分校(UCI)和监测、流行病学和最终结果(SEER)数据集都被用来验证研究结果,并确保它们在不同的数据源中具有普遍性。结果明确表明 SVM 是该队列中最准确的分类器。GWO-SVM 的准确率高达 99.1%,与所有其他算法相比具有优势。此外,还使用了特征减少方法,如最小冗余最大相关性(mRMR)、ReliefF 和主成分分析(PCA)。ReliefF 的效果非常出色,准确率最高可达 98.2%。