M El-Kenawy El-Sayed, Khodadadi Nima, Eid Marwa M, Khodadadi Ehsaneh, Khodadadi Ehsan, Khafaga Doaa Sami, Alhussan Amel Ali, Ibrahim Abdelhameed, Saber Mohamed
School of ICT, Faculty of Engineering, Design and Information & Communications Technology (EDICT), Bahrain Polytechnic, PO Box 33349, Isa Town, Bahrain.
Department of Civil and Architectural Engineering, University of Miami, Coral Gables, FL, USA.
Sci Rep. 2025 Mar 19;15(1):9483. doi: 10.1038/s41598-025-92187-2.
With the advancement of medical technology, a large amount of complex data on cancers is produced for diagnosing and treating cancers. However, not all this data is useful, as many features are redundant or irrelevant, which can reduce the accuracy of machine learning models. Metaheuristic algorithms have been employed to select features to address this issue. Although the efficacy of these algorithms has been demonstrated, challenges related to scalability and efficiency persist when handling large medical datasets. In this study, a binary version of the Advanced Al-Biruni Earth Radius (bABER) algorithm is proposed for the intelligent removal of unnecessary data and identifying the most essential features for cancer detection. Unlike traditional methods that rely on a single approach, bABER is evaluated using seven medical datasets and compared with eight widely used binary metaheuristic algorithms, including bSC, bPSO, bWAO, bGWO, bMVO, bSBO, bFA, and bGA. Statistical tests such as ANOVA and the Wilcoxon signed-rank test are conducted to ensure a thorough performance assessment. The results indicate that the bABER algorithm significantly outperforms other methods, making it a valuable tool for improving cancer diagnosis. By refining feature selection, this approach enhances existing machine learning models, leading to more accurate and reliable medical predictions. This study contributes to improved data-driven decision-making in healthcare, bringing the field closer to faster and more precise cancer detection.
随着医学技术的进步,产生了大量用于癌症诊断和治疗的复杂癌症数据。然而,并非所有这些数据都是有用的,因为许多特征是冗余的或不相关的,这可能会降低机器学习模型的准确性。元启发式算法已被用于选择特征来解决这个问题。尽管这些算法的有效性已经得到证明,但在处理大型医学数据集时,与可扩展性和效率相关的挑战仍然存在。在本研究中,提出了一种二进制版本的高级阿尔 - 比鲁尼地球半径(bABER)算法,用于智能去除不必要的数据并识别癌症检测中最重要的特征。与依赖单一方法的传统方法不同,bABER 使用七个医学数据集进行评估,并与八种广泛使用的二进制元启发式算法进行比较,包括bSC、bPSO、bWAO、bGWO、bMVO、bSBO、bFA和bGA。进行了方差分析(ANOVA)和威尔科克森符号秩检验等统计测试,以确保进行全面的性能评估。结果表明,bABER算法明显优于其他方法,使其成为改善癌症诊断的有价值工具。通过优化特征选择,这种方法增强了现有的机器学习模型,从而实现更准确、可靠的医学预测。本研究有助于改善医疗保健中数据驱动的决策,使该领域更接近更快、更精确的癌症检测。