Suppr超能文献

用于医学诊断特征选择的具有贪婪交叉的冠状病毒群体免疫优化器

Coronavirus herd immunity optimizer with greedy crossover for feature selection in medical diagnosis.

作者信息

Alweshah Mohammed, Alkhalaileh Saleh, Al-Betar Mohammed Azmi, Bakar Azuraliza Abu

机构信息

Prince Abdullah Bin Ghazi Faculty of Information and Communication Technology, Al-Balqa Applied University, Al-Salt, Jordan.

Center for Artificial Intelligence Technology, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi, Selangor, Malaysia.

出版信息

Knowl Based Syst. 2022 Jan 10;235:107629. doi: 10.1016/j.knosys.2021.107629. Epub 2021 Oct 29.

Abstract

The importance of medical data and the crucial nature of the decisions that are based on such data, as well as the large increase in its volume, has encouraged researchers to develop feature selection (FS)-based approaches to identify the most relevant data for specific medical problems In this paper, two intelligent wrapper FS approaches based on a new metaheuristic algorithm named the coronavirus herd immunity optimizer (CHIO) were applied with and without the incorporation of a greedy crossover (GC) operator strategy to enhance exploration of the search space by CHIO. The two proposed approaches, CHIO and CHIO-GC, were evaluated using 23 medical benchmark datasets and a real-world COVID-19 dataset. The experimental results indicated that CHIO-GC outperformed CHIO in terms of search capability, as reflected in classification accuracy, selection size, F-measure, standard deviation and convergence speed. The GC operator was able to enhance the balance between exploration and exploitation of the CHIO in the search and correct suboptimal solutions for faster convergence. The proposed CHIO-GC was also compared with two previous wrapper FS approaches, namely, binary moth flame optimization with Lévy flight (LBMFO_V3) and the hyper learning binary dragonfly algorithm (HLBDA), as well as four filter methods namely, Chi-square, Relief, correlation-based feature selection and information gain. CHIO-GC surpassed LBMFO_V3 and the four filter methods with an accuracy rate of 0.79 on 23 medical benchmark datasets. CHIO-GC also surpassed HLBDA with an accuracy rate of 0.93 when applied to the COVID-19 dataset. These encouraging results were obtained by striking a sufficient balance between the two search phases of CHIO-GC during the hunt for correct solutions, which also increased the convergence rate. This was accomplished by integrating a greedy crossover technique into the CHIO algorithm to remedy the inferior solutions found during premature convergence and while locked into a local optimum search space.

摘要

医学数据的重要性、基于此类数据的决策的关键性质,以及其数量的大幅增加,促使研究人员开发基于特征选择(FS)的方法,以识别针对特定医学问题的最相关数据。在本文中,两种基于一种名为冠状病毒群体免疫优化器(CHIO)的新型元启发式算法的智能包装器FS方法,在有无纳入贪婪交叉(GC)算子策略的情况下被应用,以增强CHIO对搜索空间的探索。所提出的两种方法,即CHIO和CHIO-GC,使用23个医学基准数据集和一个真实世界的COVID-19数据集进行了评估。实验结果表明,CHIO-GC在搜索能力方面优于CHIO,这体现在分类准确率、选择大小、F值、标准差和收敛速度上。GC算子能够在搜索中增强CHIO在探索和利用之间的平衡,并纠正次优解以实现更快的收敛。所提出的CHIO-GC还与之前的两种包装器FS方法进行了比较,即带 Lévy 飞行的二进制蛾火焰优化(LBMFO_V3)和超学习二进制蜻蜓算法(HLBDA),以及四种过滤方法,即卡方检验、Relief、基于相关性的特征选择和信息增益。在23个医学基准数据集上,CHIO-GC以0.79的准确率超过了LBMFO_V3和四种过滤方法。当应用于COVID-19数据集时,CHIO-GC也以0.93的准确率超过了HLBDA。这些令人鼓舞的结果是通过在寻找正确解的过程中,在CHIO-GC的两个搜索阶段之间取得足够的平衡而获得的,这也提高了收敛速度。这是通过将贪婪交叉技术集成到CHIO算法中,以纠正过早收敛期间以及陷入局部最优搜索空间时找到的劣质解来实现的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b95/8553647/efd6abf02e71/gr1_lrg.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验