Bacanin Nebojsa, Venkatachalam K, Bezdan Timea, Zivkovic Miodrag, Abouhawwash Mohamed
Singidunum University, Danijelova 32, 11000 Belgrade, Serbia.
Department of Applied Cybernetics, Faculty of Science, University of Hradec Králové, 50003 Hradec Králové, Czech Republic.
Microprocess Microsyst. 2023 Apr;98:104778. doi: 10.1016/j.micpro.2023.104778. Epub 2023 Feb 6.
Feature selection is one of the most important challenges in machine learning and data science. The process is usually performed in the data preprocessing phase, where the data is transformed into a format suitable for further processing by machine learning algorithms. Many real-world datasets are highly dimensional, with many irrelevant and even redundant features. Such features do not improve classification accuracy and can even degrade the performance of a classifier. The goal of feature selection is to find an optimal (or sub-optimal) subset of features that contains the relevant information in the dataset, from which machine learning algorithms can derive useful conclusions. In this manuscript, a novel version of the firefly algorithm (FA) is proposed and adapted to the feature selection challenge. The proposed method significantly improves the performance of the basic FA and also outperforms other state-of-the-art metaheuristics on both benchmark bound-constrained and practical feature selection tasks. The method was first validated on standard unconstrained benchmarks and then applied to feature selection using 21 standard University of California, Irvine (UCI) datasets. Moreover, the presented approach was also tested on a relatively novel COVID-19 dataset for predicting patient health and on one microcontroller microarray dataset. The results obtained in all practical simulations attest to the robustness and efficiency of the proposed algorithm in terms of convergence, solution quality, and classification accuracy. More precisely, the proposed approach obtained the best classification accuracy on 13 of the 21 datasets, significantly outperforming the other competitor methods.
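To illustrate the general idea of wrapper-based feature selection with a firefly-type metaheuristic, the minimal sketch below uses a binary FA with a sigmoid transfer function, a k-nearest-neighbours classifier as the wrapper, and the scikit-learn breast cancer dataset. All of these choices (the transfer function, the fitness weighting, the classifier, the dataset, and the parameter values) are assumptions made for illustration only; this is not the improved FA variant proposed in the manuscript.

```python
# Hedged sketch: binary firefly algorithm (BFA) for wrapper feature selection.
# NOT the authors' exact variant; only an illustration of the general approach.
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

rng = np.random.default_rng(42)

def fitness(mask, X, y, w=0.9):
    """Weighted sum of classification error and selected-feature ratio (lower is better)."""
    if mask.sum() == 0:                       # penalize empty feature subsets
        return 1.0
    clf = KNeighborsClassifier(n_neighbors=5)
    acc = cross_val_score(clf, X[:, mask == 1], y, cv=3).mean()
    return w * (1.0 - acc) + (1.0 - w) * mask.sum() / mask.size

def binary_firefly_fs(X, y, n_fireflies=10, iters=20,
                      beta0=1.0, gamma=1.0, alpha=0.2):
    d = X.shape[1]
    pos = rng.uniform(-1, 1, size=(n_fireflies, d))          # continuous positions
    masks = (1 / (1 + np.exp(-pos)) > rng.random((n_fireflies, d))).astype(int)
    fit = np.array([fitness(m, X, y) for m in masks])

    for _ in range(iters):
        for i in range(n_fireflies):
            for j in range(n_fireflies):
                if fit[j] < fit[i]:           # firefly j is "brighter" (lower cost)
                    r2 = np.sum((pos[i] - pos[j]) ** 2)
                    beta = beta0 * np.exp(-gamma * r2)        # attractiveness decays with distance
                    pos[i] += beta * (pos[j] - pos[i]) + alpha * (rng.random(d) - 0.5)
            # sigmoid transfer function maps the continuous position to a binary mask
            masks[i] = (1 / (1 + np.exp(-pos[i])) > rng.random(d)).astype(int)
            fit[i] = fitness(masks[i], X, y)

    best = fit.argmin()
    return masks[best], fit[best]

if __name__ == "__main__":
    X, y = load_breast_cancer(return_X_y=True)
    mask, best_fit = binary_firefly_fs(X, y)
    print(f"selected {mask.sum()}/{mask.size} features, fitness = {best_fit:.4f}")
```

The fitness function trades off classification error against subset size, a common formulation in the feature selection literature; the weight `w` and the population/iteration settings are arbitrary here and would need tuning in practice.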