• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

二进制矮狐优化器,用于解决高维特征选择问题。

Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems.

机构信息

School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, KwaZulu-Natal, South Africa.

Department of Computer Science, Federal University of Lafia, Lafia, Nasarawa State, Nigeria.

出版信息

PLoS One. 2022 Oct 6;17(10):e0274850. doi: 10.1371/journal.pone.0274850. eCollection 2022.

DOI:10.1371/journal.pone.0274850
PMID:36201524
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9536540/
Abstract

Selecting appropriate feature subsets is a vital task in machine learning. Its main goal is to remove noisy, irrelevant, and redundant feature subsets that could negatively impact the learning model's accuracy and improve classification performance without information loss. Therefore, more advanced optimization methods have been employed to locate the optimal subset of features. This paper presents a binary version of the dwarf mongoose optimization called the BDMO algorithm to solve the high-dimensional feature selection problem. The effectiveness of this approach was validated using 18 high-dimensional datasets from the Arizona State University feature selection repository and compared the efficacy of the BDMO with other well-known feature selection techniques in the literature. The results show that the BDMO outperforms other methods producing the least average fitness value in 14 out of 18 datasets which means that it achieved 77.77% on the overall best fitness values. The result also shows BDMO demonstrating stability by returning the least standard deviation (SD) value in 13 of 18 datasets (72.22%). Furthermore, the study achieved higher validation accuracy in 15 of the 18 datasets (83.33%) over other methods. The proposed approach also yielded the highest validation accuracy attainable in the COIL20 and Leukemia datasets which vividly portray the superiority of the BDMO.

摘要

选择合适的特征子集是机器学习中的一项重要任务。其主要目的是去除可能对学习模型的准确性产生负面影响的噪声、不相关和冗余的特征子集,同时在不损失信息的情况下提高分类性能。因此,更多先进的优化方法被用于寻找最优的特征子集。本文提出了一种称为 BDMO 的二进制矮袋鼠优化算法,用于解决高维特征选择问题。该方法的有效性通过使用来自亚利桑那州立大学特征选择库的 18 个高维数据集进行验证,并将 BDMO 的效果与文献中其他著名的特征选择技术进行比较。结果表明,BDMO 在 18 个数据集的 14 个数据集上产生了最小的平均适应度值,这意味着它在整体最佳适应度值上达到了 77.77%。结果还表明,BDMO 在 13 个数据集(72.22%)中返回了最小的标准偏差(SD)值,表现出了稳定性。此外,该方法在 18 个数据集的 15 个数据集(83.33%)中实现了更高的验证准确性,优于其他方法。该方法在 COIL20 和白血病数据集上也取得了最高的验证准确性,生动地展示了 BDMO 的优越性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00df/9536540/f6aea536f6f8/pone.0274850.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00df/9536540/4644a1f49628/pone.0274850.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00df/9536540/4dee9bc23d71/pone.0274850.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00df/9536540/60b545b71e9d/pone.0274850.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00df/9536540/da76f50ff867/pone.0274850.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00df/9536540/f6aea536f6f8/pone.0274850.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00df/9536540/4644a1f49628/pone.0274850.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00df/9536540/4dee9bc23d71/pone.0274850.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00df/9536540/60b545b71e9d/pone.0274850.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00df/9536540/da76f50ff867/pone.0274850.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/00df/9536540/f6aea536f6f8/pone.0274850.g005.jpg

相似文献

1
Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems.二进制矮狐优化器,用于解决高维特征选择问题。
PLoS One. 2022 Oct 6;17(10):e0274850. doi: 10.1371/journal.pone.0274850. eCollection 2022.
2
A hybrid binary dwarf mongoose optimization algorithm with simulated annealing for feature selection on high dimensional multi-class datasets.一种结合了二进制矮袋鼠优化算法和模拟退火算法的混合算法,用于高维多类数据集上的特征选择。
Sci Rep. 2022 Sep 2;12(1):14945. doi: 10.1038/s41598-022-18993-0.
3
Feature Selection and Dwarf Mongoose Optimization Enabled Deep Learning for Heart Disease Detection.基于特征选择和矮跳蝾螈优化的深度学习在心脏病检测中的应用。
Comput Intell Neurosci. 2022 Dec 7;2022:2819378. doi: 10.1155/2022/2819378. eCollection 2022.
4
CDMO: Chaotic Dwarf Mongoose Optimization Algorithm for feature selection.CDMO:用于特征选择的混沌侏儒猫鼬优化算法
Sci Rep. 2024 Jan 6;14(1):701. doi: 10.1038/s41598-023-50959-8.
5
Hybrid Gradient Descent Grey Wolf Optimizer for Optimal Feature Selection.基于混合梯度下降灰狼优化算法的最优特征选择。
Biomed Res Int. 2021 Aug 28;2021:2555622. doi: 10.1155/2021/2555622. eCollection 2021.
6
Novel chaotic oppositional fruit fly optimization algorithm for feature selection applied on COVID 19 patients' health prediction.新型混沌反对派果蝇优化算法在 COVID-19 患者健康预测中的特征选择应用。
PLoS One. 2022 Oct 10;17(10):e0275727. doi: 10.1371/journal.pone.0275727. eCollection 2022.
7
Advanced dwarf mongoose optimization for solving CEC 2011 and CEC 2017 benchmark problems.高级矮狐优化算法求解 CEC 2011 和 CEC 2017 基准问题。
PLoS One. 2022 Nov 2;17(11):e0275346. doi: 10.1371/journal.pone.0275346. eCollection 2022.
8
An Innovative Excited-ACS-IDGWO Algorithm for Optimal Biomedical Data Feature Selection.一种创新的基于激发 ACS-IDGWO 算法的最优生物医学数据特征选择方法。
Biomed Res Int. 2020 Aug 17;2020:8506365. doi: 10.1155/2020/8506365. eCollection 2020.
9
An improved binary particle swarm optimization algorithm for clinical cancer biomarker identification in microarray data.一种用于微阵列数据中临床癌症生物标志物识别的改进二元粒子群优化算法。
Comput Methods Programs Biomed. 2024 Feb;244:107987. doi: 10.1016/j.cmpb.2023.107987. Epub 2023 Dec 21.
10
A Novel Method Based on Nonlinear Binary Grasshopper Whale Optimization Algorithm for Feature Selection.一种基于非线性二进制蚱蜢鲸鱼优化算法的特征选择新方法。
J Bionic Eng. 2023;20(1):237-252. doi: 10.1007/s42235-022-00253-6. Epub 2022 Sep 7.

引用本文的文献

1
Feature Selection Problem and Metaheuristics: A Systematic Literature Review about Its Formulation, Evaluation and Applications.特征选择问题与元启发式算法:关于其公式化、评估及应用的系统文献综述
Biomimetics (Basel). 2023 Dec 25;9(1):9. doi: 10.3390/biomimetics9010009.
2
Feature selection for high dimensional microarray gene expression data via weighted signal to noise ratio.基于加权信噪比的高维微阵列基因表达数据特征选择。
PLoS One. 2023 Apr 25;18(4):e0284619. doi: 10.1371/journal.pone.0284619. eCollection 2023.
3
Evolutionary binary feature selection using adaptive ebola optimization search algorithm for high-dimensional datasets.

本文引用的文献

1
Multiclass feature selection with metaheuristic optimization algorithms: a review.基于元启发式优化算法的多类特征选择:综述
Neural Comput Appl. 2022;34(22):19751-19790. doi: 10.1007/s00521-022-07705-4. Epub 2022 Aug 30.
2
A hybrid binary dwarf mongoose optimization algorithm with simulated annealing for feature selection on high dimensional multi-class datasets.一种结合了二进制矮袋鼠优化算法和模拟退火算法的混合算法,用于高维多类数据集上的特征选择。
Sci Rep. 2022 Sep 2;12(1):14945. doi: 10.1038/s41598-022-18993-0.
3
Novel Feature Selection and Voting Classifier Algorithms for COVID-19 Classification in CT Images.
使用自适应埃博拉优化搜索算法进行高维数据集的进化二进制特征选择。
PLoS One. 2023 Mar 17;18(3):e0282812. doi: 10.1371/journal.pone.0282812. eCollection 2023.
4
Improved Dwarf Mongoose Optimization for Constrained Engineering Design Problems.用于约束工程设计问题的改进型矮猫鼬优化算法
J Bionic Eng. 2023;20(3):1263-1295. doi: 10.1007/s42235-022-00316-8. Epub 2022 Dec 13.
用于CT图像中COVID-19分类的新型特征选择和投票分类器算法
IEEE Access. 2020 Sep 30;8:179317-179335. doi: 10.1109/ACCESS.2020.3028012. eCollection 2020.
4
Hybrid Gradient Descent Grey Wolf Optimizer for Optimal Feature Selection.基于混合梯度下降灰狼优化算法的最优特征选择。
Biomed Res Int. 2021 Aug 28;2021:2555622. doi: 10.1155/2021/2555622. eCollection 2021.
5
Hybrid Binary Dragonfly Algorithm with Simulated Annealing for Feature Selection.结合模拟退火的混合二进制蜻蜓算法用于特征选择
SN Comput Sci. 2021;2(4):295. doi: 10.1007/s42979-021-00687-5. Epub 2021 May 25.
6
Development of a two-stage gene selection method that incorporates a novel hybrid approach using the cuckoo optimization algorithm and harmony search for cancer classification.一种两阶段基因选择方法的开发,该方法结合了一种使用布谷鸟优化算法和和声搜索的新型混合方法用于癌症分类。
J Biomed Inform. 2017 Mar;67:11-20. doi: 10.1016/j.jbi.2017.01.016. Epub 2017 Feb 3.
7
A Novel Feature Selection Technique for Text Classification Using Naïve Bayes.一种使用朴素贝叶斯进行文本分类的新型特征选择技术。
Int Sch Res Notices. 2014 Oct 28;2014:717092. doi: 10.1155/2014/717092. eCollection 2014.
8
A competitive swarm optimizer for large scale optimization.一种用于大规模优化的竞争型群体智能优化算法。
IEEE Trans Cybern. 2015 Feb;45(2):191-204. doi: 10.1109/TCYB.2014.2322602. Epub 2014 May 20.
9
Particle swarm optimization for feature selection in classification: a multi-objective approach.粒子群优化在分类中的特征选择:一种多目标方法。
IEEE Trans Cybern. 2013 Dec;43(6):1656-71. doi: 10.1109/TSMCB.2012.2227469.
10
Discriminative semi-supervised feature selection via manifold regularization.基于流形正则化的判别式半监督特征选择
IEEE Trans Neural Netw. 2010 Jul;21(7):1033-47. doi: 10.1109/TNN.2010.2047114. Epub 2010 Jun 21.