• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

精英二元狼搜索算法在高维生物信息学数据集启发式特征选择中的应用。

Elitist Binary Wolf Search Algorithm for Heuristic Feature Selection in High-Dimensional Bioinformatics Datasets.

机构信息

Department of Computer and Information Science, University of Macau, Macau SAR, China.

School of Computer Science and Engineering, University of New South Wales, New South Wales, Australia.

出版信息

Sci Rep. 2017 Jun 28;7(1):4354. doi: 10.1038/s41598-017-04037-5.

DOI:10.1038/s41598-017-04037-5
PMID:28659577
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5489518/
Abstract

Due to the high-dimensional characteristics of dataset, we propose a new method based on the Wolf Search Algorithm (WSA) for optimising the feature selection problem. The proposed approach uses the natural strategy established by Charles Darwin; that is, 'It is not the strongest of the species that survives, but the most adaptable'. This means that in the evolution of a swarm, the elitists are motivated to quickly obtain more and better resources. The memory function helps the proposed method to avoid repeat searches for the worst position in order to enhance the effectiveness of the search, while the binary strategy simplifies the feature selection problem into a similar problem of function optimisation. Furthermore, the wrapper strategy gathers these strengthened wolves with the classifier of extreme learning machine to find a sub-dataset with a reasonable number of features that offers the maximum correctness of global classification models. The experimental results from the six public high-dimensional bioinformatics datasets tested demonstrate that the proposed method can best some of the conventional feature selection methods up to 29% in classification accuracy, and outperform previous WSAs by up to 99.81% in computational time.

摘要

由于数据集的高维特征,我们提出了一种基于 Wolf Search Algorithm(WSA)的新方法,用于优化特征选择问题。所提出的方法利用了查尔斯·达尔文(Charles Darwin)建立的自然策略;也就是说,“不是最强的物种生存,而是最适应的物种生存”。这意味着在群体的进化中,精英们被激励着快速获得更多更好的资源。记忆功能有助于所提出的方法避免对最差位置的重复搜索,以提高搜索的有效性,而二进制策略则将特征选择问题简化为类似的函数优化问题。此外,包装器策略将这些强化的狼与极端学习机的分类器结合起来,以找到一个具有合理特征数量的子数据集,从而为全局分类模型提供最大的正确性。从六个公共的高维生物信息学数据集进行的实验结果表明,所提出的方法在分类准确性方面可以胜过一些传统的特征选择方法高达 29%,并且在计算时间方面可以胜过以前的 WSA 高达 99.81%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/e7df36a6f489/41598_2017_4037_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/226afbdba467/41598_2017_4037_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/9afcc09bb99a/41598_2017_4037_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/61c8137d2ae5/41598_2017_4037_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/15aad5e041d5/41598_2017_4037_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/6f4ee5b6a748/41598_2017_4037_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/b0ed3c8f8b9e/41598_2017_4037_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/01cbe1a390fa/41598_2017_4037_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/b78b90832a13/41598_2017_4037_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/b2fade409acd/41598_2017_4037_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/e7df36a6f489/41598_2017_4037_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/226afbdba467/41598_2017_4037_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/9afcc09bb99a/41598_2017_4037_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/61c8137d2ae5/41598_2017_4037_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/15aad5e041d5/41598_2017_4037_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/6f4ee5b6a748/41598_2017_4037_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/b0ed3c8f8b9e/41598_2017_4037_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/01cbe1a390fa/41598_2017_4037_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/b78b90832a13/41598_2017_4037_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/b2fade409acd/41598_2017_4037_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/84cb/5489518/e7df36a6f489/41598_2017_4037_Fig10_HTML.jpg

相似文献

1
Elitist Binary Wolf Search Algorithm for Heuristic Feature Selection in High-Dimensional Bioinformatics Datasets.精英二元狼搜索算法在高维生物信息学数据集启发式特征选择中的应用。
Sci Rep. 2017 Jun 28;7(1):4354. doi: 10.1038/s41598-017-04037-5.
2
An Innovative Excited-ACS-IDGWO Algorithm for Optimal Biomedical Data Feature Selection.一种创新的基于激发 ACS-IDGWO 算法的最优生物医学数据特征选择方法。
Biomed Res Int. 2020 Aug 17;2020:8506365. doi: 10.1155/2020/8506365. eCollection 2020.
3
CCFS: A cooperating coevolution technique for large scale feature selection on microarray datasets.CCFS:一种用于微阵列数据集大规模特征选择的协同协同进化技术。
Comput Biol Chem. 2018 Apr;73:171-178. doi: 10.1016/j.compbiolchem.2018.02.006. Epub 2018 Feb 17.
4
An Improved Binary Differential Evolution Algorithm for Feature Selection in Molecular Signatures.基于改进二进制差分进化算法的分子特征选择
Mol Inform. 2018 Apr;37(4):e1700081. doi: 10.1002/minf.201700081. Epub 2017 Nov 6.
5
A new hybrid filter/wrapper algorithm for feature selection in classification.一种用于分类中特征选择的新型混合过滤/包装算法。
Anal Chim Acta. 2019 Nov 8;1080:43-54. doi: 10.1016/j.aca.2019.06.054. Epub 2019 Jun 28.
6
A novel feature selection approach for biomedical data classification.一种用于生物医学数据分类的新特征选择方法。
J Biomed Inform. 2010 Feb;43(1):15-23. doi: 10.1016/j.jbi.2009.07.008. Epub 2009 Jul 30.
7
A Hybrid Feature Selection Method Based on Binary State Transition Algorithm and ReliefF.基于二进制状态转换算法和 ReliefF 的混合特征选择方法。
IEEE J Biomed Health Inform. 2019 Sep;23(5):1888-1898. doi: 10.1109/JBHI.2018.2872811. Epub 2018 Sep 28.
8
Enhanced Ali Baba and the forty thieves algorithm for feature selection.用于特征选择的增强型阿里巴巴与四十大盗算法
Neural Comput Appl. 2023;35(8):6153-6184. doi: 10.1007/s00521-022-08015-5. Epub 2022 Nov 15.
9
Binary Simulated Normal Distribution Optimizer for feature selection: Theory and application in COVID-19 datasets.用于特征选择的二元模拟正态分布优化器:理论及在新冠肺炎数据集上的应用
Expert Syst Appl. 2022 Aug 15;200:116834. doi: 10.1016/j.eswa.2022.116834. Epub 2022 Mar 15.
10
Identifying (Quasi) Equally Informative Subsets in Feature Selection Problems for Classification: A Max-Relevance Min-Redundancy Approach.在分类的特征选择问题中识别(准)等信息量子集:一种最大相关性最小冗余方法。
IEEE Trans Cybern. 2016 Jun;46(6):1424-37. doi: 10.1109/TCYB.2015.2444435. Epub 2015 Jul 6.

引用本文的文献

1
A deep learning based framework for the classification of multi- class capsule gastroscope image in gastroenterologic diagnosis.一种基于深度学习的框架,用于在胃肠病诊断中对多类胶囊胃镜图像进行分类。
Front Physiol. 2022 Nov 18;13:1060591. doi: 10.3389/fphys.2022.1060591. eCollection 2022.
2
Prediction Model of Postoperative Severe Hypocalcemia in Patients with Secondary Hyperparathyroidism Based on Logistic Regression and XGBoost Algorithm.基于逻辑回归和 XGBoost 算法的继发性甲状旁腺功能亢进术后重度低钙血症预测模型。
Comput Math Methods Med. 2022 Jul 25;2022:8752826. doi: 10.1155/2022/8752826. eCollection 2022.
3

本文引用的文献

1
Pretata: predicting TATA binding proteins with novel features and dimensionality reduction strategy.Pretata:运用新特征和降维策略预测TATA结合蛋白
BMC Syst Biol. 2016 Dec 23;10(Suppl 4):114. doi: 10.1186/s12918-016-0353-5.
2
Adaptive swarm cluster-based dynamic multi-objective synthetic minority oversampling technique algorithm for tackling binary imbalanced datasets in biomedical data classification.基于自适应群体聚类的动态多目标合成少数类过采样技术算法,用于处理生物医学数据分类中的二元不平衡数据集。
BioData Min. 2016 Dec 1;9:37. doi: 10.1186/s13040-016-0117-1. eCollection 2016.
3
Survey of MapReduce frame operation in bioinformatics.
Evaluation of Traumatic Subdural Hematoma Volume by Using Image Segmentation Assessment Based on Deep Learning.
基于深度学习的图像分割评估对创伤性硬脑膜下血肿体积的评估。
Comput Math Methods Med. 2022 Jun 28;2022:3830245. doi: 10.1155/2022/3830245. eCollection 2022.
4
Construction of Prediction Model of Renal Damage in Children with Henoch-Schönlein Purpura Based on Machine Learning.基于机器学习的儿童过敏性紫癜肾损害预测模型的构建。
Comput Math Methods Med. 2022 May 23;2022:6991218. doi: 10.1155/2022/6991218. eCollection 2022.
5
BOSO: A novel feature selection algorithm for linear regression with high-dimensional data.BOSO:一种用于高维数据线性回归的新型特征选择算法。
PLoS Comput Biol. 2022 May 31;18(5):e1010180. doi: 10.1371/journal.pcbi.1010180. eCollection 2022 May.
6
An Efficient hybrid filter-wrapper metaheuristic-based gene selection method for high dimensional datasets.基于高效混合过滤-包装元启发式算法的高维数据集基因选择方法。
Sci Rep. 2019 Dec 9;9(1):18580. doi: 10.1038/s41598-019-54987-1.
7
Potential biomarkers of acute myocardial infarction based on weighted gene co-expression network analysis.基于加权基因共表达网络分析的急性心肌梗死潜在生物标志物。
Biomed Eng Online. 2019 Jan 25;18(1):9. doi: 10.1186/s12938-019-0625-6.
8
Semi-Supervised Maximum Discriminative Local Margin for Gene Selection.半监督最大鉴别局部边缘的基因选择。
Sci Rep. 2018 Jun 5;8(1):8619. doi: 10.1038/s41598-018-26806-6.
9
Simulation as a preoperative planning approach in advanced heart failure patients. A retrospective clinical analysis.模拟作为一种术前规划方法在晚期心力衰竭患者中的应用。一项回顾性临床分析。
Biomed Eng Online. 2018 May 2;17(1):52. doi: 10.1186/s12938-018-0491-7.
生物信息学中MapReduce框架操作的调查。
Brief Bioinform. 2014 Jul;15(4):637-47. doi: 10.1093/bib/bbs088. Epub 2013 Feb 7.
4
Feature selection in MLPs and SVMs based on maximum output information.基于最大输出信息的多层感知器和支持向量机中的特征选择
IEEE Trans Neural Netw. 2004 Jul;15(4):937-48. doi: 10.1109/TNN.2004.828772.