• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于混合模糊粗糙集的启发式算法在高维基因数据特征选择中的一种新的种群初始化方法。

A new population initialization of metaheuristic algorithms based on hybrid fuzzy rough set for high-dimensional gene data feature selection.

机构信息

College of Computer Science and Technology, Jilin University, Changchun, 130012, China.

College of Information Technology, Jilin Agricultural University, Changchun, 130118, China.

出版信息

Comput Biol Med. 2023 Nov;166:107538. doi: 10.1016/j.compbiomed.2023.107538. Epub 2023 Oct 4.

DOI:10.1016/j.compbiomed.2023.107538
PMID:37857136
Abstract

In the realm of modern medicine and biology, vast amounts of genetic data with high complexity are available. However, dealing with such high-dimensional data poses challenges due to increased processing complexity and size. Identifying critical genes to reduce data dimensionality is essential. The filter-wrapper hybrid method is a commonly used approach in feature selection. Most of these methods employ filters such as MRMR and ReliefF, but the performance of these simple filters is limited. Rough set methods, on the other hand, are a type of filter method that outperforms traditional filters. Simultaneously, many studies have pointed out the crucial importance of good initialization strategies for the performance of the metaheuristic algorithm (a type of wrapper-based method). Combining these two points, this paper proposes a novel filter-wrapper hybrid method for high-dimensional feature selection. To be specific, we utilize the variant of bWOA (binary Whale Optimization Algorithm) based on Hybrid Fuzzy Rough Set to perform attribute reduction, and the reduced attributes are used as prior knowledge to initialize the population. We then employ metaheuristics for further feature selection based on this initialized population. We conducted experiments using five different algorithms on 14 UCI datasets. The experiment results show that after applying the initialization method proposed in this article, the performance of five enhanced algorithms, has shown significant improvement. Particularly, the improved bMFO using our initialization method: fuzzy_bMFO outperformed six currently advanced algorithms, indicating that our initialization method for metaheuristic algorithms is suitable for high-dimensional feature selection tasks.

摘要

在现代医学和生物学领域,存在大量具有高度复杂性的基因数据。然而,由于处理的高维数据的复杂性和规模增加,对其进行处理面临着挑战。确定关键基因以降低数据维度至关重要。过滤-包装混合方法是特征选择中常用的方法之一。这些方法大多数都采用 MRMR 和 ReliefF 等过滤器,但这些简单过滤器的性能有限。另一方面,粗糙集方法是一种优于传统过滤器的过滤方法。同时,许多研究指出,对于元启发式算法(一种基于包装的方法)的性能而言,良好的初始化策略非常重要。结合这两点,本文提出了一种用于高维特征选择的新型过滤-包装混合方法。具体来说,我们利用基于 Hybrid Fuzzy Rough Set 的二进制鲸鱼优化算法 (bWOA) 变体来执行属性约简,然后将约简后的属性用作初始化种群的先验知识。之后,我们基于这个初始化的种群,利用元启发式算法进一步进行特征选择。我们在 14 个 UCI 数据集上使用五种不同的算法进行了实验。实验结果表明,在应用本文提出的初始化方法后,五种增强算法的性能都得到了显著提高。特别是,我们的初始化方法改进的 bMFO(模糊 bMFO)在性能上优于六个目前先进的算法,这表明我们的元启发式算法初始化方法适用于高维特征选择任务。

相似文献

1
A new population initialization of metaheuristic algorithms based on hybrid fuzzy rough set for high-dimensional gene data feature selection.基于混合模糊粗糙集的启发式算法在高维基因数据特征选择中的一种新的种群初始化方法。
Comput Biol Med. 2023 Nov;166:107538. doi: 10.1016/j.compbiomed.2023.107538. Epub 2023 Oct 4.
2
An Innovative Excited-ACS-IDGWO Algorithm for Optimal Biomedical Data Feature Selection.一种创新的基于激发 ACS-IDGWO 算法的最优生物医学数据特征选择方法。
Biomed Res Int. 2020 Aug 17;2020:8506365. doi: 10.1155/2020/8506365. eCollection 2020.
3
A novel nonlinear time-varying sigmoid transfer function in binary whale optimization algorithm for descriptors selection in drug classification.用于药物分类中描述符选择的二进制鲸鱼优化算法中的一种新型非线性时变Sigmoid传递函数。
Mol Divers. 2023 Feb;27(1):71-80. doi: 10.1007/s11030-022-10410-y. Epub 2022 Mar 7.
4
A Tri-Stage Wrapper-Filter Feature Selection Framework for Disease Classification.三阶段包装器-过滤器特征选择框架用于疾病分类。
Sensors (Basel). 2021 Aug 18;21(16):5571. doi: 10.3390/s21165571.
5
Feature Genes Selection Using Fuzzy Rough Uncertainty Metric for Tumor Diagnosis.基于模糊粗糙不确定性度量的肿瘤诊断特征基因选择
Comput Math Methods Med. 2019 Jan 27;2019:6705648. doi: 10.1155/2019/6705648. eCollection 2019.
6
RSPOP: rough set-based pseudo outer-product fuzzy rule identification algorithm.RSPOP:基于粗糙集的伪外积模糊规则识别算法。
Neural Comput. 2005 Jan;17(1):205-43. doi: 10.1162/0899766052530857.
7
Feature selection based on neighborhood rough sets and Gini index.基于邻域粗糙集和基尼指数的特征选择
PeerJ Comput Sci. 2023 Dec 12;9:e1711. doi: 10.7717/peerj-cs.1711. eCollection 2023.
8
A Hybrid Feature Selection Method Based on Binary State Transition Algorithm and ReliefF.基于二进制状态转换算法和 ReliefF 的混合特征选择方法。
IEEE J Biomed Health Inform. 2019 Sep;23(5):1888-1898. doi: 10.1109/JBHI.2018.2872811. Epub 2018 Sep 28.
9
A Hybrid Gene Selection Method Based on ReliefF and Ant Colony Optimization Algorithm for Tumor Classification.基于 ReliefF 和蚁群优化算法的混合基因选择方法在肿瘤分类中的应用。
Sci Rep. 2019 Jun 20;9(1):8978. doi: 10.1038/s41598-019-45223-x.
10
Two-stage feature selection for classification of gene expression data based on an improved Salp Swarm Algorithm.基于改进的鹽蝽群算法的基因表达数据分类的两阶段特征选择
Math Biosci Eng. 2022 Sep 19;19(12):13747-13781. doi: 10.3934/mbe.2022641.

引用本文的文献

1
A Multi-Strategy Improved Northern Goshawk Optimization Algorithm for Optimizing Engineering Problems.一种用于优化工程问题的多策略改进苍鹰优化算法
Biomimetics (Basel). 2024 Sep 16;9(9):561. doi: 10.3390/biomimetics9090561.