• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于改进型蛙跳算法的高维生物医学数据优化特征选择。

Feature Selection for Optimized High-Dimensional Biomedical Data Using an Improved Shuffled Frog Leaping Algorithm.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2018 Nov-Dec;15(6):1765-1773. doi: 10.1109/TCBB.2016.2602263. Epub 2016 Aug 24.

DOI:10.1109/TCBB.2016.2602263
PMID:28113635
Abstract

High dimensional biomedical datasets contain thousands of features which can be used in molecular diagnosis of disease, however, such datasets contain many irrelevant or weak correlation features which influence the predictive accuracy of diagnosis. Without a feature selection algorithm, it is difficult for the existing classification techniques to accurately identify patterns in the features. The purpose of feature selection is to not only identify a feature subset from an original set of features [without reducing the predictive accuracy of classification algorithm] but also reduce the computation overhead in data mining. In this paper, we present our improved shuffled frog leaping algorithm which introduces a chaos memory weight factor, an absolute balance group strategy, and an adaptive transfer factor. Our proposed approach explores the space of possible subsets to obtain the set of features that maximizes the predictive accuracy and minimizes irrelevant features in high-dimensional biomedical data. To evaluate the effectiveness of our proposed method, we have employed the K-nearest neighbor method with a comparative analysis in which we compare our proposed approach with genetic algorithms, particle swarm optimization, and the shuffled frog leaping algorithm. Experimental results show that our improved algorithm achieves improvements in the identification of relevant subsets and in classification accuracy.

摘要

高维生物医学数据集包含数千个特征,可用于疾病的分子诊断,然而,此类数据集包含许多不相关或弱相关的特征,这会影响诊断的预测准确性。如果没有特征选择算法,现有的分类技术就很难准确识别特征中的模式。特征选择的目的不仅是从原始特征集中选择一个特征子集[而不会降低分类算法的预测准确性],还可以降低数据挖掘中的计算开销。在本文中,我们提出了一种改进的蛙跳算法,该算法引入了混沌记忆权重因子、绝对平衡组策略和自适应转移因子。我们提出的方法探索了可能的子集空间,以获得能够最大化预测准确性并最小化高维生物医学数据中不相关特征的特征集。为了评估我们提出的方法的有效性,我们采用了 K-最近邻方法,并进行了对比分析,将我们提出的方法与遗传算法、粒子群优化和蛙跳算法进行了比较。实验结果表明,我们的改进算法在相关子集的识别和分类准确性方面都取得了提高。

相似文献

1
Feature Selection for Optimized High-Dimensional Biomedical Data Using an Improved Shuffled Frog Leaping Algorithm.基于改进型蛙跳算法的高维生物医学数据优化特征选择。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Nov-Dec;15(6):1765-1773. doi: 10.1109/TCBB.2016.2602263. Epub 2016 Aug 24.
2
Feature Selection in High Dimensional Biomedical Data Based on BF-SFLA.基于布谷鸟搜索-正弦余弦算法的高维生物医学数据特征选择
Front Neurosci. 2022 Apr 18;16:854685. doi: 10.3389/fnins.2022.854685. eCollection 2022.
3
A hybrid rough set shuffled frog leaping knowledge inference system for diagnosis of lung cancer disease.一种用于肺癌诊断的混合粗糙集蛙跳知识推理系统。
Comput Biol Med. 2023 Mar;155:106662. doi: 10.1016/j.compbiomed.2023.106662. Epub 2023 Feb 13.
4
A comparative analysis of swarm intelligence techniques for feature selection in cancer classification.用于癌症分类中特征选择的群体智能技术的比较分析。
ScientificWorldJournal. 2014;2014:693831. doi: 10.1155/2014/693831. Epub 2014 Aug 3.
5
An improved binary particle swarm optimization algorithm for clinical cancer biomarker identification in microarray data.一种用于微阵列数据中临床癌症生物标志物识别的改进二元粒子群优化算法。
Comput Methods Programs Biomed. 2024 Feb;244:107987. doi: 10.1016/j.cmpb.2023.107987. Epub 2023 Dec 21.
6
Underwater sonar image detection: A combination of non-local spatial information and quantum-inspired shuffled frog leaping algorithm.水下声纳图像检测:非局部空间信息与量子启发式混合蛙跳算法的结合
PLoS One. 2017 May 18;12(5):e0177666. doi: 10.1371/journal.pone.0177666. eCollection 2017.
7
An Innovative Excited-ACS-IDGWO Algorithm for Optimal Biomedical Data Feature Selection.一种创新的基于激发 ACS-IDGWO 算法的最优生物医学数据特征选择方法。
Biomed Res Int. 2020 Aug 17;2020:8506365. doi: 10.1155/2020/8506365. eCollection 2020.
8
A modified shuffled frog leaping algorithm with inertia weight.一种带有惯性权重的改进型混合蛙跳算法。
Sci Rep. 2024 Feb 20;14(1):4146. doi: 10.1038/s41598-024-51306-1.
9
C-HMOSHSSA: Gene selection for cancer classification using multi-objective meta-heuristic and machine learning methods.C-HMOSHSSA:使用多目标元启发式和机器学习方法进行癌症分类的基因选择。
Comput Methods Programs Biomed. 2019 Sep;178:219-235. doi: 10.1016/j.cmpb.2019.06.029. Epub 2019 Jun 29.
10
An Efficient hybrid filter-wrapper metaheuristic-based gene selection method for high dimensional datasets.基于高效混合过滤-包装元启发式算法的高维数据集基因选择方法。
Sci Rep. 2019 Dec 9;9(1):18580. doi: 10.1038/s41598-019-54987-1.

引用本文的文献

1
An Efficient Binary Sand Cat Swarm Optimization for Feature Selection in High-Dimensional Biomedical Data.一种用于高维生物医学数据特征选择的高效二进制沙猫群优化算法
Bioengineering (Basel). 2023 Sep 25;10(10):1123. doi: 10.3390/bioengineering10101123.
2
A review of swarm-based metaheuristic optimization techniques and their application to doubly fed induction generator.基于群体的元启发式优化技术及其在双馈感应发电机中的应用综述。
Heliyon. 2022 Oct 7;8(10):e10956. doi: 10.1016/j.heliyon.2022.e10956. eCollection 2022 Oct.
3
Feature Selection in High Dimensional Biomedical Data Based on BF-SFLA.
基于布谷鸟搜索-正弦余弦算法的高维生物医学数据特征选择
Front Neurosci. 2022 Apr 18;16:854685. doi: 10.3389/fnins.2022.854685. eCollection 2022.
4
Data analysis methods for defining biomarkers from omics data.用于从组学数据中定义生物标志物的数据分析方法。
Anal Bioanal Chem. 2022 Jan;414(1):235-250. doi: 10.1007/s00216-021-03813-7. Epub 2021 Dec 24.
5
Bio-inspired computation for big data fusion, storage, processing, learning and visualization: state of the art and future directions.用于大数据融合、存储、处理、学习和可视化的生物启发式计算:现状与未来方向。
Neural Comput Appl. 2021 Aug 3:1-31. doi: 10.1007/s00521-021-06332-9.
6
Review of Big Data Analytics, Artificial Intelligence and Nature-Inspired Computing Models towards Accurate Detection of COVID-19 Pandemic Cases and Contact Tracing.大数据分析、人工智能和受自然启发的计算模型在新冠疫情病例的准确检测和接触者追踪方面的研究综述。
Int J Environ Res Public Health. 2020 Jul 24;17(15):5330. doi: 10.3390/ijerph17155330.
7
An Efficient hybrid filter-wrapper metaheuristic-based gene selection method for high dimensional datasets.基于高效混合过滤-包装元启发式算法的高维数据集基因选择方法。
Sci Rep. 2019 Dec 9;9(1):18580. doi: 10.1038/s41598-019-54987-1.
8
Supervised Nonnegative Matrix Factorization to Predict ICU Mortality Risk.监督非负矩阵分解以预测重症监护病房死亡风险。
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2018 Dec;2018:1189-1194. doi: 10.1109/BIBM.2018.8621403. Epub 2019 Jan 24.
9
Classification of high dimensional biomedical data based on feature selection using redundant removal.基于冗余消除的特征选择对高维生物医学数据的分类。
PLoS One. 2019 Apr 9;14(4):e0214406. doi: 10.1371/journal.pone.0214406. eCollection 2019.