GeFeS: A generalized wrapper feature selection approach for optimizing classification performance.

Authors

Sahebi Golnaz, Movahedi Parisa, Ebrahimi Masoumeh, Pahikkala Tapio, Plosila Juha, Tenhunen Hannu

Affiliations

Department of Future Technologies, University of Turku, Turku, FI-20014, Turun yliopisto, Finland.

Publication information

Comput Biol Med. 2020 Oct;125:103974. doi: 10.1016/j.compbiomed.2020.103974. Epub 2020 Aug 20.

DOI:10.1016/j.compbiomed.2020.103974
PMID:32890978
Abstract

In this paper, we propose a generalized wrapper-based feature selection, called GeFeS, which is based on a parallel new intelligent genetic algorithm (GA). The proposed GeFeS works properly under different numerical dataset dimensions and sizes, carefully tries to avoid overfitting and significantly enhances classification accuracy. To make the GA more accurate, robust and intelligent, we have proposed a new operator for features weighting, improved the mutation and crossover operators, and integrated nested cross-validation into the GA process to properly validate the learning model. The k-nearest neighbor (kNN) classifier is utilized to evaluate the goodness of selected features. We have evaluated the efficiency of GeFeS on various datasets selected from the UCI machine learning repository. The performance is compared with state-of-the-art classification and feature selection methods. The results demonstrate that GeFeS can significantly generalize the proposed multi-population intelligent genetic algorithm under different sizes of two-class and multi-class datasets. We have achieved the average classification accuracy of 95.83%, 97.62%, 99.02%, 98.51%, and 94.28% while reducing the number of features from 56 to 28, 34 to 18, 279 to 135, 30 to 16, and 19 to 9 under lung cancer, dermatology, arrhythmia, WDBC, and hepatitis, respectively.
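The abstract describes a wrapper approach: a genetic algorithm proposes feature subsets, and a kNN classifier scored by cross-validation judges each subset. The following is a minimal sketch of that general idea only, not the authors' GeFeS. It assumes a plain single-population GA with one-point crossover and bit-flip mutation, and it omits the paper's feature-weighting operator, improved operators, multi-population parallelism, and the outer loop of nested cross-validation. All function names, parameters, and the synthetic data are hypothetical.

```python
import random

def knn_predict(train_X, train_y, x, features, k=3):
    """Majority vote among the k nearest training rows, using squared
    Euclidean distance over the selected feature indices only."""
    dists = sorted(
        (sum((row[j] - x[j]) ** 2 for j in features), label)
        for row, label in zip(train_X, train_y)
    )
    votes = [label for _, label in dists[:k]]
    return max(set(votes), key=votes.count)

def cv_accuracy(X, y, mask, folds=3, k=3):
    """Wrapper fitness: k-fold cross-validated kNN accuracy for a 0/1 mask."""
    features = [j for j, bit in enumerate(mask) if bit]
    if not features:
        return 0.0  # an empty subset cannot classify anything
    correct = 0
    for f in range(folds):
        train_idx = [i for i in range(len(X)) if i % folds != f]
        test_idx = [i for i in range(len(X)) if i % folds == f]
        train_X = [X[i] for i in train_idx]
        train_y = [y[i] for i in train_idx]
        correct += sum(
            knn_predict(train_X, train_y, X[i], features, k) == y[i]
            for i in test_idx
        )
    return correct / len(X)

def ga_select(X, y, pop_size=12, generations=15, mutation_rate=0.1, seed=0):
    """Simple elitist GA over feature masks (an assumption, not GeFeS itself)."""
    rng = random.Random(seed)
    n = len(X[0])

    def fitness(mask):
        # Prefer higher accuracy; break ties in favour of fewer features.
        return (cv_accuracy(X, y, mask), -sum(mask))

    pop = [[rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        elite = sorted(pop, key=fitness, reverse=True)[: pop_size // 2]
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.sample(elite, 2)
            cut = rng.randrange(1, n)            # one-point crossover
            child = a[:cut] + b[cut:]
            child = [bit ^ 1 if rng.random() < mutation_rate else bit
                     for bit in child]           # bit-flip mutation
            children.append(child)
        pop = elite + children
    best = max(pop, key=fitness)
    return best, cv_accuracy(X, y, best)

def make_data(n_rows=60, n_noise=6, seed=1):
    """Synthetic two-class data: 2 informative features followed by noise."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n_rows):
        label = i % 2
        informative = [label * 3.0 + rng.gauss(0, 0.5) for _ in range(2)]
        noise = [rng.gauss(0, 3.0) for _ in range(n_noise)]
        X.append(informative + noise)
        y.append(label)
    return X, y

if __name__ == "__main__":
    X, y = make_data()
    mask, acc = ga_select(X, y)
    print("selected mask:", mask, "cv accuracy:", round(acc, 3))
```

Because the same cross-validation folds both guide the search and score the result, the reported accuracy in this sketch is optimistic. Holding out an outer fold that the GA never sees, as nested cross-validation does, would give a fairer estimate.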

Similar articles

1
GeFeS: A generalized wrapper feature selection approach for optimizing classification performance.
Comput Biol Med. 2020 Oct;125:103974. doi: 10.1016/j.compbiomed.2020.103974. Epub 2020 Aug 20.
2
Wrapper method for feature selection to classify cardiac arrhythmia.
Annu Int Conf IEEE Eng Med Biol Soc. 2017 Jul;2017:3656-3659. doi: 10.1109/EMBC.2017.8037650.
3
A Tri-Stage Wrapper-Filter Feature Selection Framework for Disease Classification.
Sensors (Basel). 2021 Aug 18;21(16):5571. doi: 10.3390/s21165571.
4
AVNM: A Voting based Novel Mathematical Rule for Image Classification.
Comput Methods Programs Biomed. 2016 Dec;137:195-201. doi: 10.1016/j.cmpb.2016.08.015. Epub 2016 Sep 26.
5
A universal deep learning approach for modeling the flow of patients under different severities.
Comput Methods Programs Biomed. 2018 Feb;154:191-203. doi: 10.1016/j.cmpb.2017.11.003. Epub 2017 Nov 7.
6
Clinical data classification using an enhanced SMOTE and chaotic evolutionary feature selection.
Comput Biol Med. 2020 Nov;126:103991. doi: 10.1016/j.compbiomed.2020.103991. Epub 2020 Sep 18.
7
Upper-Limb Motion Recognition Based on Hybrid Feature Selection: Algorithm Development and Validation.
JMIR Mhealth Uhealth. 2021 Sep 2;9(9):e24402. doi: 10.2196/24402.
8
Correlation-Based Ensemble Feature Selection Using Bioinspired Algorithms and Classification Using Backpropagation Neural Network.
Comput Math Methods Med. 2019 Sep 23;2019:7398307. doi: 10.1155/2019/7398307. eCollection 2019.
9
A Hybrid Feature Selection Method Based on Binary State Transition Algorithm and ReliefF.
IEEE J Biomed Health Inform. 2019 Sep;23(5):1888-1898. doi: 10.1109/JBHI.2018.2872811. Epub 2018 Sep 28.
10
NSICA: Multi-objective imperialist competitive algorithm for feature selection in arrhythmia diagnosis.
Comput Biol Med. 2023 Jul;161:107025. doi: 10.1016/j.compbiomed.2023.107025. Epub 2023 May 24.

Cited by

1
Effective clinical decision support implementation using a multi filter and wrapper optimisation model for Internet of Things based healthcare data.
Sci Rep. 2024 Sep 18;14(1):21820. doi: 10.1038/s41598-024-71726-3.
2
A novel hybrid algorithm based on Harris Hawks for tumor feature gene selection.
PeerJ Comput Sci. 2023 Feb 13;9:e1229. doi: 10.7717/peerj-cs.1229. eCollection 2023.
3
Multiclass feature selection with metaheuristic optimization algorithms: a review.
Neural Comput Appl. 2022;34(22):19751-19790. doi: 10.1007/s00521-022-07705-4. Epub 2022 Aug 30.
4
Decoding clinical biomarker space of COVID-19: Exploring matrix factorization-based feature selection methods.
Comput Biol Med. 2022 Jul;146:105426. doi: 10.1016/j.compbiomed.2022.105426. Epub 2022 Apr 5.
5
Developing Multiagent E-Learning System-Based Machine Learning and Feature Selection Techniques.
Comput Intell Neurosci. 2022 Jan 30;2022:2941840. doi: 10.1155/2022/2941840. eCollection 2022.
6
Zoo: Selecting Transcriptomic and Methylomic Biomarkers by Ensembling Animal-Inspired Swarm Intelligence Feature Selection Algorithms.
Genes (Basel). 2021 Nov 18;12(11):1814. doi: 10.3390/genes12111814.
7
A Tri-Stage Wrapper-Filter Feature Selection Framework for Disease Classification.
Sensors (Basel). 2021 Aug 18;21(16):5571. doi: 10.3390/s21165571.
8
Decoding Clinical Biomarker Space of COVID-19: Exploring Matrix Factorization-based Feature Selection Methods.
medRxiv. 2021 Jul 9:2021.07.07.21259699. doi: 10.1101/2021.07.07.21259699.