• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于BPSO辅助粗糙超长方体方法的大规模元启发式特征选择

Large-Scale Meta-Heuristic Feature Selection Based on BPSO Assisted Rough Hypercuboid Approach.

作者信息

Luo Chuan, Wang Sizhao, Li Tianrui, Chen Hongmei, Lv Jiancheng, Yi Zhang

出版信息

IEEE Trans Neural Netw Learn Syst. 2023 Dec;34(12):10889-10903. doi: 10.1109/TNNLS.2022.3171614. Epub 2023 Nov 30.

DOI:10.1109/TNNLS.2022.3171614
PMID:35552142
Abstract

The selection of prominent features for building more compact and efficient models is an important data preprocessing task in the field of data mining. The rough hypercuboid approach is an emerging technique that can be applied to eliminate irrelevant and redundant features, especially for the inexactness problem in approximate numerical classification. By integrating the meta-heuristic-based evolutionary search technique, a novel global search method for numerical feature selection is proposed in this article based on the hybridization of the rough hypercuboid approach and binary particle swarm optimization (BPSO) algorithm, namely RH-BPSO. To further alleviate the issue of high computational cost when processing large-scale datasets, parallelization approaches for calculating the hybrid feature evaluation criteria are presented by decomposing and recombining hypercuboid equivalence partition matrix via horizontal data partitioning. A distributed meta-heuristic optimized rough hypercuboid feature selection (DiRH-BPSO) algorithm is thus developed and embedded in the Apache Spark cloud computing model. Extensive experimental results indicate that RH-BPSO is promising and can significantly outperform the other representative feature selection algorithms in terms of classification accuracy, the cardinality of the selected feature subset, and execution efficiency. Moreover, experiments on distributed-memory multicore clusters show that DiRH-BPSO is significantly faster than its sequential counterpart and is perfectly capable of completing large-scale feature selection tasks that fail on a single node due to memory constraints. Parallel scalability and extensibility analysis also demonstrate that DiRH-BPSO could scale out and extend well with the growth of computational nodes and the volume of data.

摘要

选择显著特征以构建更紧凑、高效的模型是数据挖掘领域一项重要的数据预处理任务。粗糙超长方体方法是一种新兴技术,可用于消除不相关和冗余特征,尤其适用于近似数值分类中的不精确性问题。通过集成基于元启发式的进化搜索技术,本文基于粗糙超长方体方法与二进制粒子群优化(BPSO)算法的融合,提出了一种用于数值特征选择的新型全局搜索方法,即RH-BPSO。为了进一步缓解处理大规模数据集时计算成本高的问题,通过水平数据分区对超长方体等价划分矩阵进行分解和重组,提出了计算混合特征评估标准的并行化方法。由此开发了一种分布式元启发式优化粗糙超长方体特征选择(DiRH-BPSO)算法,并将其嵌入到Apache Spark云计算模型中。大量实验结果表明,RH-BPSO很有前景,在分类准确率、所选特征子集的基数和执行效率方面,能显著优于其他代表性特征选择算法。此外,在分布式内存多核集群上的实验表明,DiRH-BPSO比其顺序对应算法快得多,并且完全能够完成由于内存限制在单个节点上失败的大规模特征选择任务。并行可扩展性和扩展性分析还表明,DiRH-BPSO能够随着计算节点的增加和数据量的增长很好地进行扩展和延伸。

相似文献

1
Large-Scale Meta-Heuristic Feature Selection Based on BPSO Assisted Rough Hypercuboid Approach.基于BPSO辅助粗糙超长方体方法的大规模元启发式特征选择
IEEE Trans Neural Netw Learn Syst. 2023 Dec;34(12):10889-10903. doi: 10.1109/TNNLS.2022.3171614. Epub 2023 Nov 30.
2
Gene selection using hybrid binary black hole algorithm and modified binary particle swarm optimization.使用混合二进制黑洞算法和改进二进制粒子群优化的基因选择。
Genomics. 2019 Jul;111(4):669-686. doi: 10.1016/j.ygeno.2018.04.004. Epub 2018 Apr 14.
3
Reflective Distributed Denial of Service Detection: A Novel Model Utilizing Binary Particle Swarm Optimization-Simulated Annealing for Feature Selection and Gray Wolf Optimization-Optimized LightGBM Algorithm.反射式分布式拒绝服务检测:一种利用二进制粒子群优化-模拟退火进行特征选择以及灰狼优化-优化的LightGBM算法的新型模型
Sensors (Basel). 2024 Sep 24;24(19):6179. doi: 10.3390/s24196179.
4
Multiswarm heterogeneous binary PSO using win-win approach for improved feature selection in liver and kidney disease diagnosis.基于双赢策略的多群异质二进制粒子群优化算法在肝肾病诊断中特征选择的改进。
Comput Med Imaging Graph. 2018 Dec;70:135-154. doi: 10.1016/j.compmedimag.2018.10.003. Epub 2018 Oct 17.
5
missForest with feature selection using binary particle swarm optimization improves the imputation accuracy of continuous data.使用二进制粒子群优化进行特征选择的 missForest 提高了连续数据的插补准确性。
Genes Genomics. 2022 Jun;44(6):651-658. doi: 10.1007/s13258-022-01247-8. Epub 2022 Apr 6.
6
An Innovative Excited-ACS-IDGWO Algorithm for Optimal Biomedical Data Feature Selection.一种创新的基于激发 ACS-IDGWO 算法的最优生物医学数据特征选择方法。
Biomed Res Int. 2020 Aug 17;2020:8506365. doi: 10.1155/2020/8506365. eCollection 2020.
7
Rough Hypercuboid Based Generalized and Robust IT2 Fuzzy C-Means Algorithm.基于粗糙超长方体的广义鲁棒型区间二型模糊C均值算法
IEEE Trans Cybern. 2021 Jul;51(7):3641-3652. doi: 10.1109/TCYB.2019.2925130. Epub 2021 Jun 23.
8
RST-DE: Rough Sets-Based New Differential Evolution Algorithm for Scalable Big Data Feature Selection in Distributed Computing Platforms.基于粗糙集的新差分进化算法在分布式计算平台中的可扩展大数据特征选择。
Big Data. 2022 Aug;10(4):356-367. doi: 10.1089/big.2021.0267. Epub 2022 May 4.
9
CCFS: A Confidence-Based Cost-Effective Feature Selection Scheme for Healthcare Data Classification.CCFS:一种基于置信度的医疗保健数据分类的成本效益特征选择方案。
IEEE/ACM Trans Comput Biol Bioinform. 2021 May-Jun;18(3):902-911. doi: 10.1109/TCBB.2019.2903804. Epub 2021 Jun 3.
10
Particle Swarm Optimization approach to defect detection in armour ceramics.用于装甲陶瓷缺陷检测的粒子群优化方法。
Ultrasonics. 2017 Mar;75:124-131. doi: 10.1016/j.ultras.2016.07.008. Epub 2016 Jul 18.