• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

混合增量集成学习在噪声实际数据分类中的应用。

Hybrid Incremental Ensemble Learning for Noisy Real-World Data Classification.

出版信息

IEEE Trans Cybern. 2019 Feb;49(2):403-416. doi: 10.1109/TCYB.2017.2774266. Epub 2017 Dec 4.

DOI:10.1109/TCYB.2017.2774266
PMID:29990215
Abstract

Traditional ensemble learning approaches explore the feature space and the sample space, respectively, which will prevent them to construct more powerful learning models for noisy real-world dataset classification. The random subspace method only search for the selection of features. Meanwhile, the bagging approach only search for the selection of samples. To overcome these limitations, we propose the hybrid incremental ensemble learning (HIEL) approach which takes into consideration the feature space and the sample space simultaneously to handle noisy dataset. Specifically, HIEL first adopts the bagging technique and linear discriminant analysis to remove noisy attributes, and generates a set of bootstraps and the corresponding ensemble members in the subspaces. Then, the classifiers are selected incrementally based on a classifier-specific criterion function and an ensemble criterion function. The corresponding weights for the classifiers are assigned during the same process. Finally, the final label is summarized by a weighted voting scheme, which serves as the final result of the classification. We also explore various classifier-specific criterion functions based on different newly proposed similarity measures, which will alleviate the effect of noisy samples on the distance functions. In addition, the computational cost of HIEL is analyzed theoretically. A set of nonparametric tests are adopted to compare HIEL and other algorithms over several datasets. The experiment results show that HIEL performs well on the noisy datasets. HIEL outperforms most of the compared classifier ensemble methods on 14 out of 24 noisy real-world UCI and KEEL datasets.

摘要

传统的集成学习方法分别探索特征空间和样本空间,这将阻止它们为嘈杂的现实世界数据集分类构建更强大的学习模型。随机子空间方法仅搜索特征的选择。同时,装袋方法仅搜索样本的选择。为了克服这些限制,我们提出了混合增量集成学习 (HIEL) 方法,该方法同时考虑特征空间和样本空间,以处理嘈杂的数据集。具体来说,HIEL 首先采用装袋技术和线性判别分析来去除嘈杂属性,并在子空间中生成一组引导和相应的集成成员。然后,根据特定于分类器的准则函数和集成准则函数,逐步选择分类器。在同一过程中,为分类器分配相应的权重。最后,通过加权投票方案总结最终标签,作为分类的最终结果。我们还探索了各种基于新提出的相似性度量的特定于分类器的准则函数,这将减轻嘈杂样本对距离函数的影响。此外,还从理论上分析了 HIEL 的计算成本。采用了一组非参数检验来比较 HIEL 和其他算法在多个数据集上的性能。实验结果表明,HIEL 在嘈杂数据集上表现良好。在 24 个嘈杂的 UCI 和 KEEL 数据集的 14 个数据集上,HIEL 优于大多数比较的分类器集成方法。

相似文献

1
Hybrid Incremental Ensemble Learning for Noisy Real-World Data Classification.混合增量集成学习在噪声实际数据分类中的应用。
IEEE Trans Cybern. 2019 Feb;49(2):403-416. doi: 10.1109/TCYB.2017.2774266. Epub 2017 Dec 4.
2
Hybrid adaptive classifier ensemble.混合自适应分类器集成。
IEEE Trans Cybern. 2015 Feb;45(2):177-90. doi: 10.1109/TCYB.2014.2322195. Epub 2014 May 20.
3
Progressive Semisupervised Learning of Multiple Classifiers.多分类器的渐进式半监督学习。
IEEE Trans Cybern. 2018 Feb;48(2):689-702. doi: 10.1109/TCYB.2017.2651114. Epub 2017 Jan 19.
4
Hybrid k -Nearest Neighbor Classifier.混合 k-最近邻分类器。
IEEE Trans Cybern. 2016 Jun;46(6):1263-75. doi: 10.1109/TCYB.2015.2443857. Epub 2015 Jun 26.
5
Multiobjective Semisupervised Classifier Ensemble.多目标半监督分类器集成。
IEEE Trans Cybern. 2019 Jun;49(6):2280-2293. doi: 10.1109/TCYB.2018.2824299. Epub 2018 Apr 20.
6
Adaptive Semi-Supervised Classifier Ensemble for High Dimensional Data Classification.高维数据分类的自适应半监督分类器集成。
IEEE Trans Cybern. 2019 Feb;49(2):366-379. doi: 10.1109/TCYB.2017.2761908. Epub 2017 Oct 26.
7
A Machine Learning Ensemble Classifier for Early Prediction of Diabetic Retinopathy.机器学习集成分类器在糖尿病视网膜病变早期预测中的应用。
J Med Syst. 2017 Nov 9;41(12):201. doi: 10.1007/s10916-017-0853-x.
8
A New Kind of Nonparametric Test for Statistical Comparison of Multiple Classifiers Over Multiple Datasets.一种用于在多个数据集上对多个分类器进行统计比较的新型非参数检验方法。
IEEE Trans Cybern. 2017 Dec;47(12):4418-4431. doi: 10.1109/TCYB.2016.2611020. Epub 2016 Oct 3.
9
Improving Classification Performance through an Advanced Ensemble Based Heterogeneous Extreme Learning Machines.通过基于高级集成的异构极端学习机提高分类性能。
Comput Intell Neurosci. 2017;2017:3405463. doi: 10.1155/2017/3405463. Epub 2017 May 4.
10
Adaptive Fusion Based Method for Imbalanced Data Classification.基于自适应融合的不平衡数据分类方法
Front Neurorobot. 2022 Feb 28;16:827913. doi: 10.3389/fnbot.2022.827913. eCollection 2022.

引用本文的文献

1
Application of an improved LightGBM hybrid integration model combining gradient harmonization and Jacobian regularization for breast cancer diagnosis.一种结合梯度协调和雅可比正则化的改进型LightGBM混合集成模型在乳腺癌诊断中的应用。
Sci Rep. 2025 Jan 20;15(1):2569. doi: 10.1038/s41598-025-86014-x.