通过多模态扰动整合局部学习者。

Ensembling local learners through multimodal perturbation.

作者信息

Zhou Zhi-Hua, Yu Yang

机构信息

National Laboratory for Novel Software Technology, Nanjing University, Nanjing 210093, China.

出版信息

IEEE Trans Syst Man Cybern B Cybern. 2005 Aug;35(4):725-35. doi: 10.1109/tsmcb.2005.845396.

DOI:10.1109/tsmcb.2005.845396

PMID:16128456

Abstract

Ensemble learning algorithms train multiple component learners and then combine their predictions. In order to generate a strong ensemble, the component learners should be with high accuracy as well as high diversity. A popularly used scheme in generating accurate but diverse component learners is to perturb the training data with resampling methods, such as the bootstrap sampling used in bagging. However, such a scheme is not very effective on local learners such as nearest-neighbor classifiers because a slight change in training data can hardly result in local learners with big differences. In this paper, a new ensemble algorithm named Filtered Attribute Subspace based Bagging with Injected Randomness (FASBIR) is proposed for building ensembles of local learners, which utilizes multimodal perturbation to help generate accurate but diverse component learners. In detail, FASBIR employs the perturbation on the training data with bootstrap sampling, the perturbation on the input attributes with attribute filtering and attribute subspace selection, and the perturbation on the learning parameters with randomly configured distance metrics. A large empirical study shows that FASBIR is effective in building ensembles of nearest-neighbor classifiers, whose performance is better than that of many other ensemble algorithms.

摘要

集成学习算法训练多个组件学习器，然后组合它们的预测结果。为了生成一个强大的集成，组件学习器应该具有高精度和高多样性。在生成准确但多样的组件学习器时，一种常用的方案是使用重采样方法对训练数据进行扰动，例如在装袋法中使用的自助采样。然而，这种方案对局部学习器（如最近邻分类器）不是很有效，因为训练数据的微小变化很难导致局部学习器有很大差异。本文提出了一种名为基于注入随机性的过滤属性子空间装袋法（FASBIR）的新集成算法，用于构建局部学习器的集成，该算法利用多模态扰动来帮助生成准确但多样的组件学习器。具体来说，FASBIR对训练数据采用自助采样进行扰动，对输入属性采用属性过滤和属性子空间选择进行扰动，对学习参数采用随机配置的距离度量进行扰动。大量实证研究表明，FASBIR在构建最近邻分类器的集成方面是有效的，其性能优于许多其他集成算法。

相似文献

Ensembling local learners through multimodal perturbation.

IEEE Trans Syst Man Cybern B Cybern. 2005 Aug;35(4):725-35. doi: 10.1109/tsmcb.2005.845396.

Rotation forest: A new classifier ensemble method.

IEEE Trans Pattern Anal Mach Intell. 2006 Oct;28(10):1619-30. doi: 10.1109/TPAMI.2006.211.

Learning weighted metrics to minimize nearest-neighbor classification error.

IEEE Trans Pattern Anal Mach Intell. 2006 Jul;28(7):1100-10. doi: 10.1109/TPAMI.2006.145.

On visualization and aggregation of nearest neighbor classifiers.

IEEE Trans Pattern Anal Mach Intell. 2005 Oct;27(10):1592-602. doi: 10.1109/TPAMI.2005.204.

Kernel pooled local subspaces for classification.

IEEE Trans Syst Man Cybern B Cybern. 2005 Jun;35(3):489-502. doi: 10.1109/tsmcb.2005.846641.

Adaptive quasiconformal kernel nearest neighbor classification.

IEEE Trans Pattern Anal Mach Intell. 2004 May;26(5):656-61. doi: 10.1109/TPAMI.2004.1273978.

A comparison of decision tree ensemble creation techniques.

IEEE Trans Pattern Anal Mach Intell. 2007 Jan;29(1):173-80. doi: 10.1109/tpami.2007.250609.

Polynomial-time metrics for attributed trees.

IEEE Trans Pattern Anal Mach Intell. 2005 Jul;27(7):1087-99. doi: 10.1109/tpami.2005.146.

Indexing hierarchical structures using graph spectra.

IEEE Trans Pattern Anal Mach Intell. 2005 Jul;27(7):1125-40. doi: 10.1109/TPAMI.2005.142.

On utilizing search methods to select subspace dimensions for kernel-based nonlinear subspace classifiers.

IEEE Trans Pattern Anal Mach Intell. 2005 Jan;27(1):136-41. doi: 10.1109/TPAMI.2005.15.

引用本文的文献

High-accuracy prediction of transmembrane inter-helix contacts and application to GPCR 3D structure modeling.

Bioinformatics. 2013 Oct 15;29(20):2579-87. doi: 10.1093/bioinformatics/btt440. Epub 2013 Aug 14.

LabCaS: labeling calpain substrate cleavage sites from amino acid sequence using conditional random fields.

Proteins. 2013 Apr;81(4):622-34. doi: 10.1002/prot.24217. Epub 2012 Dec 24.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过多模态扰动整合局部学习者。

Ensembling local learners through multimodal perturbation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献