Rotation forest: A new classifier ensemble method.

Authors

Rodríguez Juan J, Kuncheva Ludmila I, Alonso Carlos J

Affiliation

Escuela Politécnica Superior, Edificio C, Universidad de Burgos, c/ Francisco de Vitoria s/n, 09006 Burgos, Spain.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2006 Oct;28(10):1619-30. doi: 10.1109/TPAMI.2006.211.

PMID:16986543

Abstract

We propose a method for generating classifier ensembles based on feature extraction. To create the training data for a base classifier, the feature set is randomly split into K subsets (K is a parameter of the algorithm) and Principal Component Analysis (PCA) is applied to each subset. All principal components are retained in order to preserve the variability information in the data. Thus, K axis rotations take place to form the new features for a base classifier. The idea of the rotation approach is to encourage simultaneously individual accuracy and diversity within the ensemble. Diversity is promoted through the feature extraction for each base classifier. Decision trees were chosen here because they are sensitive to rotation of the feature axes, hence the name "forest." Accuracy is sought by keeping all principal components and also using the whole data set to train each base classifier. Using WEKA, we examined the Rotation Forest ensemble on a random selection of 33 benchmark data sets from the UCI repository and compared it with Bagging, AdaBoost, and Random Forest. The results were favorable to Rotation Forest and prompted an investigation into the diversity-accuracy landscape of the ensemble models. Diversity-error diagrams revealed that Rotation Forest ensembles construct individual classifiers which are more accurate than those in AdaBoost and Random Forest, and more diverse than those in Bagging, sometimes more accurate as well.
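The rotation step described in the abstract can be illustrated with a minimal sketch. It follows only what the abstract states (random split of the features into K subsets, PCA on each subset, all components kept) and is not the authors' WEKA implementation; any further details of the full paper are not reproduced here. The function names `rotation_matrix` and `rotated_training_sets`, the use of NumPy, and the synthetic data are assumptions made for illustration.

```python
import numpy as np

def rotation_matrix(X, K, rng):
    """Build one rotation matrix: random feature split + PCA per subset."""
    n_features = X.shape[1]
    # Randomly split the feature indices into K (nearly equal) subsets.
    subsets = np.array_split(rng.permutation(n_features), K)
    R = np.zeros((n_features, n_features))
    for idx in subsets:
        if idx.size == 0:  # can only happen if K > n_features
            continue
        # PCA on this subset: eigenvectors of its covariance matrix.
        cov = np.atleast_2d(np.cov(X[:, idx], rowvar=False))
        _, vecs = np.linalg.eigh(cov)
        # Keep ALL components, so each block is a full orthogonal matrix.
        R[np.ix_(idx, idx)] = vecs
    return R

def rotated_training_sets(X, K, n_classifiers, seed=0):
    """One rotation (and one rotated copy of the whole data set) per base classifier."""
    rng = np.random.default_rng(seed)
    rotations = [rotation_matrix(X, K, rng) for _ in range(n_classifiers)]
    return rotations, [X @ R for R in rotations]

# Synthetic data, for illustration only: 50 samples, 6 features.
X = np.random.default_rng(42).normal(size=(50, 6))
rotations, rotated = rotated_training_sets(X, K=3, n_classifiers=5)
# Each base classifier (a decision tree in the paper) would then be trained
# on its own rotated[i], and test samples would be mapped with rotations[i].
```

Because every principal component is kept, each matrix is orthogonal: the data are only re-expressed along new axes, so no variability is discarded, while different random splits give each tree a different view of the same data.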

Similar articles

Cancer classification using Rotation Forest.

Comput Biol Med. 2008 May;38(5):601-10. doi: 10.1016/j.compbiomed.2008.02.007. Epub 2008 Apr 3.

Evaluation of stability of k-means cluster ensembles with respect to random initialization.

IEEE Trans Pattern Anal Mach Intell. 2006 Nov;28(11):1798-808. doi: 10.1109/TPAMI.2006.226.

Random subspace ensembles for FMRI classification.

IEEE Trans Med Imaging. 2010 Feb;29(2):531-42. doi: 10.1109/TMI.2009.2037756.

On feature extraction via kernels.

IEEE Trans Syst Man Cybern B Cybern. 2008 Apr;38(2):553-7. doi: 10.1109/TSMCB.2007.913604.

A theoretical analysis of bagging as a linear combination of classifiers.

IEEE Trans Pattern Anal Mach Intell. 2008 Jul;30(7):1293-9. doi: 10.1109/TPAMI.2008.30.

Learning weighted metrics to minimize nearest-neighbor classification error.

IEEE Trans Pattern Anal Mach Intell. 2006 Jul;28(7):1100-10. doi: 10.1109/TPAMI.2006.145.

LESS: a model-based classifier for sparse subspaces.

IEEE Trans Pattern Anal Mach Intell. 2005 Sep;27(9):1496-500. doi: 10.1109/TPAMI.2005.182.

Automated variable weighting in k-means type clustering.

IEEE Trans Pattern Anal Mach Intell. 2005 May;27(5):657-68. doi: 10.1109/TPAMI.2005.95.

Convergence and application of online active sampling using orthogonal pillar vectors.

IEEE Trans Pattern Anal Mach Intell. 2004 Sep;26(9):1197-207. doi: 10.1109/TPAMI.2004.61.

Cited by

Motor Imagery EEG Classification Based on Multi-Domain Feature Rotation and Stacking Ensemble.

Brain Sci. 2025 Jan 7;15(1):50. doi: 10.3390/brainsci15010050.

Combining Postural Sway Parameters and Machine Learning to Assess Biomechanical Risk Associated with Load-Lifting Activities.

Diagnostics (Basel). 2025 Jan 4;15(1):105. doi: 10.3390/diagnostics15010105.

Language task-based fMRI analysis using machine learning and deep learning.

Front Radiol. 2024 Nov 27;4:1495181. doi: 10.3389/fradi.2024.1495181. eCollection 2024.

Machine Learning-Driven Methods for Nanobody Affinity Prediction.

ACS Omega. 2024 Nov 19;9(48):47893-47902. doi: 10.1021/acsomega.4c09718. eCollection 2024 Dec 3.

Daily river flow simulation using ensemble disjoint aggregating M5-Prime model.

Heliyon. 2024 Sep 30;10(20):e37965. doi: 10.1016/j.heliyon.2024.e37965. eCollection 2024 Oct 30.

Electrical impedance tomography image reconstruction for lung monitoring based on ensemble learning algorithms.

Healthc Technol Lett. 2024 Apr 30;11(5):271-282. doi: 10.1049/htl2.12085. eCollection 2024 Oct.

Harnessing the power of artificial intelligence for human living organoid research.

Bioact Mater. 2024 Aug 30;42:140-164. doi: 10.1016/j.bioactmat.2024.08.027. eCollection 2024 Dec.

Metabolomics Unveils Disrupted Pathways in Parkinson's Disease: Toward Biomarker-Based Diagnosis.

ACS Chem Neurosci. 2024 Sep 4;15(17):3168-3180. doi: 10.1021/acschemneuro.4c00355. Epub 2024 Aug 23.

iCRBP-LKHA: Large convolutional kernel and hybrid channel-spatial attention for identifying circRNA-RBP interaction sites.

PLoS Comput Biol. 2024 Aug 22;20(8):e1012399. doi: 10.1371/journal.pcbi.1012399. eCollection 2024 Aug.

Presurgery and postsurgery: advancements in artificial intelligence and machine learning models for enhancing patient management in infective endocarditis.

Int J Surg. 2024 Nov 1;110(11):7202-7214. doi: 10.1097/JS9.0000000000002003.
