二元分类问题中多数投票性能的理论界限。

Theoretical bounds of majority voting performance for a binary classification problem.

作者信息

Narasimhamurthy Anand

机构信息

Department of Computer Science and Engineering, 341 IST Building, Pennsylvania State University, University Park, PA 16802, USA.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2005 Dec;27(12):1988-95. doi: 10.1109/TPAMI.2005.249.

DOI:10.1109/TPAMI.2005.249

PMID:16355665

Abstract

A number of earlier studies that have attempted a theoretical analysis of majority voting assume independence of the classifiers. We formulate the majority voting problem as an optimization problem with linear constraints. No assumptions on the independence of classifiers are made. For a binary classification problem, given the accuracies of the classifiers in the team, the theoretical upper and lower bounds for performance obtained by combining them through majority voting are shown to be solutions of the corresponding optimization problem. The objective function of the optimization problem is nonlinear in the case of an even number of classifiers when rejection is allowed, for the other cases the objective function is linear and hence the problem is a linear program (LP). Using the framework we provide some insights and investigate the relationship between two candidate classifier diversity measures and majority voting performance.

摘要

许多早期尝试对多数投票进行理论分析的研究都假定分类器是独立的。我们将多数投票问题表述为一个具有线性约束的优化问题。没有对分类器的独立性做出任何假设。对于二元分类问题，给定团队中分类器的准确率，通过多数投票组合这些分类器所获得的性能的理论上限和下限被证明是相应优化问题的解。当允许拒绝时，在分类器数量为偶数的情况下，优化问题的目标函数是非线性的，对于其他情况，目标函数是线性的，因此该问题是一个线性规划（LP）。利用这个框架，我们提供了一些见解，并研究了两种候选分类器多样性度量与多数投票性能之间的关系。

相似文献

Theoretical bounds of majority voting performance for a binary classification problem.二元分类问题中多数投票性能的理论界限。

IEEE Trans Pattern Anal Mach Intell. 2005 Dec;27(12):1988-95. doi: 10.1109/TPAMI.2005.249.

Performance-based classifier combination in atlas-based image segmentation using expectation-maximization parameter estimation.基于期望最大化参数估计的基于图谱的图像分割中基于性能的分类器组合

IEEE Trans Med Imaging. 2004 Aug;23(8):983-94. doi: 10.1109/TMI.2004.830803.

Tensor voting for image correction by global and local intensity alignment.通过全局和局部强度对齐进行张量投票以校正图像

IEEE Trans Pattern Anal Mach Intell. 2005 Jan;27(1):36-50. doi: 10.1109/TPAMI.2005.20.

A shape-from-shading method of polyhedral objects using prior information.一种利用先验信息的多面体物体明暗形状恢复方法。

IEEE Trans Pattern Anal Mach Intell. 2006 Apr;28(4):612-24. doi: 10.1109/TPAMI.2006.67.

Simultaneous two-view epipolar geometry estimation and motion segmentation by 4D tensor voting.基于四维张量投票的同步双视图对极几何估计与运动分割

IEEE Trans Pattern Anal Mach Intell. 2004 Sep;26(9):1167-84. doi: 10.1109/TPAMI.2004.72.

Reduced complexity rotation invariant texture classification using a blind deconvolution approach.使用盲反卷积方法的低复杂度旋转不变纹理分类

IEEE Trans Pattern Anal Mach Intell. 2006 Jan;28(1):145-9. doi: 10.1109/TPAMI.2006.24.

Subclass problem-dependent design for error-correcting output codes.用于纠错输出码的子类问题相关设计。

IEEE Trans Pattern Anal Mach Intell. 2008 Jun;30(6):1041-54. doi: 10.1109/TPAMI.2008.38.

Ensemble tracking.集成跟踪

IEEE Trans Pattern Anal Mach Intell. 2007 Feb;29(2):261-71. doi: 10.1109/TPAMI.2007.35.

BoostMap: an embedding method for efficient nearest neighbor retrieval.BoostMap：一种用于高效最近邻检索的嵌入方法。

IEEE Trans Pattern Anal Mach Intell. 2008 Jan;30(1):89-104. doi: 10.1109/TPAMI.2007.1140.

Topology-preserving tissue classification of magnetic resonance brain images.磁共振脑图像的拓扑保持组织分类

IEEE Trans Med Imaging. 2007 Apr;26(4):487-96. doi: 10.1109/TMI.2007.893283.

引用本文的文献

A probabilistic approach for building disease phenotypes across electronic health records.一种基于电子健康记录构建疾病表型的概率方法。

BioData Min. 2025 Jun 11;18(1):39. doi: 10.1186/s13040-025-00454-9.

Detecting cognitive impairment in cerebrovascular disease using gait, dual tasks, and machine learning.利用步态、双重任务和机器学习检测脑血管疾病中的认知障碍。

BMC Med Inform Decis Mak. 2025 Apr 1;25(1):157. doi: 10.1186/s12911-025-02979-9.

Swing-phase detection of locomotive mode transitions for smooth multi-functional robotic lower-limb prosthesis control.用于平滑多功能机器人下肢假肢控制的运动模式转换的摆动阶段检测。

Front Robot AI. 2024 Apr 12;11:1267072. doi: 10.3389/frobt.2024.1267072. eCollection 2024.

GACEM: Genetic Algorithm Based Classifier Ensemble in a Multi-sensor System.GACEM：多传感器系统中基于遗传算法的分类器集成

Sensors (Basel). 2008 Oct 1;8(10):6203-6224. doi: 10.3390/s8106203.

The use of genetic programming in the analysis of quantitative gene expression profiles for identification of nodal status in bladder cancer.基因编程在分析定量基因表达谱以确定膀胱癌淋巴结状态中的应用。

BMC Cancer. 2006 Jun 16;6:159. doi: 10.1186/1471-2407-6-159.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

二元分类问题中多数投票性能的理论界限。

Theoretical bounds of majority voting performance for a binary classification problem.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献