一种全球变化局部恒定的模型，用于融合来自多个不同专家的标签，而无需使用参考标签。

A globally-variant locally-constant model for fusion of labels from multiple diverse experts without using reference labels.

机构信息

Electrical Engineering Department, University of Southern California, 3740 McClintock Avenue, Los Angeles, CA 90089-2564, USA.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2013 Apr;35(4):769-83. doi: 10.1109/TPAMI.2012.139.

DOI:10.1109/TPAMI.2012.139

PMID:22732663

Abstract

Researchers have shown that fusion of categorical labels from multiple experts—humans or machine classifiers—improves the accuracy and generalizability of the overall classification system. Simple plurality is a popular technique for performing this fusion, but it gives equal importance to labels from all experts, who may not be equally reliable or consistent across the dataset. Estimation of expert reliability without knowing the reference labels is, however, a challenging problem. Most previous works deal with these challenges by modeling expert reliability as constant over the entire data (feature) space. This paper presents a model based on the consideration that in dealing with real-world data, expert reliability is variable over the complete feature space but constant over local clusters of homogeneous instances. This model jointly learns a classifier and expert reliability parameters without assuming knowledge of the reference labels using the Expectation-Maximization (EM) algorithm. Classification experiments on simulated data, data from the UCI Machine Learning Repository, and two emotional speech classification datasets show the benefits of the proposed model. Using a metric based on the Jensen-Shannon divergence, we empirically show that the proposed model gives greater benefit for datasets where expert reliability is highly variable over the feature space.

摘要

研究人员已经表明，融合来自多个专家的分类标签——人类或机器分类器——可以提高整体分类系统的准确性和泛化能力。简单多数是执行这种融合的一种流行技术，但它对所有专家的标签同等重视，而这些专家在整个数据集上可能并不具有同等的可靠性或一致性。然而，在不知道参考标签的情况下估计专家的可靠性是一个具有挑战性的问题。大多数先前的工作通过将专家可靠性建模为在整个数据（特征）空间上保持不变来应对这些挑战。本文提出了一种模型，其考虑到在处理真实世界的数据时，专家可靠性在整个特征空间上是可变的，但在同质实例的局部聚类上是不变的。该模型使用期望最大化（EM）算法，在不假设参考标签知识的情况下，联合学习分类器和专家可靠性参数。在模拟数据、UCI 机器学习知识库中的数据以及两个情感语音分类数据集上的分类实验表明了该模型的优势。使用基于 Jensen-Shannon 散度的度量，我们从经验上证明，对于专家可靠性在特征空间上高度变化的数据集，所提出的模型带来了更大的好处。

相似文献

A globally-variant locally-constant model for fusion of labels from multiple diverse experts without using reference labels.一种全球变化局部恒定的模型，用于融合来自多个不同专家的标签，而无需使用参考标签。

IEEE Trans Pattern Anal Mach Intell. 2013 Apr;35(4):769-83. doi: 10.1109/TPAMI.2012.139.

Medical Dataset Classification: A Machine Learning Paradigm Integrating Particle Swarm Optimization with Extreme Learning Machine Classifier.医学数据集分类：一种将粒子群优化与极限学习机分类器相结合的机器学习范式。

ScientificWorldJournal. 2015;2015:418060. doi: 10.1155/2015/418060. Epub 2015 Sep 30.

CARSVM: a class association rule-based classification framework and its application to gene expression data.CARSVM：一种基于类关联规则的分类框架及其在基因表达数据中的应用。

Artif Intell Med. 2008 Sep;44(1):7-25. doi: 10.1016/j.artmed.2008.05.002. Epub 2008 Jun 30.

Performance-based classifier combination in atlas-based image segmentation using expectation-maximization parameter estimation.基于期望最大化参数估计的基于图谱的图像分割中基于性能的分类器组合

IEEE Trans Med Imaging. 2004 Aug;23(8):983-94. doi: 10.1109/TMI.2004.830803.

A constraint-based evolutionary learning approach to the expectation maximization for optimal estimation of the hidden Markov model for speech signal modeling.一种基于约束的进化学习方法，用于语音信号建模的隐马尔可夫模型最优估计的期望最大化。

IEEE Trans Syst Man Cybern B Cybern. 2009 Feb;39(1):182-97. doi: 10.1109/TSMCB.2008.2004051. Epub 2008 Dec 9.

Mixture classification model based on clinical markers for breast cancer prognosis.基于临床标志物的乳腺癌预后混合分类模型。

Artif Intell Med. 2010 Feb-Mar;48(2-3):129-37. doi: 10.1016/j.artmed.2009.07.008. Epub 2009 Dec 14.

Relabeling algorithm for retrieval of noisy instances and improving prediction quality.重新标记算法用于检索有噪声的实例并提高预测质量。

Comput Biol Med. 2010 Mar;40(3):288-99. doi: 10.1016/j.compbiomed.2009.12.005. Epub 2010 Jan 25.

An extended EM algorithm for joint feature extraction and classification in brain-computer interfaces.一种用于脑机接口中联合特征提取与分类的扩展期望最大化算法。

Neural Comput. 2006 Nov;18(11):2730-61. doi: 10.1162/neco.2006.18.11.2730.

Switching between selection and fusion in combining classifiers: an experiment.

IEEE Trans Syst Man Cybern B Cybern. 2002;32(2):146-56. doi: 10.1109/3477.990871.

A new local-global approach for classification.一种新的局部-全局分类方法。

Neural Netw. 2010 Sep;23(7):887-91. doi: 10.1016/j.neunet.2010.04.010. Epub 2010 May 5.

引用本文的文献

Selecting optimal software code descriptors-The case of Java.选择最佳的软件代码描述符——以 Java 为例。

PLoS One. 2024 Nov 1;19(11):e0310840. doi: 10.1371/journal.pone.0310840. eCollection 2024.

Modeling multiple time series annotations as noisy distortions of the ground truth: An Expectation-Maximization approach.将多个时间序列注释建模为真实情况的噪声失真：一种期望最大化方法。

IEEE Trans Affect Comput. 2018 Jan-Mar;9(1):76-89. doi: 10.1109/TAFFC.2016.2592918. Epub 2016 Jul 19.

Applying machine learning to facilitate autism diagnostics: pitfalls and promises.应用机器学习促进自闭症诊断：陷阱与前景。

J Autism Dev Disord. 2015 May;45(5):1121-36. doi: 10.1007/s10803-014-2268-6.

Behavioral Signal Processing: Deriving Human Behavioral Informatics From Speech and Language: Computational techniques are presented to analyze and model expressed and perceived human behavior-variedly characterized as typical, atypical, distressed, and disordered-from speech and language cues and their applications in health, commerce, education, and beyond.行为信号处理：从语音和语言中提取人类行为信息学：本文介绍了计算技术，用于从语音和语言线索中分析和建模所表达和感知到的人类行为——这些行为具有典型、非典型、苦恼和紊乱等不同特征——及其在健康、商业、教育等领域的应用。

Proc IEEE Inst Electr Electron Eng. 2013 Feb 7;101(5):1203-1233. doi: 10.1109/JPROC.2012.2236291.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种全球变化局部恒定的模型，用于融合来自多个不同专家的标签，而无需使用参考标签。

A globally-variant locally-constant model for fusion of labels from multiple diverse experts without using reference labels.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献