Suppr超能文献

学习带有软标签信息的分类模型。

Learning classification models with soft-label information.

机构信息

Computer Science Department, University of Pittsburgh, Pittsburgh, Pennsylvania, USA.

出版信息

J Am Med Inform Assoc. 2014 May-Jun;21(3):501-8. doi: 10.1136/amiajnl-2013-001964. Epub 2013 Nov 20.

Abstract

OBJECTIVE

Learning of classification models in medicine often relies on data labeled by a human expert. Since labeling of clinical data may be time-consuming, finding ways of alleviating the labeling costs is critical for our ability to automatically learn such models. In this paper we propose a new machine learning approach that is able to learn improved binary classification models more efficiently by refining the binary class information in the training phase with soft labels that reflect how strongly the human expert feels about the original class labels.

MATERIALS AND METHODS

Two types of methods that can learn improved binary classification models from soft labels are proposed. The first relies on probabilistic/numeric labels, the other on ordinal categorical labels. We study and demonstrate the benefits of these methods for learning an alerting model for heparin induced thrombocytopenia. The experiments are conducted on the data of 377 patient instances labeled by three different human experts. The methods are compared using the area under the receiver operating characteristic curve (AUC) score.

RESULTS

Our AUC results show that the new approach is capable of learning classification models more efficiently compared to traditional learning methods. The improvement in AUC is most remarkable when the number of examples we learn from is small.

CONCLUSIONS

A new classification learning framework that lets us learn from auxiliary soft-label information provided by a human expert is a promising new direction for learning classification models from expert labels, reducing the time and cost needed to label data.

摘要

目的

医学领域的分类模型学习通常依赖于人类专家标记的数据。由于临床数据的标记可能很耗时,因此寻找减轻标记成本的方法对于我们自动学习此类模型的能力至关重要。在本文中,我们提出了一种新的机器学习方法,该方法能够通过在训练阶段使用软标签来改进二进制分类模型,这些软标签反映了人类专家对原始类标签的强烈感受,从而更有效地学习二进制分类模型。

材料与方法

提出了两种可从软标签中学习改进的二进制分类模型的方法。第一种方法依赖于概率/数值标签,另一种方法依赖于有序分类标签。我们研究并展示了这些方法在学习肝素诱导的血小板减少症警报模型中的优势。该实验在由三位不同的人类专家标记的 377 个患者实例的数据上进行。使用接收者操作特征曲线(AUC)得分来比较这些方法。

结果

我们的 AUC 结果表明,与传统学习方法相比,新方法能够更有效地学习分类模型。当我们要学习的示例数量较少时,AUC 的提高最为显著。

结论

一种新的分类学习框架,允许我们从人类专家提供的辅助软标签信息中学习,这是从专家标签学习分类模型的一个很有前途的新方向,可以减少标记数据所需的时间和成本。

相似文献

1
Learning classification models with soft-label information.学习带有软标签信息的分类模型。
J Am Med Inform Assoc. 2014 May-Jun;21(3):501-8. doi: 10.1136/amiajnl-2013-001964. Epub 2013 Nov 20.
4
Learning classification models from multiple experts.从多个专家处学习分类模型。
J Biomed Inform. 2013 Dec;46(6):1125-35. doi: 10.1016/j.jbi.2013.08.007. Epub 2013 Sep 13.
7
Learning classification with auxiliary probabilistic information.利用辅助概率信息进行学习分类。
Proc IEEE Int Conf Data Min. 2011;2011:477-486. doi: 10.1109/ICDM.2011.84.

引用本文的文献

1
Hierarchical Active Learning with Label Proportions on Data Regions.基于数据区域标签比例的分层主动学习
IEEE Trans Knowl Data Eng. 2024 Dec;36(12):8434-8446. doi: 10.1109/tkde.2024.3419588.
4
American Society of Retina Specialists Artificial Intelligence Task Force Report.美国视网膜专家协会人工智能特别工作组报告。
J Vitreoretin Dis. 2024 Apr 20;8(4):373-380. doi: 10.1177/24741264241247602. eCollection 2024 Jul-Aug.
6
Hierarchical Active Learning With Qualitative Feedback on Regions.基于区域定性反馈的分层主动学习
IEEE Trans Hum Mach Syst. 2023 Jun;53(3):581-589. doi: 10.1109/thms.2023.3252815. Epub 2023 Mar 23.
8
10
Hierarchical Active Learning with Overlapping Regions.具有重叠区域的分层主动学习
Proc ACM Int Conf Inf Knowl Manag. 2020 Oct;2020:1045-1054. doi: 10.1145/3340531.3412022.

本文引用的文献

2
Outlier detection for patient monitoring and alerting.患者监测和报警的异常值检测。
J Biomed Inform. 2013 Feb;46(1):47-55. doi: 10.1016/j.jbi.2012.08.004. Epub 2012 Aug 27.
3
A Pattern Mining Approach for Classifying Multivariate Temporal Data.一种用于对多变量时态数据进行分类的模式挖掘方法。
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2011 Nov 12;2011:358-365. doi: 10.1109/BIBM.2011.39.
7
Support vector ordinal regression.支持向量序数回归
Neural Comput. 2007 Mar;19(3):792-815. doi: 10.1162/neco.2007.19.3.792.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验