• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用噪声校正提高众包标签质量。

Improving Crowdsourced Label Quality Using Noise Correction.

出版信息

IEEE Trans Neural Netw Learn Syst. 2018 May;29(5):1675-1688. doi: 10.1109/TNNLS.2017.2677468. Epub 2017 Mar 22.

DOI:10.1109/TNNLS.2017.2677468
PMID:28333645
Abstract

Crowdsourcing systems provide a cost effective and convenient way to collect labels, but they often fail to guarantee the quality of the labels. This paper proposes a novel framework that introduces noise correction techniques to further improve the quality of integrated labels that are inferred from the multiple noisy labels of objects. In the proposed general framework, information about the qualities of labelers estimated by a front-end ground truth inference algorithm is utilized to supervise subsequent label noise filtering and correction. The framework uses a novel algorithm termed adaptive voting noise correction (AVNC) to precisely identify and correct the potential noisy labels. After filtering out the instances with noisy labels, the remaining cleansed data set is used to create multiple weak classifiers, based on which a powerful ensemble classifier is induced to correct these noises. Experimental results on eight simulated data sets with different kinds of features and two real-world crowdsourcing data sets in different domains consistently show that: 1) the proposed framework can improve label quality regardless of inference algorithms, especially under the circumstance that each instance has a few repeated labels and 2) since the proposed AVNC algorithm considers both the number of and the probability of potential label noises, it outperforms the state-of-the-art noise correction algorithms.

摘要

众包系统提供了一种经济有效的方法来收集标签,但它们往往无法保证标签的质量。本文提出了一种新颖的框架,该框架引入了噪声校正技术,以进一步提高从多个对象的嘈杂标签推断出的综合标签的质量。在所提出的通用框架中,利用前端真实推断算法估计的关于标注者质量的信息来监督后续的标签噪声过滤和校正。该框架使用一种称为自适应投票噪声校正 (AVNC) 的新颖算法来精确识别和校正潜在的嘈杂标签。过滤掉具有嘈杂标签的实例后,使用剩余的清理数据集创建多个弱分类器,然后基于这些弱分类器诱导出一个强大的集成分类器来纠正这些噪声。在具有不同特征的八个模拟数据集和两个不同领域的真实众包数据集上的实验结果一致表明:1) 无论推理算法如何,所提出的框架都可以提高标签质量,特别是在每个实例具有少量重复标签的情况下;2) 由于所提出的 AVNC 算法同时考虑了潜在标签噪声的数量和概率,因此它优于最先进的噪声校正算法。

相似文献

1
Improving Crowdsourced Label Quality Using Noise Correction.利用噪声校正提高众包标签质量。
IEEE Trans Neural Netw Learn Syst. 2018 May;29(5):1675-1688. doi: 10.1109/TNNLS.2017.2677468. Epub 2017 Mar 22.
2
Crowdsourced Label Aggregation Using Bilayer Collaborative Clustering.基于双层协同聚类的众包标签聚合。
IEEE Trans Neural Netw Learn Syst. 2019 Oct;30(10):3172-3185. doi: 10.1109/TNNLS.2018.2890148. Epub 2019 Jan 25.
3
Beyond Majority Voting: A Coarse-to-Fine Label Filtration for Heavily Noisy Labels.超越多数投票:一种用于严重噪声标签的粗到细标签过滤方法。
IEEE Trans Neural Netw Learn Syst. 2019 Dec;30(12):3774-3787. doi: 10.1109/TNNLS.2019.2899045. Epub 2019 Mar 15.
4
Learning With Auxiliary Less-Noisy Labels.辅助少噪声标签学习。
IEEE Trans Neural Netw Learn Syst. 2017 Jul;28(7):1716-1721. doi: 10.1109/TNNLS.2016.2546956. Epub 2016 Apr 6.
5
Learning From Crowds With Multiple Noisy Label Distribution Propagation.基于多噪声标签分布传播的众包学习
IEEE Trans Neural Netw Learn Syst. 2022 Nov;33(11):6558-6568. doi: 10.1109/TNNLS.2021.3082496. Epub 2022 Oct 27.
6
Label Consistency-Based Ground Truth Inference for Crowdsourcing.基于标签一致性的众包真值推断
IEEE Trans Neural Netw Learn Syst. 2025 May;36(5):9408-9421. doi: 10.1109/TNNLS.2024.3438680. Epub 2025 May 2.
7
Active Crowdsourcing for Multilabel Annotation.用于多标签标注的主动众包
IEEE Trans Neural Netw Learn Syst. 2024 Mar;35(3):3549-3559. doi: 10.1109/TNNLS.2022.3194022. Epub 2024 Feb 29.
8
Typicality- and instance-dependent label noise-combating: a novel framework for simulating and combating real-world noisy labels for endoscopic polyp classification.典型性和实例依赖的标签噪声对抗:一种用于模拟和对抗内镜息肉分类中现实世界噪声标签的新框架。
Vis Comput Ind Biomed Art. 2024 May 6;7(1):10. doi: 10.1186/s42492-024-00162-x.
9
Partial Multi-Label Learning With Noisy Label Identification.基于噪声标签识别的部分多标签学习
IEEE Trans Pattern Anal Mach Intell. 2022 Jul;44(7):3676-3687. doi: 10.1109/TPAMI.2021.3059290. Epub 2022 Jun 3.
10
Active learning with imbalanced multiple noisy labeling.基于不平衡多噪声标注的主动学习
IEEE Trans Cybern. 2015 May;45(5):1081-93. doi: 10.1109/TCYB.2014.2344674. Epub 2014 Aug 14.

引用本文的文献

1
nnSegNeXt: A 3D Convolutional Network for Brain Tissue Segmentation Based on Quality Evaluation.nnSegNeXt:基于质量评估的用于脑组织分割的3D卷积网络。
Bioengineering (Basel). 2024 Jun 6;11(6):575. doi: 10.3390/bioengineering11060575.
2
Methodology for the Detection of Contaminated Training Datasets for Machine Learning-Based Network Intrusion-Detection Systems.用于基于机器学习的网络入侵检测系统的受污染训练数据集检测方法。
Sensors (Basel). 2024 Jan 12;24(2):479. doi: 10.3390/s24020479.
3
Computer vision in surgery: from potential to clinical value.
外科手术中的计算机视觉:从潜力到临床价值。
NPJ Digit Med. 2022 Oct 28;5(1):163. doi: 10.1038/s41746-022-00707-5.