• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于众包的领域加权多数投票

Domain-Weighted Majority Voting for Crowdsourcing.

作者信息

Tao Dapeng, Cheng Jun, Yu Zhengtao, Yue Kun, Wang Lizhen

出版信息

IEEE Trans Neural Netw Learn Syst. 2019 Jan;30(1):163-174. doi: 10.1109/TNNLS.2018.2836969. Epub 2018 Jun 5.

DOI:10.1109/TNNLS.2018.2836969
PMID:29994339
Abstract

Crowdsourcing labeling systems provide an efficient way to generate multiple inaccurate labels for given observations. If the competence level or the "reputation," which can be explained as the probabilities of annotating the right label, for each crowdsourcing annotators is equal and biased to annotate the right label, majority voting (MV) is the optimal decision rule for merging the multiple labels into a single reliable one. However, in practice, the competence levels of annotators employed by the crowdsourcing labeling systems are often diverse very much. In these cases, weighted MV is more preferred. The weights should be determined by the competence levels. However, since the annotators are anonymous and the ground-truth labels are usually unknown, it is hard to compute the competence levels of the annotators directly. In this paper, we propose to learn the weights for weighted MV by exploiting the expertise of annotators. Specifically, we model the domain knowledge of different annotators with different distributions and treat the crowdsourcing problem as a domain adaptation problem. The annotators provide labels to the source domains and the target domain is assumed to be associated with the ground-truth labels. The weights are obtained by matching the source domains with the target domain. Although the target-domain labels are unknown, we prove that they could be estimated under mild conditions. Both theoretical and empirical analyses verify the effectiveness of the proposed method. Large performance gains are shown for specific data sets.

摘要

众包标注系统提供了一种为给定观测生成多个不准确标签的有效方法。如果每个众包标注者的能力水平或“声誉”(可解释为标注正确标签的概率)相等且偏向于标注正确标签,多数投票(MV)是将多个标签合并为单个可靠标签的最优决策规则。然而,在实际中,众包标注系统所雇佣标注者的能力水平往往差异很大。在这些情况下,加权多数投票更受青睐。权重应由能力水平决定。然而,由于标注者是匿名的且真实标签通常未知,很难直接计算标注者的能力水平。在本文中,我们提议通过利用标注者的专业知识来学习加权多数投票的权重。具体而言,我们用不同的分布对不同标注者的领域知识进行建模,并将众包问题视为一个领域适应问题。标注者为源域提供标签,且假设目标域与真实标签相关联。通过使源域与目标域匹配来获得权重。尽管目标域标签未知,但我们证明在温和条件下它们是可以估计的。理论和实证分析都验证了所提方法的有效性。针对特定数据集展示了显著的性能提升。

相似文献

1
Domain-Weighted Majority Voting for Crowdsourcing.用于众包的领域加权多数投票
IEEE Trans Neural Netw Learn Syst. 2019 Jan;30(1):163-174. doi: 10.1109/TNNLS.2018.2836969. Epub 2018 Jun 5.
2
Crowdsourcing with the drift diffusion model of decision making.利用决策的漂移扩散模型进行众包。
Sci Rep. 2024 May 17;14(1):11311. doi: 10.1038/s41598-024-61687-y.
3
Beyond Majority Voting: A Coarse-to-Fine Label Filtration for Heavily Noisy Labels.超越多数投票:一种用于严重噪声标签的粗到细标签过滤方法。
IEEE Trans Neural Netw Learn Syst. 2019 Dec;30(12):3774-3787. doi: 10.1109/TNNLS.2019.2899045. Epub 2019 Mar 15.
4
Deep Learning From Multiple Noisy Annotators as A Union.将来自多个有噪声标注的深度学习作为一个联合。
IEEE Trans Neural Netw Learn Syst. 2023 Dec;34(12):10552-10562. doi: 10.1109/TNNLS.2022.3168696. Epub 2023 Nov 30.
5
Crowdsourcing for Machine Learning in Public Health Surveillance: Lessons Learned From Amazon Mechanical Turk.公共卫生监测中机器学习的众包:从亚马逊土耳其机器人学到的经验教训。
J Med Internet Res. 2022 Jan 18;24(1):e28749. doi: 10.2196/28749.
6
Improving Crowdsourced Label Quality Using Noise Correction.利用噪声校正提高众包标签质量。
IEEE Trans Neural Netw Learn Syst. 2018 May;29(5):1675-1688. doi: 10.1109/TNNLS.2017.2677468. Epub 2017 Mar 22.
7
Active learning with imbalanced multiple noisy labeling.基于不平衡多噪声标注的主动学习
IEEE Trans Cybern. 2015 May;45(5):1081-93. doi: 10.1109/TCYB.2014.2344674. Epub 2014 Aug 14.
8
Active Crowdsourcing for Multilabel Annotation.用于多标签标注的主动众包
IEEE Trans Neural Netw Learn Syst. 2024 Mar;35(3):3549-3559. doi: 10.1109/TNNLS.2022.3194022. Epub 2024 Feb 29.
9
Dual Alignment for Partial Domain Adaptation.双对齐用于部分领域自适应。
IEEE Trans Cybern. 2021 Jul;51(7):3404-3416. doi: 10.1109/TCYB.2020.2983337. Epub 2021 Jun 23.
10
Learning from crowds in digital pathology using scalable variational Gaussian processes.基于可扩展变分高斯过程的数字病理学众包学习。
Sci Rep. 2021 Jun 2;11(1):11612. doi: 10.1038/s41598-021-90821-3.

引用本文的文献

1
Data quality in crowdsourcing and spamming behavior detection.众包中的数据质量与垃圾邮件行为检测。
Behav Res Methods. 2025 Aug 8;57(9):251. doi: 10.3758/s13428-025-02757-5.
2
A truth inference scheme for crowdsourcing using NLP and swin transformers.一种使用自然语言处理和Swin变压器进行众包的真值推理方案。
Sci Rep. 2025 Aug 4;15(1):28338. doi: 10.1038/s41598-025-10942-x.
3
Incorporating label uncertainty during the training of convolutional neural networks improves performance for the discrimination between certain and inconclusive cases in dopamine transporter SPECT.
在卷积神经网络训练过程中纳入标签不确定性可提高多巴胺转运体单光子发射计算机断层扫描中确定病例与不确定病例之间鉴别的性能。
Eur J Nucl Med Mol Imaging. 2025 Mar;52(4):1535-1548. doi: 10.1007/s00259-024-06988-0. Epub 2024 Nov 27.
4
Ensemble Learning Using Individual Neonatal Data for Seizure Detection.基于个体新生儿数据的集成学习用于癫痫发作检测。
IEEE J Transl Eng Health Med. 2022 Aug 23;10:4901111. doi: 10.1109/JTEHM.2022.3201167. eCollection 2022.
5
Development of a Convolutional Neural Network-Based Colonoscopy Image Assessment Model for Differentiating Crohn's Disease and Ulcerative Colitis.基于卷积神经网络的结肠镜检查图像评估模型的开发,用于区分克罗恩病和溃疡性结肠炎。
Front Med (Lausanne). 2022 Apr 8;9:789862. doi: 10.3389/fmed.2022.789862. eCollection 2022.
6
A Fuzzy Fusion Rotating Machinery Fault Diagnosis Framework Based on the Enhancement Deep Convolutional Neural Networks.基于增强型深度卷积神经网络的模糊融合旋转机械故障诊断框架。
Sensors (Basel). 2022 Jan 16;22(2):671. doi: 10.3390/s22020671.
7
DSNetwork: An Integrative Approach to Visualize Predictions of Variants' Deleteriousness.DSNetwork:一种可视化变异有害性预测的综合方法。
Front Genet. 2020 Jan 17;10:1349. doi: 10.3389/fgene.2019.01349. eCollection 2019.