• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

众包中的数据质量与垃圾邮件行为检测。

Data quality in crowdsourcing and spamming behavior detection.

作者信息

Ba Yang, Mancenido Michelle V, Chiou Erin K, Pan Rong

机构信息

Ira A. Fulton Schools of Engineering, School of Computing and Augmented Intelligence, Data Science, Analytics and Engineering, Arizona State University, Suite 342AE, 3rd floor 699 S. Mill Avenue, 85281, Tempe, AZ, USA.

School of Mathematical and Natural Sciences, Arizona State University, Tempe, AZ, USA.

出版信息

Behav Res Methods. 2025 Aug 8;57(9):251. doi: 10.3758/s13428-025-02757-5.

DOI:10.3758/s13428-025-02757-5
PMID:40778971
Abstract

As crowdsourcing emerges as an efficient and cost-effective method for obtaining labels for machine learning datasets, it is important to assess the quality of crowd-provided data to improve analysis performance and reduce biases in subsequent machine learning tasks. Given the lack of ground truth in most cases of crowdsourcing, we refer to data quality as the annotators' consistency and credibility. Unlike the simple scenarios where kappa coefficient and intraclass correlation coefficient usually can apply, online crowdsourcing requires dealing with more complex situations. We introduce a systematic method for evaluating data quality and detecting spamming threats via variance decomposition, and we classify spammers into three categories based on their different behavioral patterns. A spammer index is proposed to assess entire data consistency, and two metrics are developed to measure crowd workers' credibility by utilizing the Markov chain and generalized random effects models. Furthermore, we demonstrate the practicality of our techniques and their advantages by applying them to a face verification task using both simulated and real-world data collected from two crowdsourcing platforms.

摘要

随着众包成为一种为机器学习数据集获取标签的高效且经济高效的方法,评估众包数据的质量对于提高分析性能和减少后续机器学习任务中的偏差至关重要。鉴于在大多数众包情况下缺乏地面真值,我们将数据质量定义为注释者的一致性和可信度。与通常可以应用卡帕系数和类内相关系数的简单场景不同,在线众包需要处理更复杂的情况。我们引入了一种通过方差分解评估数据质量和检测垃圾邮件威胁的系统方法,并根据垃圾邮件发送者的不同行为模式将其分为三类。提出了一个垃圾邮件发送者指数来评估整个数据的一致性,并开发了两个指标来利用马尔可夫链和广义随机效应模型来衡量众包工作者的可信度。此外,我们通过将我们的技术应用于使用从两个人工众包平台收集的模拟数据和真实世界数据的面部验证任务,证明了我们技术的实用性及其优势。

相似文献

1
Data quality in crowdsourcing and spamming behavior detection.众包中的数据质量与垃圾邮件行为检测。
Behav Res Methods. 2025 Aug 8;57(9):251. doi: 10.3758/s13428-025-02757-5.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Gamified Crowdsourcing as a Novel Approach to Lung Ultrasound Data Set Labeling: Prospective Analysis.游戏化众包作为一种新型的肺部超声数据集标注方法:前瞻性分析。
J Med Internet Res. 2024 Jul 4;26:e51397. doi: 10.2196/51397.
4
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
5
Healthcare workers' informal uses of mobile phones and other mobile devices to support their work: a qualitative evidence synthesis.医护人员非正规使用手机和其他移动设备来支持工作:定性证据综合评价。
Cochrane Database Syst Rev. 2024 Aug 27;8(8):CD015705. doi: 10.1002/14651858.CD015705.pub2.
6
Evaluating crowdsourcing for ICU EEG annotation: A comparison with expert performance.评估众包用于重症监护病房脑电图注释:与专家表现的比较。
Epilepsia. 2025 Aug 6. doi: 10.1111/epi.18547.
7
Automated devices for identifying peripheral arterial disease in people with leg ulceration: an evidence synthesis and cost-effectiveness analysis.用于识别下肢溃疡患者外周动脉疾病的自动化设备:证据综合和成本效益分析。
Health Technol Assess. 2024 Aug;28(37):1-158. doi: 10.3310/TWCG3912.
8
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
9
Comparison of self-administered survey questionnaire responses collected using mobile apps versus other methods.使用移动应用程序与其他方法收集的自我管理调查问卷回复的比较。
Cochrane Database Syst Rev. 2015 Jul 27;2015(7):MR000042. doi: 10.1002/14651858.MR000042.pub2.
10
The quantity, quality and findings of network meta-analyses evaluating the effectiveness of GLP-1 RAs for weight loss: a scoping review.评估胰高血糖素样肽-1受体激动剂(GLP-1 RAs)减肥效果的网状Meta分析的数量、质量及结果:一项范围综述
Health Technol Assess. 2025 Jun 25:1-73. doi: 10.3310/SKHT8119.

本文引用的文献

1
Data quality of platforms and panels for online behavioral research.在线行为研究的平台和面板的数据质量。
Behav Res Methods. 2022 Aug;54(4):1643-1662. doi: 10.3758/s13428-021-01694-3. Epub 2021 Sep 29.
2
Kappa and Beyond: Is There Agreement?卡帕值及其他:是否存在一致性?
Global Spine J. 2020 Jun;10(4):499-501. doi: 10.1177/2192568220911648. Epub 2020 Mar 3.
3
Intraclass correlation - A discussion and demonstration of basic features.组内相关系数 - 基本特征的讨论与演示。
PLoS One. 2019 Jul 22;14(7):e0219854. doi: 10.1371/journal.pone.0219854. eCollection 2019.
4
Domain-Weighted Majority Voting for Crowdsourcing.用于众包的领域加权多数投票
IEEE Trans Neural Netw Learn Syst. 2019 Jan;30(1):163-174. doi: 10.1109/TNNLS.2018.2836969. Epub 2018 Jun 5.
5
Performance of intraclass correlation coefficient (ICC) as a reliability index under various distributions in scale reliability studies.在不同分布下的量表信度研究中,组内相关系数(ICC)作为可靠性指标的表现。
Stat Med. 2018 Aug 15;37(18):2734-2752. doi: 10.1002/sim.7679. Epub 2018 Apr 29.
6
Validity and Reliability of the Brazilian Version of the Rapid Estimate of Adult Literacy in Dentistry--BREALD-30.巴西版牙科成人识字率快速评估量表(BREALD-30)的效度和信度
PLoS One. 2015 Jul 9;10(7):e0131600. doi: 10.1371/journal.pone.0131600. eCollection 2015.
7
Interrater reliability: the kappa statistic.组内一致性:kappa 统计量。
Biochem Med (Zagreb). 2012;22(3):276-82.
8
Distributions of the Kullback-Leibler divergence with applications.Kullback-Leibler 散度分布及其应用。
Br J Math Stat Psychol. 2011 May;64(Pt 2):291-309. doi: 10.1348/000711010X522227.
9
Kappa coefficients in medical research.医学研究中的卡帕系数。
Stat Med. 2002 Jul 30;21(14):2109-29. doi: 10.1002/sim.1180.