• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

需要多少人对同一张图像进行分类?一种优化二进制地理分类中志愿者贡献的方法。

How many people need to classify the same image? A method for optimizing volunteer contributions in binary geographical classifications.

机构信息

Southern Swedish Forest Research Centre, Swedish University of Agricultural Sciences, Alnarp, Sweden.

Faculty of International Studies, Utsunomiya University, Utsunomiya, Japan.

出版信息

PLoS One. 2022 May 19;17(5):e0267114. doi: 10.1371/journal.pone.0267114. eCollection 2022.

DOI:10.1371/journal.pone.0267114
PMID:35587481
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9119495/
Abstract

Involving members of the public in image classification tasks that can be tricky to automate is increasingly recognized as a way to complete large amounts of these tasks and promote citizen involvement in science. While this labor is usually provided for free, it is still limited, making it important for researchers to use volunteer contributions as efficiently as possible. Using volunteer labor efficiently becomes complicated when individual tasks are assigned to multiple volunteers to increase confidence that the correct classification has been reached. In this paper, we develop a system to decide when enough information has been accumulated to confidently declare an image to be classified and remove it from circulation. We use a Bayesian approach to estimate the posterior distribution of the mean rating in a binary image classification task. Tasks are removed from circulation when user-defined certainty thresholds are reached. We demonstrate this process using a set of over 4.5 million unique classifications by 2783 volunteers of over 190,000 images assessed for the presence/absence of cropland. If the system outlined here had been implemented in the original data collection campaign, it would have eliminated the need for 59.4% of volunteer ratings. Had this effort been applied to new tasks, it would have allowed an estimated 2.46 times as many images to have been classified with the same amount of labor, demonstrating the power of this method to make more efficient use of limited volunteer contributions. To simplify implementation of this method by other investigators, we provide cutoff value combinations for one set of confidence levels.

摘要

将公众成员纳入到难以实现自动化的图像分类任务中,已逐渐被视为完成大量此类任务和促进公民参与科学的一种方式。虽然此类劳动通常是无偿的,但它仍然是有限的,因此研究人员有必要尽可能有效地利用志愿者的贡献。当将单个任务分配给多个志愿者以提高对正确分类的信心时,有效地利用志愿者劳动就变得复杂了。在本文中,我们开发了一种系统,用于确定何时已经积累了足够的信息,可以有把握地宣布对图像进行分类并将其从流通中删除。我们使用贝叶斯方法来估计二进制图像分类任务中平均值的后验分布。当达到用户定义的置信度阈值时,任务将从流通中删除。我们使用一组超过 450 万个独特分类和 2783 名志愿者对超过 190000 张图像进行评估,以确定是否存在农田,来演示此过程。如果在此处概述的系统已经在原始数据收集活动中实施,那么它将消除 59.4%的志愿者评级的需求。如果将此努力应用于新任务,那么它将允许使用相同数量的劳动力对大约 2.46 倍的图像进行分类,从而证明了这种方法在更有效地利用有限的志愿者贡献方面的强大功能。为了简化其他研究人员对这种方法的实施,我们为一组置信水平提供了截止值组合。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f69/9119495/213045459f00/pone.0267114.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f69/9119495/b0c6f0d628ba/pone.0267114.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f69/9119495/90836a69e689/pone.0267114.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f69/9119495/5e912a94df36/pone.0267114.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f69/9119495/311d602f36e5/pone.0267114.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f69/9119495/953187ba979e/pone.0267114.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f69/9119495/213045459f00/pone.0267114.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f69/9119495/b0c6f0d628ba/pone.0267114.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f69/9119495/90836a69e689/pone.0267114.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f69/9119495/5e912a94df36/pone.0267114.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f69/9119495/311d602f36e5/pone.0267114.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f69/9119495/953187ba979e/pone.0267114.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1f69/9119495/213045459f00/pone.0267114.g006.jpg

相似文献

1
How many people need to classify the same image? A method for optimizing volunteer contributions in binary geographical classifications.需要多少人对同一张图像进行分类?一种优化二进制地理分类中志愿者贡献的方法。
PLoS One. 2022 May 19;17(5):e0267114. doi: 10.1371/journal.pone.0267114. eCollection 2022.
2
A generalized approach for producing, quantifying, and validating citizen science data from wildlife images.一种用于生成、量化和验证来自野生动物图像的公民科学数据的通用方法。
Conserv Biol. 2016 Jun;30(3):520-31. doi: 10.1111/cobi.12695. Epub 2016 Apr 25.
3
Using demographics toward efficient data classification in citizen science: a Bayesian approach.利用人口统计学实现公民科学中的高效数据分类:一种贝叶斯方法。
PeerJ Comput Sci. 2019 Nov 25;5:e239. doi: 10.7717/peerj-cs.239. eCollection 2019.
4
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
5
Assessing citizen contributions to butterfly monitoring in two large cities.评估两个大城市中公民对蝴蝶监测的贡献。
Conserv Biol. 2012 Jun;26(3):557-64. doi: 10.1111/j.1523-1739.2012.01825.x. Epub 2012 Mar 22.
6
The role of automated feedback in training and retaining biological recorders for citizen science.自动反馈在公民科学中培训和留住生物记录员方面的作用。
Conserv Biol. 2016 Jun;30(3):550-61. doi: 10.1111/cobi.12705. Epub 2016 Apr 25.
7
Volunteer involvement in the organisation of palliative care: A survey study of the healthcare system in Flanders and Dutch-speaking Brussels, Belgium.志愿者在姑息治疗组织中的参与:对比利时佛兰德斯和荷语布鲁塞尔医疗保健系统的调查研究。
Health Soc Care Community. 2019 Mar;27(2):459-471. doi: 10.1111/hsc.12666. Epub 2018 Oct 16.
8
[A new approach for studying the retinal and choroidal circulation].[一种研究视网膜和脉络膜循环的新方法]
Nippon Ganka Gakkai Zasshi. 2004 Dec;108(12):836-61; discussion 862.
9
[Characteristics and challenges of health promotion volunteer organizations in Japan: Findings from a national municipality survey].[日本健康促进志愿者组织的特征与挑战:全国市级调查结果]
Nihon Koshu Eisei Zasshi. 2019;66(11):712-722. doi: 10.11236/jph.66.11_712.
10
Binary tissue classification on wound images with neural networks and bayesian classifiers.基于神经网络和贝叶斯分类器的伤口图像二进制组织分类。
IEEE Trans Med Imaging. 2010 Feb;29(2):410-27. doi: 10.1109/TMI.2009.2033595. Epub 2009 Oct 13.

引用本文的文献

1
New directions in mapping the Earth's surface with citizen science and generative AI.利用公民科学和生成式人工智能绘制地球表面的新方向。
iScience. 2025 Jan 30;28(3):111919. doi: 10.1016/j.isci.2025.111919. eCollection 2025 Mar 21.

本文引用的文献

1
Scalable deep learning to identify brick kilns and aid regulatory capacity.可扩展的深度学习技术,用于识别砖窑并增强监管能力。
Proc Natl Acad Sci U S A. 2021 Apr 27;118(17). doi: 10.1073/pnas.2018863118.
2
Amazon's Mechanical Turk: A New Source of Inexpensive, Yet High-Quality, Data?亚马逊土耳其机器人:一种新的廉价、高质量数据来源?
Perspect Psychol Sci. 2011 Jan;6(1):3-5. doi: 10.1177/1745691610393980. Epub 2011 Feb 3.
3
Snapshot Serengeti, high-frequency annotated camera trap images of 40 mammalian species in an African savanna.
《塞伦盖蒂快照》,非洲热带稀树草原 40 种哺乳动物高频标注的相机陷阱图像。
Sci Data. 2015 Jun 9;2:150026. doi: 10.1038/sdata.2015.26. eCollection 2015.