• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用人口统计学实现公民科学中的高效数据分类:一种贝叶斯方法。

Using demographics toward efficient data classification in citizen science: a Bayesian approach.

作者信息

De Lellis Pietro, Nakayama Shinnosuke, Porfiri Maurizio

机构信息

Department of Electrical Engineering and Information Technology, University of Naples Federico II, Naples, Italy.

Department of Mechanical and Aerospace Engineering, New York University Tandon School of Engineering, Brooklyn, NY, USA.

出版信息

PeerJ Comput Sci. 2019 Nov 25;5:e239. doi: 10.7717/peerj-cs.239. eCollection 2019.

DOI:10.7717/peerj-cs.239
PMID:33816892
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7924415/
Abstract

Public participation in scientific activities, often called citizen science, offers a possibility to collect and analyze an unprecedentedly large amount of data. However, diversity of volunteers poses a challenge to obtain accurate information when these data are aggregated. To overcome this problem, we propose a classification algorithm using Bayesian inference that harnesses diversity of volunteers to improve data accuracy. In the algorithm, each volunteer is grouped into a distinct class based on a survey regarding either their level of education or motivation to citizen science. We obtained the behavior of each class through a training set, which was then used as a prior information to estimate performance of new volunteers. By applying this approach to an existing citizen science dataset to classify images into categories, we demonstrate improvement in data accuracy, compared to the traditional majority voting. Our algorithm offers a simple, yet powerful, way to improve data accuracy under limited effort of volunteers by predicting the behavior of a class of individuals, rather than attempting at a granular description of each of them.

摘要

公众参与科学活动,通常称为公民科学,为收集和分析数量空前庞大的数据提供了一种可能。然而,志愿者的多样性给汇总这些数据时获取准确信息带来了挑战。为克服这一问题,我们提出一种使用贝叶斯推理的分类算法,该算法利用志愿者的多样性来提高数据准确性。在该算法中,根据关于志愿者教育水平或参与公民科学的动机的一项调查,将每个志愿者归入一个不同的类别。我们通过一个训练集得出每个类别的行为,然后将其用作先验信息来估计新志愿者的表现。通过将这种方法应用于现有的公民科学数据集以将图像分类,与传统的多数投票相比,我们证明了数据准确性的提高。我们的算法提供了一种简单却强大的方法,通过预测一类个体的行为,而非试图对每个个体进行细致描述,在志愿者付出有限努力的情况下提高数据准确性。

相似文献

1
Using demographics toward efficient data classification in citizen science: a Bayesian approach.利用人口统计学实现公民科学中的高效数据分类:一种贝叶斯方法。
PeerJ Comput Sci. 2019 Nov 25;5:e239. doi: 10.7717/peerj-cs.239. eCollection 2019.
2
Camera settings and biome influence the accuracy of citizen science approaches to camera trap image classification.相机设置和生物群落会影响公民科学方法对相机陷阱图像分类的准确性。
Ecol Evol. 2020 Oct 6;10(21):11954-11965. doi: 10.1002/ece3.6722. eCollection 2020 Nov.
3
The Role of Social Interactions in Motor Performance: Feasibility Study Toward Enhanced Motivation in Telerehabilitation.社交互动在运动表现中的作用:远程康复中增强动机的可行性研究。
J Med Internet Res. 2019 May 15;21(5):e12708. doi: 10.2196/12708.
4
The potential for citizen science to produce reliable and useful information in ecology.公民科学在生态学中产生可靠和有用信息的潜力。
Conserv Biol. 2019 Jun;33(3):561-569. doi: 10.1111/cobi.13223. Epub 2018 Nov 27.
5
Accuracy of long-term volunteer water monitoring data: A multiscale analysis from a statewide citizen science program.长期志愿者水质监测数据的准确性:来自全州公民科学计划的多尺度分析。
PLoS One. 2020 Jan 29;15(1):e0227540. doi: 10.1371/journal.pone.0227540. eCollection 2020.
6
A generalized approach for producing, quantifying, and validating citizen science data from wildlife images.一种用于生成、量化和验证来自野生动物图像的公民科学数据的通用方法。
Conserv Biol. 2016 Jun;30(3):520-31. doi: 10.1111/cobi.12695. Epub 2016 Apr 25.
7
The role of automated feedback in training and retaining biological recorders for citizen science.自动反馈在公民科学中培训和留住生物记录员方面的作用。
Conserv Biol. 2016 Jun;30(3):550-61. doi: 10.1111/cobi.12705. Epub 2016 Apr 25.
8
A Citizen Science Approach to Determine Physical Activity Patterns and Demographics of Greenway Users in Winston-Salem, North Carolina.一种公民科学方法,用于确定北卡罗来纳州温斯顿-塞勒姆市绿道使用者的体力活动模式和人口统计学特征。
Int J Environ Res Public Health. 2019 Aug 29;16(17):3150. doi: 10.3390/ijerph16173150.
9
Incorporating citizen science data in spatially explicit integrated population models.将公民科学数据纳入具有空间显式的综合人口模型中。
Ecology. 2019 Sep;100(9):e02777. doi: 10.1002/ecy.2777. Epub 2019 Jul 18.
10
Design principles for engaging and retaining virtual citizen scientists.吸引和留住虚拟公民科学家的设计原则。
Conserv Biol. 2016 Jun;30(3):562-70. doi: 10.1111/cobi.12627. Epub 2016 Apr 25.

本文引用的文献

1
Producing knowledge by admitting ignorance: Enhancing data quality through an "I don't know" option in citizen science.通过承认无知来产生知识:通过公民科学中的“我不知道”选项来提高数据质量。
PLoS One. 2019 Feb 27;14(2):e0211907. doi: 10.1371/journal.pone.0211907. eCollection 2019.
2
Quality of non-expert citizen science data collected for habitat type conservation status assessment in Natura 2000 protected areas.非专业公民科学数据在 Natura 2000 保护区生境类型保护状况评估中的质量。
Sci Rep. 2017 Aug 21;7(1):8873. doi: 10.1038/s41598-017-09316-9.
3
A natural user interface to integrate citizen science and physical exercise.
一种整合公民科学与体育锻炼的自然用户界面。
PLoS One. 2017 Feb 23;12(2):e0172587. doi: 10.1371/journal.pone.0172587. eCollection 2017.
4
Activating social strategies: Face-to-face interaction in technology-mediated citizen science.激活社交策略:技术介导的公民科学中的面对面互动
J Environ Manage. 2016 Nov 1;182:374-384. doi: 10.1016/j.jenvman.2016.07.092. Epub 2016 Aug 4.
5
Snapshot Serengeti, high-frequency annotated camera trap images of 40 mammalian species in an African savanna.《塞伦盖蒂快照》,非洲热带稀树草原 40 种哺乳动物高频标注的相机陷阱图像。
Sci Data. 2015 Jun 9;2:150026. doi: 10.1038/sdata.2015.26. eCollection 2015.
6
Increasing patient engagement in rehabilitation exercises using computer-based citizen science.利用基于计算机的公民科学提高患者在康复锻炼中的参与度。
PLoS One. 2015 Mar 20;10(3):e0117013. doi: 10.1371/journal.pone.0117013. eCollection 2015.
7
Crowd science user contribution patterns and their implications.众包科学用户贡献模式及其影响。
Proc Natl Acad Sci U S A. 2015 Jan 20;112(3):679-84. doi: 10.1073/pnas.1408907112. Epub 2015 Jan 5.
8
Classification in the presence of label noise: a survey.带标签噪声的分类:综述。
IEEE Trans Neural Netw Learn Syst. 2014 May;25(5):845-69. doi: 10.1109/TNNLS.2013.2292894.
9
Scientists@Home: what drives the quantity and quality of online citizen science participation?居家科学家:是什么驱动了在线公民科学参与的数量和质量?
PLoS One. 2014 Apr 1;9(4):e90375. doi: 10.1371/journal.pone.0090375. eCollection 2014.
10
Citizen science. Next steps for citizen science.公民科学。公民科学的下一步发展。
Science. 2014 Mar 28;343(6178):1436-7. doi: 10.1126/science.1251554.