Suppr
超能文献

论公民科学数据质量对海洋图像中基于深度学习的分类的影响。

On the impact of Citizen Science-derived data quality on deep learning based classification in marine images.

机构信息

Biodata Mining Group, Faculty of Technology, Bielefeld University, Bielefeld, Germany.

National Oceanography Centre, University of Southampton Waterfront Campus, Southampton, United Kingdom.

出版信息

PLoS One. 2019 Jun 12;14(6):e0218086. doi: 10.1371/journal.pone.0218086. eCollection 2019.

DOI:10.1371/journal.pone.0218086

PMID:31188894

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6561570/

Abstract

The evaluation of large amounts of digital image data is of growing importance for biology, including for the exploration and monitoring of marine habitats. However, only a tiny percentage of the image data collected is evaluated by marine biologists who manually interpret and annotate the image contents, which can be slow and laborious. In order to overcome the bottleneck in image annotation, two strategies are increasingly proposed: "citizen science" and "machine learning". In this study, we investigated how the combination of citizen science, to detect objects, and machine learning, to classify megafauna, could be used to automate annotation of underwater images. For this purpose, multiple large data sets of citizen science annotations with different degrees of common errors and inaccuracies observed in citizen science data were simulated by modifying "gold standard" annotations done by an experienced marine biologist. The parameters of the simulation were determined on the basis of two citizen science experiments. It allowed us to analyze the relationship between the outcome of a citizen science study and the quality of the classifications of a deep learning megafauna classifier. The results show great potential for combining citizen science with machine learning, provided that the participants are informed precisely about the annotation protocol. Inaccuracies in the position of the annotation had the most substantial influence on the classification accuracy, whereas the size of the marking and false positive detections had a smaller influence.

摘要

大量数字图像数据的评估对于生物学越来越重要，包括对海洋栖息地的探索和监测。然而，只有一小部分图像数据被海洋生物学家进行评估，他们手动解释和注释图像内容，这可能既缓慢又费力。为了克服图像注释的瓶颈，越来越多地提出了两种策略：“公民科学”和“机器学习”。在这项研究中，我们研究了如何结合公民科学来检测物体，以及机器学习来对大型动物进行分类，从而实现水下图像的自动注释。为此，我们通过修改由经验丰富的海洋生物学家完成的“黄金标准”注释，模拟了具有不同程度常见错误和不准确的公民科学注释的多个大数据集。模拟的参数是基于两个公民科学实验确定的。这使我们能够分析公民科学研究的结果与深度学习大型动物分类器的分类质量之间的关系。结果表明，只要参与者准确了解注释协议，就可以极大地发挥公民科学与机器学习相结合的潜力。注释位置的不准确性对分类准确性的影响最大，而标记的大小和假阳性检测的影响较小。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa67/6561570/93db5b0a6047/pone.0218086.g001.jpg

相似文献

On the impact of Citizen Science-derived data quality on deep learning based classification in marine images.

PLoS One. 2019 Jun 12;14(6):e0218086. doi: 10.1371/journal.pone.0218086. eCollection 2019.

Quantification of litter in cities using a smartphone application and citizen science in conjunction with deep learning-based image processing.

Waste Manag. 2024 Sep 15;186:271-279. doi: 10.1016/j.wasman.2024.06.026. Epub 2024 Jun 28.

Deep learning identification for citizen science surveillance of tiger mosquitoes.

Sci Rep. 2021 Feb 25;11(1):4718. doi: 10.1038/s41598-021-83657-4.

Deep learning is combined with massive-scale citizen science to improve large-scale image classification.

Nat Biotechnol. 2018 Oct;36(9):820-828. doi: 10.1038/nbt.4225. Epub 2018 Aug 20.

Crowdsourcing image segmentation for deep learning: integrated platform for citizen science, paid microtask, and gamification.

Biomed Tech (Berl). 2023 Dec 26;69(3):293-305. doi: 10.1515/bmt-2023-0148. Print 2024 Jun 25.

An open-source, citizen science and machine learning approach to analyse subsea movies.

Biodivers Data J. 2021 Feb 24;9:e60548. doi: 10.3897/BDJ.9.e60548. eCollection 2021.

Litter Detection with Deep Learning: A Comparative Study.

Sensors (Basel). 2022 Jan 11;22(2):548. doi: 10.3390/s22020548.

MAIA-A machine learning assisted image annotation method for environmental monitoring and exploration.

PLoS One. 2018 Nov 16;13(11):e0207498. doi: 10.1371/journal.pone.0207498. eCollection 2018.

RIL-Contour: a Medical Imaging Dataset Annotation Tool for and with Deep Learning.

J Digit Imaging. 2019 Aug;32(4):571-581. doi: 10.1007/s10278-019-00232-0.

Deep learning increases the availability of organism photographs taken by citizens in citizen science programs.

Sci Rep. 2022 Jan 24;12(1):1210. doi: 10.1038/s41598-022-05163-5.

引用本文的文献

Making sense of fossils and artefacts: a review of best practices for the design of a successful workflow for machine learning-assisted citizen science projects.

PeerJ. 2025 Feb 13;13:e18927. doi: 10.7717/peerj.18927. eCollection 2025.

Modelling heterogeneity in the classification process in multi-species distribution models can improve predictive performance.

Ecol Evol. 2024 Mar 7;14(3):e11092. doi: 10.1002/ece3.11092. eCollection 2024 Mar.

Machine learning to support citizen science in urban environmental management.

Heliyon. 2023 Nov 22;9(12):e22688. doi: 10.1016/j.heliyon.2023.e22688. eCollection 2023 Dec.

NEAL: an open-source tool for audio annotation.

PeerJ. 2023 Aug 25;11:e15913. doi: 10.7717/peerj.15913. eCollection 2023.

Applications of Machine Learning in Chemical and Biological Oceanography.

ACS Omega. 2023 Apr 27;8(18):15831-15853. doi: 10.1021/acsomega.2c06441. eCollection 2023 May 9.

The Impact of Data Augmentations on Deep Learning-Based Marine Object Classification in Benthic Image Transects.

Sensors (Basel). 2022 Jul 19;22(14):5383. doi: 10.3390/s22145383.

ALMI-A Generic Active Learning System for Computational Object Classification in Marine Observation Images.

Sensors (Basel). 2021 Feb 6;21(4):1134. doi: 10.3390/s21041134.

本文引用的文献

MAIA-A machine learning assisted image annotation method for environmental monitoring and exploration.

PLoS One. 2018 Nov 16;13(11):e0207498. doi: 10.1371/journal.pone.0207498. eCollection 2018.

An evaluation of the error and uncertainty in epibenthos cover estimates from AUV images collected with an efficient, spatially-balanced design.

PLoS One. 2018 Sep 18;13(9):e0203827. doi: 10.1371/journal.pone.0203827. eCollection 2018.

Deep learning is combined with massive-scale citizen science to improve large-scale image classification.

Nat Biotechnol. 2018 Oct;36(9):820-828. doi: 10.1038/nbt.4225. Epub 2018 Aug 20.

Biological responses to disturbance from simulated deep-sea polymetallic nodule mining.

PLoS One. 2017 Feb 8;12(2):e0171750. doi: 10.1371/journal.pone.0171750. eCollection 2017.

What Is Citizen Science?--A Scientometric Meta-Analysis.

PLoS One. 2016 Jan 14;11(1):e0147152. doi: 10.1371/journal.pone.0147152. eCollection 2016.

Deep learning.

Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.

From principles to practice: a spatial approach to systematic conservation planning in the deep sea.

Proc Biol Sci. 2013 Nov 6;280(1773):20131684. doi: 10.1098/rspb.2013.1684. Print 2013 Dec 22.

Semi-automated image analysis for the assessment of megafaunal densities at the Arctic deep-sea observatory HAUSGARTEN.

PLoS One. 2012;7(6):e38179. doi: 10.1371/journal.pone.0038179. Epub 2012 Jun 5.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

论公民科学数据质量对海洋图像中基于深度学习的分类的影响。

On the impact of Citizen Science-derived data quality on deep learning based classification in marine images.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译