• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

"野外 UDE 硅藻 2024":一个用于训练深度学习模型的淡水硅藻新图像数据集。

"UDE DIATOMS in the Wild 2024": a new image dataset of freshwater diatoms for training deep learning models.

机构信息

Université de Lorraine, CNRS, LIEC, F-57000 Metz, France.

Georgia Tech Europe, CNRS IRL 2958, F-57000 Metz, France.

出版信息

Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae087.

DOI:10.1093/gigascience/giae087
PMID:39607983
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11604061/
Abstract

BACKGROUND

Diatoms are microalgae with finely ornamented microscopic silica shells. Their taxonomic identification by light microscopy is routinely used as part of community ecological research as well as ecological status assessment of aquatic ecosystems, and a need for digitalization of these methods has long been recognized. Alongside their high taxonomic and morphological diversity, several other factors make diatoms highly challenging for deep learning-based identification using light microscopy images. These include (i) an unusually high intraclass variability combined with small between-class differences, (ii) a rather different visual appearance of specimens depending on their orientation on the microscope slide, and (iii) the limited availability of diatom experts for accurate taxonomic annotation.

FINDINGS

We present the largest diatom image dataset thus far, aimed at facilitating the application and benchmarking of innovative deep learning methods to the diatom identification problem on realistic research data, "UDE DIATOMS in the Wild 2024." The dataset contains 83,570 images of 611 diatom taxa, 101 of which are represented by at least 100 examples and 144 by at least 50 examples each. We showcase this dataset in 2 innovative analyses that address individual aspects of the above challenges using subclustering to deal with visually heterogeneous classes, out-of-distribution sample detection, and semi-supervised learning.

CONCLUSIONS

The problem of image-based identification of diatoms is both important for environmental research and challenging from the machine learning perspective. By making available the so far largest image dataset, accompanied by innovative analyses, this contribution will facilitate addressing these points by the scientific community.

摘要

背景

硅藻是一种具有精细装饰性微硅壳的微藻。它们的分类鉴定通过光学显微镜被常规用于群落生态研究以及水生生态系统的生态状况评估,并且长期以来一直需要将这些方法数字化。除了具有高度的分类和形态多样性外,还有其他几个因素使得基于光学显微镜图像的深度学习识别对硅藻来说极具挑战性。这些因素包括:(i)异常高的类内变异性与小的类间差异相结合,(ii)由于在显微镜载玻片上的取向不同,标本的外观相当不同,以及(iii)可用于准确分类注释的硅藻专家的有限可用性。

结果

我们提出了迄今为止最大的硅藻图像数据集,旨在促进创新的深度学习方法在现实研究数据上应用和基准测试,“UDE DIATOMS in the Wild 2024”。该数据集包含 611 个硅藻类别的 83570 张图像,其中 101 个类别至少有 100 个样本,144 个类别至少有 50 个样本。我们通过子聚类来处理视觉上异质的类别、离群样本检测和半监督学习,展示了这个数据集在 2 个创新分析中的应用,这些分析解决了上述挑战的各个方面。

结论

基于图像的硅藻识别问题对环境研究很重要,从机器学习的角度来看也具有挑战性。通过提供迄今为止最大的图像数据集,并结合创新的分析,本研究将有助于科学界解决这些问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/5135de33f14d/giae087fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/de3ba02333ff/giae087fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/3c27a034c5b6/giae087fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/69ce22260fe7/giae087fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/bf86df0fda0a/giae087fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/7c5de6f2d078/giae087fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/503390a83352/giae087fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/352f37637adf/giae087fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/d1f42c49d87c/giae087fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/3794b1a31019/giae087fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/5135de33f14d/giae087fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/de3ba02333ff/giae087fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/3c27a034c5b6/giae087fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/69ce22260fe7/giae087fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/bf86df0fda0a/giae087fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/7c5de6f2d078/giae087fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/503390a83352/giae087fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/352f37637adf/giae087fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/d1f42c49d87c/giae087fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/3794b1a31019/giae087fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a4d/11604061/5135de33f14d/giae087fig10.jpg

相似文献

1
"UDE DIATOMS in the Wild 2024": a new image dataset of freshwater diatoms for training deep learning models."野外 UDE 硅藻 2024":一个用于训练深度学习模型的淡水硅藻新图像数据集。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae087.
2
Improving deep learning-based segmentation of diatoms in gigapixel-sized virtual slides by object-based tile positioning and object integrity constraint.通过基于对象的图块定位和对象完整性约束来改进千兆像素大小虚拟幻灯片中基于深度学习的硅藻分割。
PLoS One. 2023 Feb 24;18(2):e0272103. doi: 10.1371/journal.pone.0272103. eCollection 2023.
3
Learning Diatoms Classification from a Dry Test Slide by Holographic Microscopy.基于全息显微镜的干片测试幻灯片的硅藻分类研究。
Sensors (Basel). 2020 Nov 7;20(21):6353. doi: 10.3390/s20216353.
4
Microscopic image recognition of diatoms based on deep learning.基于深度学习的硅藻微观图像识别。
J Phycol. 2023 Dec;59(6):1166-1178. doi: 10.1111/jpy.13390. Epub 2023 Nov 23.
5
Metabarcoding vs. morphological identification to assess diatom diversity in environmental studies.代谢条码与形态鉴定在环境研究中评估硅藻多样性的比较。
Mol Ecol Resour. 2015 May;15(3):526-42. doi: 10.1111/1755-0998.12336. Epub 2014 Oct 20.
6
Diatoms.org: supporting taxonomists, connecting communities.硅藻.org:支持分类学家,连接各群体。
Diatom Res. 2022 Jan 11;36(4):291-304. doi: 10.1080/0269249X.2021.2006790.
7
R-Syst::diatom: an open-access and curated barcode database for diatoms and freshwater monitoring.R-Syst::硅藻:一个用于硅藻和淡水监测的开放获取且经过整理的条形码数据库。
Database (Oxford). 2016 Mar 17;2016. doi: 10.1093/database/baw016. Print 2016.
8
Deep learning-based diatom taxonomy on virtual slides.基于深度学习的虚拟切片上的硅藻分类学。
Sci Rep. 2020 Sep 2;10(1):14416. doi: 10.1038/s41598-020-71165-w.
9
Freshwater diatom transfer to clothing: Spatial and temporal influences on trace evidence in forensic reconstructions.淡水硅藻转移至衣物:法医学重建中对微量证据的时空影响
Sci Justice. 2019 May;59(3):292-305. doi: 10.1016/j.scijus.2018.11.004. Epub 2018 Nov 19.
10
On the objectivity, reliability, and validity of deep learning enabled bioimage analyses.深度学习赋能的生物影像分析的客观性、可靠性和有效性。
Elife. 2020 Oct 19;9:e59780. doi: 10.7554/eLife.59780.

引用本文的文献

1
Scaling down annotation needs: The capacity of self-supervised learning on diatom classification.缩减注释需求:硅藻分类中自监督学习的能力
iScience. 2025 Mar 20;28(4):112236. doi: 10.1016/j.isci.2025.112236. eCollection 2025 Apr 18.

本文引用的文献

1
Microscopic image recognition of diatoms based on deep learning.基于深度学习的硅藻微观图像识别。
J Phycol. 2023 Dec;59(6):1166-1178. doi: 10.1111/jpy.13390. Epub 2023 Nov 23.
2
DiatomNet v1.0: A novel approach for automatic diatom testing for drowning diagnosis in forensically biomedical application.藻种自动检测分析软件 1.0 版:法医学中生物医学应用中溺死诊断的一种新的自动硅藻检测方法。
Comput Methods Programs Biomed. 2023 Apr;232:107434. doi: 10.1016/j.cmpb.2023.107434. Epub 2023 Feb 21.
3
Improving deep learning-based segmentation of diatoms in gigapixel-sized virtual slides by object-based tile positioning and object integrity constraint.
通过基于对象的图块定位和对象完整性约束来改进千兆像素大小虚拟幻灯片中基于深度学习的硅藻分割。
PLoS One. 2023 Feb 24;18(2):e0272103. doi: 10.1371/journal.pone.0272103. eCollection 2023.
4
Machine learning techniques to characterize functional traits of plankton from image data.利用机器学习技术从图像数据中表征浮游生物的功能特征。
Limnol Oceanogr. 2022 Aug;67(8):1647-1669. doi: 10.1002/lno.12101. Epub 2022 Jun 30.
5
An improved automated diatom detection method based on YOLOv5 framework and its preliminary study for taxonomy recognition in the forensic diatom test.一种基于YOLOv5框架的改进型硅藻自动检测方法及其在法医硅藻检验中分类识别的初步研究
Front Microbiol. 2022 Aug 19;13:963059. doi: 10.3389/fmicb.2022.963059. eCollection 2022.
6
Diatoms.org: supporting taxonomists, connecting communities.硅藻.org:支持分类学家,连接各群体。
Diatom Res. 2022 Jan 11;36(4):291-304. doi: 10.1080/0269249X.2021.2006790.
7
Identification of diatom taxonomy by a combination of region-based full convolutional network, online hard example mining, and shape priors of diatoms.基于区域的全卷积网络、在线硬例挖掘和硅藻形状先验相结合的硅藻分类学鉴定。
Int J Legal Med. 2021 Nov;135(6):2519-2530. doi: 10.1007/s00414-021-02664-2. Epub 2021 Jul 20.
8
Epiphytic diatom community structure and richness is determined by macroalgal host and location in the South Shetland Islands (Antarctica).附生硅藻群落结构和丰富度由南极南设得兰群岛的大型藻类宿主和位置决定。
PLoS One. 2021 Apr 30;16(4):e0250629. doi: 10.1371/journal.pone.0250629. eCollection 2021.
9
The Biological Assessment and Rehabilitation of the World's Rivers: An Overview.世界河流的生物评估与修复:概述
Water (Basel). 2021 Jan 31;13(3):371. doi: 10.3390/w13030371.
10
An efficient method for building a database of diatom populations for drowning site inference using a deep learning algorithm.利用深度学习算法建立用于推断溺水地点的硅藻种群数据库的有效方法。
Int J Legal Med. 2021 May;135(3):817-827. doi: 10.1007/s00414-020-02497-5. Epub 2021 Jan 3.