• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于领域内与领域外的浮游生物图像分类迁移学习。

In-domain versus out-of-domain transfer learning in plankton image classification.

机构信息

Istituto Italiano di Tecnologia, Genoa, Italy.

MaLGa-DIBRIS, Università degli studi di Genova, Genoa, Italy.

出版信息

Sci Rep. 2023 Jun 27;13(1):10443. doi: 10.1038/s41598-023-37627-7.

DOI:10.1038/s41598-023-37627-7
PMID:37369770
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10300094/
Abstract

Plankton microorganisms play a huge role in the aquatic food web. Recently, it has been proposed to use plankton as a biosensor, since they can react to even minimal perturbations of the aquatic environment with specific physiological changes, which may lead to alterations in morphology and behavior. Nowadays, the development of high-resolution in-situ automatic acquisition systems allows the research community to obtain a large amount of plankton image data. Fundamental examples are the ZooScan and Woods Hole Oceanographic Institution (WHOI) datasets, comprising up to millions of plankton images. However, obtaining unbiased annotations is expensive both in terms of time and resources, and in-situ acquired datasets generally suffer from severe imbalance, with only a few images available for several species. Transfer learning is a popular solution to these challenges, with ImageNet1K being the most-used source dataset for pre-training. On the other hand, datasets like the ZooScan and the WHOI may represent a valuable opportunity to compare out-of-domain and large-scale plankton in-domain source datasets, in terms of performance for the task at hand.In this paper, we design three transfer learning pipelines for plankton image classification, with the aim of comparing in-domain and out-of-domain transfer learning on three popular benchmark plankton datasets. The general framework consists in fine-tuning a pre-trained model on a plankton target dataset. In the first pipeline, the model is pre-trained from scratch on a large-scale plankton dataset, in the second, it is pre-trained on large-scale natural image datasets (ImageNet1K or ImageNet22K), while in the third, a two-stage fine-tuning is implemented (ImageNet [Formula: see text] large-scale plankton dataset [Formula: see text] target plankton dataset). Our results show that an out-of-domain ImageNet22K pre-training outperforms the plankton in-domain ones, with an average boost in test accuracy of around 6%. In the next part of this work, we adopt three ImageNet22k pre-trained Vision Transformers and one ConvNeXt, obtaining results on par (or slightly superior) with the state-of-the-art, corresponding to the usage of CNN models ensembles, with a single model. Finally, we design and test an ensemble of our Vision Transformers and the ConvNeXt, outperforming the state-of-the-art existing works on plankton image classification on the three target datasets. To support scientific community contribution and further research, our implemented code is open-source and available at https://github.com/Malga-Vision/plankton_transfer .

摘要

浮游微生物在水生食物网中起着巨大的作用。最近,有人提议将浮游生物用作生物传感器,因为它们可以对水生环境的微小干扰做出反应,并产生特定的生理变化,从而导致形态和行为的改变。如今,高分辨率原位自动采集系统的发展使得研究人员能够获得大量的浮游生物图像数据。一个基本的例子是 ZooScan 和 Woods Hole 海洋学研究所 (WHOI) 的数据集,其中包含多达数百万张浮游生物图像。然而,获得无偏注释既费时又费资源,而原位采集的数据集通常存在严重的不平衡,只有少数几种图像适用于几种物种。迁移学习是解决这些挑战的一种流行方法,其中 ImageNet1K 是最常用的预训练源数据集。另一方面,像 ZooScan 和 WHOI 这样的数据集可能代表了一个有价值的机会,可以比较手头任务的域外和大规模浮游生物的域外和大规模源数据集的性能。在本文中,我们设计了三个用于浮游生物图像分类的迁移学习管道,目的是在三个流行的浮游生物基准数据集上比较域内和域外迁移学习。总体框架包括在浮游生物目标数据集上微调预先训练好的模型。在第一个管道中,模型从头开始在大规模浮游生物数据集上进行预训练,在第二个管道中,模型在大规模自然图像数据集(ImageNet1K 或 ImageNet22K)上进行预训练,而在第三个管道中,执行两阶段微调(ImageNet [Formula: see text] 大规模浮游生物数据集 [Formula: see text] 目标浮游生物数据集)。我们的结果表明,域外的 ImageNet22K 预训练优于域内的预训练,测试准确率平均提高了约 6%。在这项工作的下一部分中,我们采用了三个 ImageNet22k 预训练的 Vision Transformers 和一个 ConvNeXt,并在三个目标数据集上获得了与最先进的结果相当(或略优)的结果,对应于使用 CNN 模型集成,而仅使用单个模型。最后,我们设计并测试了我们的 Vision Transformers 和 ConvNeXt 的集成,在三个目标数据集上的浮游生物图像分类方面优于现有的最先进工作。为了支持科学界的贡献和进一步的研究,我们实现的代码是开源的,并可在 https://github.com/Malga-Vision/plankton_transfer 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0210/10300094/24a1ce13b56e/41598_2023_37627_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0210/10300094/deb3e29197c3/41598_2023_37627_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0210/10300094/24a1ce13b56e/41598_2023_37627_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0210/10300094/deb3e29197c3/41598_2023_37627_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0210/10300094/24a1ce13b56e/41598_2023_37627_Fig2_HTML.jpg

相似文献

1
In-domain versus out-of-domain transfer learning in plankton image classification.基于领域内与领域外的浮游生物图像分类迁移学习。
Sci Rep. 2023 Jun 27;13(1):10443. doi: 10.1038/s41598-023-37627-7.
2
EcoTransLearn: an R-package to easily use transfer learning for ecological studies-a plankton case study.EcoTransLearn:一个用于生态研究的易于使用迁移学习的 R 包——以浮游生物案例研究为例。
Bioinformatics. 2022 Dec 13;38(24):5469-5471. doi: 10.1093/bioinformatics/btac703.
3
Combining weakly and strongly supervised learning improves strong supervision in Gleason pattern classification.弱监督和强监督学习的结合提高了 Gleason 模式分类中的强监督。
BMC Med Imaging. 2021 May 8;21(1):77. doi: 10.1186/s12880-021-00609-0.
4
Foundation models in gastrointestinal endoscopic AI: Impact of architecture, pre-training approach and data efficiency.胃肠道内镜 AI 中的基础模型:架构、预训练方法和数据效率的影响。
Med Image Anal. 2024 Dec;98:103298. doi: 10.1016/j.media.2024.103298. Epub 2024 Aug 12.
5
A real use case of semi-supervised learning for mammogram classification in a local clinic of Costa Rica.半监督学习在哥斯达黎加当地诊所的乳房 X 光分类中的实际应用案例。
Med Biol Eng Comput. 2022 Apr;60(4):1159-1175. doi: 10.1007/s11517-021-02497-6. Epub 2022 Mar 3.
6
TEM virus images: Benchmark dataset and deep learning classification.TEM 病毒图像:基准数据集和深度学习分类。
Comput Methods Programs Biomed. 2021 Sep;209:106318. doi: 10.1016/j.cmpb.2021.106318. Epub 2021 Jul 29.
7
Automatic plankton image classification combining multiple view features via multiple kernel learning.基于多核学习的多视角特征融合浮游生物图像自动分类
BMC Bioinformatics. 2017 Dec 28;18(Suppl 16):570. doi: 10.1186/s12859-017-1954-8.
8
Targeted transfer learning to improve performance in small medical physics datasets.靶向迁移学习以提高小型医学物理数据集的性能。
Med Phys. 2020 Dec;47(12):6246-6256. doi: 10.1002/mp.14507. Epub 2020 Oct 25.
9
Synthesizing CT images from MR images with deep learning: model generalization for different datasets through transfer learning.深度学习合成 CT 图像从磁共振图像:通过迁移学习实现不同数据集的模型泛化。
Biomed Phys Eng Express. 2021 Feb 24;7(2). doi: 10.1088/2057-1976/abe3a7.
10
Analyzing Transfer Learning of Vision Transformers for Interpreting Chest Radiography.分析视觉Transformer在解读胸部 X 光片方面的迁移学习。
J Digit Imaging. 2022 Dec;35(6):1445-1462. doi: 10.1007/s10278-022-00666-z. Epub 2022 Jul 11.

引用本文的文献

1
Application of open domain adaptive models in image annotation and classification.开放域自适应模型在图像标注与分类中的应用。
PLoS One. 2025 May 14;20(5):e0322836. doi: 10.1371/journal.pone.0322836. eCollection 2025.
2
Scaling down annotation needs: The capacity of self-supervised learning on diatom classification.缩减注释需求:硅藻分类中自监督学习的能力
iScience. 2025 Mar 20;28(4):112236. doi: 10.1016/j.isci.2025.112236. eCollection 2025 Apr 18.

本文引用的文献

1
Deep Learning Classification of Lake Zooplankton.湖泊浮游动物的深度学习分类
Front Microbiol. 2021 Nov 15;12:746297. doi: 10.3389/fmicb.2021.746297. eCollection 2021.
2
Fully automatic detection and classification of phytoplankton specimens in digital microscopy images.数字显微镜图像中浮游植物标本的全自动检测与分类。
Comput Methods Programs Biomed. 2021 Mar;200:105923. doi: 10.1016/j.cmpb.2020.105923. Epub 2021 Jan 15.
3
Annotation-free learning of plankton for classification and anomaly detection.无注释浮游生物学习用于分类和异常检测。
Sci Rep. 2020 Jul 22;10(1):12142. doi: 10.1038/s41598-020-68662-3.
4
MorphoCluster: Efficient Annotation of Plankton Images by Clustering.MorphoCluster:通过聚类实现浮游生物图像的高效标注。
Sensors (Basel). 2020 May 28;20(11):3060. doi: 10.3390/s20113060.
5
Automatic plankton image classification combining multiple view features via multiple kernel learning.基于多核学习的多视角特征融合浮游生物图像自动分类
BMC Bioinformatics. 2017 Dec 28;18(Suppl 16):570. doi: 10.1186/s12859-017-1954-8.
6
Deep learning.深度学习。
Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.
7
Global phytoplankton decline over the past century.过去一个世纪全球浮游植物减少。
Nature. 2010 Jul 29;466(7306):591-6. doi: 10.1038/nature09268.
8
Biospheric primary production during an ENSO transition.厄尔尼诺-南方涛动(ENSO)转变期间的生物圈初级生产。
Science. 2001 Mar 30;291(5513):2594-7. doi: 10.1126/science.1055071.