• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DiffusionFR:通过扩散与注意力实现模糊场景下鱼类的物种识别

DiffusionFR: Species Recognition of Fish in Blurry Scenarios via Diffusion and Attention.

作者信息

Wang Guoying, Shi Bing, Yi Xiaomei, Wu Peng, Kong Linjun, Mo Lufeng

机构信息

College of Mathematics and Computer Science, Zhejiang A&F University, Hangzhou 311300, China.

Office of Information Technology, Zhejiang University of Finance & Economics, Hangzhou 310018, China.

出版信息

Animals (Basel). 2024 Feb 2;14(3):499. doi: 10.3390/ani14030499.

DOI:10.3390/ani14030499
PMID:38338141
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10854938/
Abstract

Blurry scenarios, such as light reflections and water ripples, often affect the clarity and signal-to-noise ratio of fish images, posing significant challenges for traditional deep learning models in accurately recognizing fish species. Firstly, deep learning models rely on a large amount of labeled data. However, it is often difficult to label data in blurry scenarios. Secondly, existing deep learning models need to be more effective for the processing of bad, blurry, and otherwise inadequate images, which is an essential reason for their low recognition rate. A method based on the diffusion model and attention mechanism for fish image recognition in blurry scenarios, DiffusionFR, is proposed to solve these problems and improve the performance of species recognition of fish images in blurry scenarios. This paper presents the selection and application of this correcting technique. In the method, DiffusionFR, a two-stage diffusion network model, TSD, is designed to deblur bad, blurry, and otherwise inadequate fish scene pictures to restore clarity, and a learnable attention module, LAM, is intended to improve the accuracy of fish recognition. In addition, a new dataset of fish images in blurry scenarios, BlurryFish, was constructed and used to validate the effectiveness of DiffusionFR, combining bad, blurry, and otherwise inadequate images from the publicly available dataset Fish4Knowledge. The experimental results demonstrate that DiffusionFR achieves outstanding performance on various datasets. On the original dataset, DiffusionFR achieved the highest training accuracy of 97.55%, as well as a Top-1 accuracy test score of 92.02% and a Top-5 accuracy test score of 95.17%. Furthermore, on nine datasets with light reflection noise, the mean values of training accuracy reached a peak at 96.50%, while the mean values of the Top-1 accuracy test and Top-5 accuracy test were at their highest at 90.96% and 94.12%, respectively. Similarly, on three datasets with water ripple noise, the mean values of training accuracy reached a peak at 95.00%, while the mean values of the Top-1 accuracy test and Top-5 accuracy test were at their highest at 89.54% and 92.73%, respectively. These results demonstrate that the method showcases superior accuracy and enhanced robustness in handling original datasets and datasets with light reflection and water ripple noise.

摘要

模糊场景,如光反射和水波涟漪,常常影响鱼类图像的清晰度和信噪比,给传统深度学习模型准确识别鱼类物种带来重大挑战。首先,深度学习模型依赖大量标注数据。然而,在模糊场景下标注数据往往很困难。其次,现有的深度学习模型在处理质量差、模糊及其他不充分的图像方面效果欠佳,这是其识别率低的一个重要原因。为了解决这些问题并提高模糊场景下鱼类图像的物种识别性能,提出了一种基于扩散模型和注意力机制的模糊场景鱼类图像识别方法DiffusionFR。本文介绍了这种校正技术的选择与应用。在DiffusionFR方法中,设计了一个两阶段扩散网络模型TSD,用于对质量差、模糊及其他不充分的鱼类场景图片进行去模糊处理以恢复清晰度,还设计了一个可学习的注意力模块LAM,旨在提高鱼类识别的准确性。此外,构建了一个新的模糊场景鱼类图像数据集BlurryFish,并结合公开可用数据集Fish4Knowledge中的质量差、模糊及其他不充分的图像,用于验证DiffusionFR的有效性。实验结果表明,DiffusionFR在各种数据集上都取得了优异的性能。在原始数据集上,DiffusionFR的训练准确率最高达到97.55%,Top-1准确率测试得分92.02%,Top-5准确率测试得分95.17%。此外,在九个存在光反射噪声的数据集上,训练准确率的平均值在96.57%时达到峰值,而Top-1准确率测试和Top-5准确率测试的平均值分别在90.96%和94.12%时达到最高。同样,在三个存在水波涟漪噪声的数据集上,训练准确率的平均值在95.00%时达到峰值,而Top-1准确率测试和Top-5准确率测试的平均值分别在89.54%和92.73%时达到最高。这些结果表明,该方法在处理原始数据集以及存在光反射和水波涟漪噪声的数据集时,展现出了卓越的准确性和更强的鲁棒性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/3ad819326a00/animals-14-00499-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/64c43983428a/animals-14-00499-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/06aa5b0358c7/animals-14-00499-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/4f1241975538/animals-14-00499-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/a88c0dfa3bbe/animals-14-00499-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/c75a31bbc7ef/animals-14-00499-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/e934e813b6a2/animals-14-00499-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/82d691dadded/animals-14-00499-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/eb30694f58f8/animals-14-00499-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/5818d7f3f742/animals-14-00499-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/b25f80eba1f0/animals-14-00499-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/3ad819326a00/animals-14-00499-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/64c43983428a/animals-14-00499-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/06aa5b0358c7/animals-14-00499-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/4f1241975538/animals-14-00499-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/a88c0dfa3bbe/animals-14-00499-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/c75a31bbc7ef/animals-14-00499-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/e934e813b6a2/animals-14-00499-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/82d691dadded/animals-14-00499-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/eb30694f58f8/animals-14-00499-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/5818d7f3f742/animals-14-00499-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/b25f80eba1f0/animals-14-00499-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6705/10854938/3ad819326a00/animals-14-00499-g011.jpg

相似文献

1
DiffusionFR: Species Recognition of Fish in Blurry Scenarios via Diffusion and Attention.DiffusionFR:通过扩散与注意力实现模糊场景下鱼类的物种识别
Animals (Basel). 2024 Feb 2;14(3):499. doi: 10.3390/ani14030499.
2
Deep Deblurring in Teledermatology: Deep Learning Models Restore the Accuracy of Blurry Images' Classification.远程皮肤病学中的深度去模糊:深度学习模型提高模糊图像分类的准确性。
Telemed J E Health. 2024 Sep;30(9):2477-2482. doi: 10.1089/tmj.2023.0703. Epub 2024 Jun 27.
3
A Metric-Based Few-Shot Learning Method for Fish Species Identification with Limited Samples.一种基于度量的少样本学习方法用于有限样本下的鱼类物种识别
Animals (Basel). 2024 Feb 28;14(5):755. doi: 10.3390/ani14050755.
4
A Bayesian Scene-Prior-Based Deep Network Model for Face Verification.基于贝叶斯场景先验的深度网络人脸验证模型。
Sensors (Basel). 2018 Jun 11;18(6):1906. doi: 10.3390/s18061906.
5
Domain-adaptive denoising network for low-dose CT via noise estimation and transfer learning.基于噪声估计和迁移学习的适用于低剂量 CT 的域自适应去噪网络。
Med Phys. 2023 Jan;50(1):74-88. doi: 10.1002/mp.15952. Epub 2022 Sep 2.
6
Self-supervised structural similarity-based convolutional neural network for cardiac diffusion tensor image denoising.基于自监督结构相似性的卷积神经网络用于心脏扩散张量图像去噪
Med Phys. 2023 Oct;50(10):6137-6150. doi: 10.1002/mp.16301. Epub 2023 Apr 17.
7
TEM virus images: Benchmark dataset and deep learning classification.TEM 病毒图像:基准数据集和深度学习分类。
Comput Methods Programs Biomed. 2021 Sep;209:106318. doi: 10.1016/j.cmpb.2021.106318. Epub 2021 Jul 29.
8
CLIB: Contrastive learning of ignoring background for underwater fish image classification.CLIB:用于水下鱼类图像分类的忽略背景的对比学习
Front Neurorobot. 2024 Jul 31;18:1423848. doi: 10.3389/fnbot.2024.1423848. eCollection 2024.
9
Detection of ERBB2 and CEN17 signals in fluorescent in situ hybridization and dual in situ hybridization for guiding breast cancer HER2 target therapy.荧光原位杂交和双荧光原位杂交检测 ERBB2 和 CEN17 信号指导乳腺癌 HER2 靶向治疗。
Artif Intell Med. 2023 Jul;141:102568. doi: 10.1016/j.artmed.2023.102568. Epub 2023 May 4.
10
An Improved EfficientNetV2 Model Based on Visual Attention Mechanism: Application to Identification of Cassava Disease.基于视觉注意机制的改进型 EfficientNetV2 模型:在木薯病害识别中的应用。
Comput Intell Neurosci. 2022 Aug 5;2022:1569911. doi: 10.1155/2022/1569911. eCollection 2022.

本文引用的文献

1
Estimation of rice seedling growth traits with an end-to-end multi-objective deep learning framework.使用端到端多目标深度学习框架估计水稻幼苗生长性状。
Front Plant Sci. 2023 Jun 2;14:1165552. doi: 10.3389/fpls.2023.1165552. eCollection 2023.
2
Neuro-heuristic analysis of surveillance video in a centralized IoT system.集中式物联网系统中监控视频的神经启发式分析
ISA Trans. 2023 Sep;140:402-411. doi: 10.1016/j.isatra.2023.05.024. Epub 2023 Jun 2.
3
U-Shape Transformer for Underwater Image Enhancement.U 型变换在水下图像增强中的应用。
IEEE Trans Image Process. 2023;32:3066-3079. doi: 10.1109/TIP.2023.3276332. Epub 2023 May 30.
4
Invertible Residual Blocks in Deep Learning Networks.深度学习网络中的可逆残差块
IEEE Trans Neural Netw Learn Syst. 2024 Jul;35(7):10167-10173. doi: 10.1109/TNNLS.2023.3238397. Epub 2024 Jul 8.
5
Image restoration for blurry optical images caused by photon diffusion with deep learning.利用深度学习对光子扩散导致的模糊光学图像进行图像复原。
J Opt Soc Am A Opt Image Sci Vis. 2023 Jan 1;40(1):96-107. doi: 10.1364/JOSAA.475890.
6
Class-Aware Fish Species Recognition Using Deep Learning for an Imbalanced Dataset.使用深度学习对不均衡数据集进行类别感知的鱼类物种识别
Sensors (Basel). 2022 Oct 28;22(21):8268. doi: 10.3390/s22218268.
7
Targeted Data Augmentation and Hierarchical Classification with Deep Learning for Fish Species Identification in Underwater Images.基于深度学习的水下图像鱼类物种识别中的目标数据增强与层次分类
J Imaging. 2022 Aug 1;8(8):214. doi: 10.3390/jimaging8080214.
8
Applying Image Recognition and Tracking Methods for Fish Physiology Detection Based on a Visual Sensor.基于视觉传感器的鱼类生理学检测的图像识别与跟踪方法应用
Sensors (Basel). 2022 Jul 25;22(15):5545. doi: 10.3390/s22155545.
9
The movement ecology of fishes.鱼类运动生态学。
J Fish Biol. 2022 Oct;101(4):756-779. doi: 10.1111/jfb.15153. Epub 2022 Jul 25.
10
Leveraging social media and deep learning to detect rare megafauna in video surveys.利用社交媒体和深度学习技术在视频调查中检测珍稀大型动物。
Conserv Biol. 2022 Feb;36(1):e13798. doi: 10.1111/cobi.13798. Epub 2021 Aug 6.