• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于图像-文本跨模态特征融合的蔬菜叶片病害识别模型

A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion.

作者信息

Feng Xuguang, Zhao Chunjiang, Wang Chunshan, Wu Huarui, Miao Yisheng, Zhang Jingjian

机构信息

School of Information Science and Technology, Hebei Agricultural University, Baoding, China.

National Engineering Research Center for Information Technology in Agriculture, Beijing, China.

出版信息

Front Plant Sci. 2022 Jun 24;13:918940. doi: 10.3389/fpls.2022.918940. eCollection 2022.

DOI:10.3389/fpls.2022.918940
PMID:35812910
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9263697/
Abstract

In view of the differences in appearance and the complex backgrounds of crop diseases, automatic identification of field diseases is an extremely challenging topic in smart agriculture. To address this challenge, a popular approach is to design a Deep Convolutional Neural Network (DCNN) model that extracts visual disease features in the images and then identifies the diseases based on the extracted features. This approach performs well under simple background conditions, but has low accuracy and poor robustness under complex backgrounds. In this paper, an end-to-end disease identification model composed of a disease-spot region detector and a disease classifier (YOLOv5s + BiCMT) was proposed. Specifically, the YOLOv5s network was used to detect the disease-spot regions so as to provide a regional attention mechanism to facilitate the disease identification task of the classifier. For the classifier, a Bidirectional Cross-Modal Transformer (BiCMT) model combining the image and text modal information was constructed, which utilizes the correlation and complementarity between the features of the two modalities to achieve the fusion and recognition of disease features. Meanwhile, the problem of inconsistent lengths among different modal data sequences was solved. Eventually, the YOLOv5s + BiCMT model achieved the optimal results on a small dataset. Its Accuracy, Precision, Sensitivity, and Specificity reached 99.23, 97.37, 97.54, and 99.54%, respectively. This paper proves that the bidirectional cross-modal feature fusion by combining disease images and texts is an effective method to identify vegetable diseases in field environments.

摘要

鉴于作物病害外观的差异和复杂背景,田间病害的自动识别是智慧农业中极具挑战性的课题。为应对这一挑战,一种常用方法是设计深度卷积神经网络(DCNN)模型,该模型在图像中提取视觉病害特征,然后基于提取的特征识别病害。这种方法在简单背景条件下表现良好,但在复杂背景下准确率低且鲁棒性差。本文提出了一种由病斑区域检测器和病害分类器(YOLOv5s + BiCMT)组成的端到端病害识别模型。具体而言,使用YOLOv5s网络检测病斑区域,以提供区域注意力机制,便于分类器的病害识别任务。对于分类器,构建了一个结合图像和文本模态信息的双向跨模态Transformer(BiCMT)模型,该模型利用两种模态特征之间的相关性和互补性来实现病害特征的融合与识别。同时,解决了不同模态数据序列长度不一致的问题。最终,YOLOv5s + BiCMT模型在一个小数据集上取得了最优结果。其准确率、精确率、灵敏度和特异性分别达到了99.23%、97.37%、97.54%和99.54%。本文证明了结合病害图像和文本进行双向跨模态特征融合是识别田间环境中蔬菜病害的有效方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2379/9263697/e961c7cff51e/fpls-13-918940-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2379/9263697/cfa475e61666/fpls-13-918940-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2379/9263697/ba24b187f290/fpls-13-918940-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2379/9263697/13f248a895dd/fpls-13-918940-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2379/9263697/33f5b2fd830b/fpls-13-918940-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2379/9263697/934ae37a4a06/fpls-13-918940-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2379/9263697/14ef58eea131/fpls-13-918940-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2379/9263697/e961c7cff51e/fpls-13-918940-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2379/9263697/cfa475e61666/fpls-13-918940-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2379/9263697/ba24b187f290/fpls-13-918940-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2379/9263697/13f248a895dd/fpls-13-918940-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2379/9263697/33f5b2fd830b/fpls-13-918940-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2379/9263697/934ae37a4a06/fpls-13-918940-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2379/9263697/14ef58eea131/fpls-13-918940-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2379/9263697/e961c7cff51e/fpls-13-918940-g007.jpg

相似文献

1
A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion.基于图像-文本跨模态特征融合的蔬菜叶片病害识别模型
Front Plant Sci. 2022 Jun 24;13:918940. doi: 10.3389/fpls.2022.918940. eCollection 2022.
2
A Plant Disease Recognition Method Based on Fusion of Images and Graph Structure Text.一种基于图像与图结构文本融合的植物病害识别方法。
Front Plant Sci. 2022 Jan 14;12:731688. doi: 10.3389/fpls.2021.731688. eCollection 2021.
3
Electromagnetic Modulation Signal Classification Using Dual-Modal Feature Fusion CNN.基于双模态特征融合卷积神经网络的电磁调制信号分类
Entropy (Basel). 2022 May 15;24(5):700. doi: 10.3390/e24050700.
4
Few-shot cotton leaf spots disease classification based on metric learning.基于度量学习的少样本棉花叶斑病分类
Plant Methods. 2021 Nov 8;17(1):114. doi: 10.1186/s13007-021-00813-7.
5
Automated multi-modal Transformer network (AMTNet) for 3D medical images segmentation.用于3D医学图像分割的自动多模态Transformer网络(AMTNet)。
Phys Med Biol. 2023 Jan 9;68(2). doi: 10.1088/1361-6560/aca74c.
6
Detection of citrus diseases in complex backgrounds based on image-text multimodal fusion and knowledge assistance.基于图像-文本多模态融合与知识辅助的复杂背景下柑橘病害检测
Front Plant Sci. 2023 Nov 27;14:1280365. doi: 10.3389/fpls.2023.1280365. eCollection 2023.
7
Transformer-Based Multi-Modal Data Fusion Method for COPD Classification and Physiological and Biochemical Indicators Identification.基于Transformer的慢性阻塞性肺疾病分类及生理生化指标识别的多模态数据融合方法
Biomolecules. 2023 Sep 15;13(9):1391. doi: 10.3390/biom13091391.
8
A Multi-Modal Convolutional Neural Network Model for Intelligent Analysis of the Influence of Music Genres on Children's Emotions.一种用于智能分析音乐流派对儿童情绪影响的多模态卷积神经网络模型。
Comput Intell Neurosci. 2022 Jul 19;2022:4957085. doi: 10.1155/2022/4957085. eCollection 2022.
9
High-Accuracy Tomato Leaf Disease Image-Text Retrieval Method Utilizing LAFANet.基于LAFANet的高精度番茄叶部病害图像-文本检索方法
Plants (Basel). 2024 Apr 23;13(9):1176. doi: 10.3390/plants13091176.
10
DMC-Fusion: Deep Multi-Cascade Fusion With Classifier-Based Feature Synthesis for Medical Multi-Modal Images.DMC-Fusion:基于分类器的特征合成的医学多模态图像深度多级联融合。
IEEE J Biomed Health Inform. 2021 Sep;25(9):3438-3449. doi: 10.1109/JBHI.2021.3083752. Epub 2021 Sep 3.

引用本文的文献

1
A multimodal framework for pepper diseases and pests detection.多模态辣椒病虫害检测框架。
Sci Rep. 2024 Nov 22;14(1):28973. doi: 10.1038/s41598-024-80675-w.
2
Multisource information fusion method for vegetable disease detection.蔬菜病害检测的多源信息融合方法。
BMC Plant Biol. 2024 Aug 2;24(1):738. doi: 10.1186/s12870-024-05346-4.
3
Detection of citrus diseases in complex backgrounds based on image-text multimodal fusion and knowledge assistance.基于图像-文本多模态融合与知识辅助的复杂背景下柑橘病害检测

本文引用的文献

1
IoT and Interpretable Machine Learning Based Framework for Disease Prediction in Pearl Millet.基于物联网和可解释机器学习的珍珠粟疾病预测框架。
Sensors (Basel). 2021 Aug 9;21(16):5386. doi: 10.3390/s21165386.
2
A Survey of Deep Convolutional Neural Networks Applied for Prediction of Plant Leaf Diseases.深度学习卷积神经网络在植物叶片病害预测中的应用综述。
Sensors (Basel). 2021 Jul 12;21(14):4749. doi: 10.3390/s21144749.
3
Early recognition of tomato gray leaf spot disease based on MobileNetv2-YOLOv3 model.基于MobileNetv2-YOLOv3模型的番茄灰叶斑病早期识别
Front Plant Sci. 2023 Nov 27;14:1280365. doi: 10.3389/fpls.2023.1280365. eCollection 2023.
4
Maize seed appearance quality assessment based on improved Inception-ResNet.基于改进的Inception-ResNet的玉米种子外观质量评估
Front Plant Sci. 2023 Aug 24;14:1249989. doi: 10.3389/fpls.2023.1249989. eCollection 2023.
5
Unstructured road extraction and roadside fruit recognition in grape orchards based on a synchronous detection algorithm.基于同步检测算法的葡萄园非结构化道路提取与路边果实识别
Front Plant Sci. 2023 Jun 2;14:1103276. doi: 10.3389/fpls.2023.1103276. eCollection 2023.
Plant Methods. 2020 Jun 8;16:83. doi: 10.1186/s13007-020-00624-2. eCollection 2020.
4
Multimodal Transformer for Unaligned Multimodal Language Sequences.用于未对齐多模态语言序列的多模态变换器
Proc Conf Assoc Comput Linguist Meet. 2019 Jul;2019:6558-6569. doi: 10.18653/v1/p19-1656.
5
Identification of Tomato Disease Types and Detection of Infected Areas Based on Deep Convolutional Neural Networks and Object Detection Techniques.基于深度卷积神经网络和目标检测技术的番茄病害类型识别与感病区域检测。
Comput Intell Neurosci. 2019 Dec 16;2019:9142753. doi: 10.1155/2019/9142753. eCollection 2019.
6
Deep Learning for Image-Based Cassava Disease Detection.基于图像的木薯疾病检测的深度学习
Front Plant Sci. 2017 Oct 27;8:1852. doi: 10.3389/fpls.2017.01852. eCollection 2017.
7
A Robust Deep-Learning-Based Detector for Real-Time Tomato Plant Diseases and Pests Recognition.基于深度学习的实时番茄病虫害识别稳健探测器。
Sensors (Basel). 2017 Sep 4;17(9):2022. doi: 10.3390/s17092022.
8
Using Deep Learning for Image-Based Plant Disease Detection.利用深度学习进行基于图像的植物病害检测。
Front Plant Sci. 2016 Sep 22;7:1419. doi: 10.3389/fpls.2016.01419. eCollection 2016.
9
Deep Neural Networks Based Recognition of Plant Diseases by Leaf Image Classification.基于深度神经网络的植物病害叶片图像分类识别
Comput Intell Neurosci. 2016;2016:3289801. doi: 10.1155/2016/3289801. Epub 2016 Jun 22.