• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于LAFANet的高精度番茄叶部病害图像-文本检索方法

High-Accuracy Tomato Leaf Disease Image-Text Retrieval Method Utilizing LAFANet.

作者信息

Xu Jiaxin, Zhou Hongliang, Hu Yufan, Xue Yongfei, Zhou Guoxiong, Li Liujun, Dai Weisi, Li Jinyang

机构信息

College of Computer and Information Engineering, Central South University of Forestry and Technology, Changsha 410004, China.

Department of Soil and Water Systems, University of Idaho, Moscow, ID 83844, USA.

出版信息

Plants (Basel). 2024 Apr 23;13(9):1176. doi: 10.3390/plants13091176.

DOI:10.3390/plants13091176
PMID:38732391
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11085479/
Abstract

Tomato leaf disease control in the field of smart agriculture urgently requires attention and reinforcement. This paper proposes a method called LAFANet for image-text retrieval, which integrates image and text information for joint analysis of multimodal data, helping agricultural practitioners to provide more comprehensive and in-depth diagnostic evidence to ensure the quality and yield of tomatoes. First, we focus on six common tomato leaf disease images and text descriptions, creating a Tomato Leaf Disease Image-Text Retrieval Dataset (TLDITRD), introducing image-text retrieval into the field of tomato leaf disease retrieval. Then, utilizing ViT and BERT models, we extract detailed image features and sequences of textual features, incorporating contextual information from image-text pairs. To address errors in image-text retrieval caused by complex backgrounds, we propose Learnable Fusion Attention (LFA) to amplify the fusion of textual and image features, thereby extracting substantial semantic insights from both modalities. To delve further into the semantic connections across various modalities, we propose a False Negative Elimination-Adversarial Negative Selection (FNE-ANS) approach. This method aims to identify adversarial negative instances that specifically target false negatives within the triplet function, thereby imposing constraints on the model. To bolster the model's capacity for generalization and precision, we propose Adversarial Regularization (AR). This approach involves incorporating adversarial perturbations during model training, thereby fortifying its resilience and adaptability to slight variations in input data. Experimental results show that, compared with existing ultramodern models, LAFANet outperformed existing models on TLDITRD dataset, with top1, top5, and top10 reaching 83.3% and 90.0%, and top1, top5, and top10 reaching 80.3%, 93.7%, and 96.3%. LAFANet offers fresh technical backing and algorithmic insights for the retrieval of tomato leaf disease through image-text correlation.

摘要

智能农业领域的番茄叶病防治迫切需要关注和加强。本文提出了一种名为LAFANet的图像-文本检索方法,该方法整合图像和文本信息以对多模态数据进行联合分析,帮助农业从业者提供更全面、深入的诊断依据,以确保番茄的质量和产量。首先,我们聚焦于六种常见的番茄叶病图像和文本描述,创建了一个番茄叶病图像-文本检索数据集(TLDITRD),将图像-文本检索引入番茄叶病检索领域。然后,利用ViT和BERT模型,我们提取详细的图像特征和文本特征序列,纳入来自图像-文本对的上下文信息。为了解决复杂背景导致的图像-文本检索错误,我们提出了可学习融合注意力(LFA)来增强文本和图像特征的融合,从而从两种模态中提取大量语义见解。为了进一步探究跨模态的语义联系,我们提出了假阴性消除-对抗性负选择(FNE-ANS)方法。该方法旨在识别在三元组函数中专门针对假阴性的对抗性负实例,从而对模型施加约束。为了增强模型的泛化能力和精度,我们提出了对抗性正则化(AR)。这种方法包括在模型训练期间纳入对抗性扰动,从而增强其对输入数据轻微变化的弹性和适应性。实验结果表明,与现有的超现代模型相比,LAFANet在TLDITRD数据集上优于现有模型,top1、top5和top10分别达到83.3%、90.0%,以及80.3%、93.7%和96.3%。LAFANet为通过图像-文本关联检索番茄叶病提供了新的技术支持和算法见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/7afaaea52516/plants-13-01176-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/c1e46a132bed/plants-13-01176-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/264521d88f16/plants-13-01176-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/ac29ed5005e8/plants-13-01176-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/680fffc5922d/plants-13-01176-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/06d9d02314f8/plants-13-01176-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/94c5d0293cc9/plants-13-01176-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/35bf53823b54/plants-13-01176-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/d20f92bbeaf2/plants-13-01176-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/7afaaea52516/plants-13-01176-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/c1e46a132bed/plants-13-01176-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/264521d88f16/plants-13-01176-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/ac29ed5005e8/plants-13-01176-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/680fffc5922d/plants-13-01176-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/06d9d02314f8/plants-13-01176-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/94c5d0293cc9/plants-13-01176-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/35bf53823b54/plants-13-01176-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/d20f92bbeaf2/plants-13-01176-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e61/11085479/7afaaea52516/plants-13-01176-g009.jpg

相似文献

1
High-Accuracy Tomato Leaf Disease Image-Text Retrieval Method Utilizing LAFANet.基于LAFANet的高精度番茄叶部病害图像-文本检索方法
Plants (Basel). 2024 Apr 23;13(9):1176. doi: 10.3390/plants13091176.
2
A Precise Framework for Rice Leaf Disease Image-Text Retrieval Using FHTW-Net.一种基于FHTW-Net的水稻叶部病害图像-文本检索精确框架。
Plant Phenomics. 2024 Apr 25;6:0168. doi: 10.34133/plantphenomics.0168. eCollection 2024.
3
Detection of citrus diseases in complex backgrounds based on image-text multimodal fusion and knowledge assistance.基于图像-文本多模态融合与知识辅助的复杂背景下柑橘病害检测
Front Plant Sci. 2023 Nov 27;14:1280365. doi: 10.3389/fpls.2023.1280365. eCollection 2023.
4
A Multi-Modal Open Object Detection Model for Tomato Leaf Diseases with Strong Generalization Performance Using PDC-VLD.一种基于PDC-VLD的具有强泛化性能的番茄叶部病害多模态开放目标检测模型。
Plant Phenomics. 2024 Aug 13;6:0220. doi: 10.34133/plantphenomics.0220. eCollection 2024.
5
[Cross-modal retrieval method for thyroid ultrasound image and text based on generative adversarial network].基于生成对抗网络的甲状腺超声图像与文本跨模态检索方法
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2020 Aug 25;37(4):641-651. doi: 10.7507/1001-5515.201812042.
6
A lightweight dual-attention network for tomato leaf disease identification.一种用于番茄叶部病害识别的轻量级双注意力网络。
Front Plant Sci. 2024 Aug 6;15:1420584. doi: 10.3389/fpls.2024.1420584. eCollection 2024.
7
An efficient deep learning model for tomato disease detection.一种用于番茄病害检测的高效深度学习模型。
Plant Methods. 2024 May 9;20(1):61. doi: 10.1186/s13007-024-01188-1.
8
Ternary Adversarial Networks With Self-Supervision for Zero-Shot Cross-Modal Retrieval.用于零样本跨模态检索的具有自监督的三元对抗网络。
IEEE Trans Cybern. 2020 Jun;50(6):2400-2413. doi: 10.1109/TCYB.2019.2928180. Epub 2019 Jul 24.
9
Plant leaf disease recognition based on improved SinGAN and improved ResNet34.基于改进的SinGAN和改进的ResNet34的植物叶片病害识别
Front Artif Intell. 2024 Jun 24;7:1414274. doi: 10.3389/frai.2024.1414274. eCollection 2024.
10
A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion.基于图像-文本跨模态特征融合的蔬菜叶片病害识别模型
Front Plant Sci. 2022 Jun 24;13:918940. doi: 10.3389/fpls.2022.918940. eCollection 2022.

引用本文的文献

1
Tomato Leaf Disease Identification Framework FCMNet Based on Multimodal Fusion.基于多模态融合的番茄叶病识别框架FCMNet
Plants (Basel). 2025 Jul 27;14(15):2329. doi: 10.3390/plants14152329.
2
CTDUNet: A Multimodal CNN-Transformer Dual U-Shaped Network with Coordinate Space Attention for Pests and Diseases Segmentation in Complex Environments.CTDUNet:一种用于复杂环境中病虫害分割的具有坐标空间注意力的多模态卷积神经网络-Transformer双U型网络
Plants (Basel). 2024 Aug 15;13(16):2274. doi: 10.3390/plants13162274.

本文引用的文献

1
A Precise Image-Based Tomato Leaf Disease Detection Approach Using PLPNet.一种基于图像的精确番茄叶部病害检测方法——使用PLPNet
Plant Phenomics. 2023 May 12;5:0042. doi: 10.34133/plantphenomics.0042. eCollection 2023.
2
An Effective Image-Based Tomato Leaf Disease Segmentation Method Using MC-UNet.一种基于图像的使用MC-UNet的番茄叶病分割有效方法。
Plant Phenomics. 2023 May 15;5:0049. doi: 10.34133/plantphenomics.0049. eCollection 2023.
3
MultiMAP: dimensionality reduction and integration of multimodal data.MultiMAP:多模态数据的降维和整合。
Genome Biol. 2021 Dec 20;22(1):346. doi: 10.1186/s13059-021-02565-y.
4
Universal Weighting Metric Learning for Cross-Modal Retrieval.通用加权度量学习在跨模态检索中的应用。
IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):6534-6545. doi: 10.1109/TPAMI.2021.3088863. Epub 2022 Sep 14.