• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多模态辣椒病虫害检测框架。

A multimodal framework for pepper diseases and pests detection.

机构信息

Shandong Provincial University Laboratory for Protected Horticulture, Weifang University of Science and Technology, Weifang, China.

出版信息

Sci Rep. 2024 Nov 22;14(1):28973. doi: 10.1038/s41598-024-80675-w.

DOI:10.1038/s41598-024-80675-w
PMID:39578611
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11584720/
Abstract

Pepper diseases and pests typically exhibit small target proportions, diverse shapes and sizes, complex imaging backgrounds, and similarities with the background. Existing detection methods perform poorly in identifying targets of different sizes and shapes within the same scene, and they lack adequate noise suppression capabilities. To address the practical needs of detecting pepper diseases and pests in complex scenarios, we have constructed the first multimodal pepper diseases and pests object detection dataset (PDD). This dataset includes a wide variety of diseases and pests images, along with detailed natural language descriptions of their attributes. Locating the described targets in complex scenes with similar disease symptoms and leaf occlusion presents a significant challenge. To tackle this issue, we propose the PepperNet model for object detection in pepper diseases and pests images using natural language descriptions. This model decomposes complex multimodal features of language and images into explicit attribute features and employs fine-grained multimodal attribute contrast learning strategies. This approach effectively distinguishes subtle local differences between similar objects, achieving fine-grained mapping from language to vision in complex scenarios. Our detection results show a mAP@0.5 of 91.93% and a detection speed of 121.8 frames per second. Visualizations indicate that the model maintains high robustness under varying noise levels and occlusion conditions, demonstrating superior performance and stability across diverse complex scenarios.

摘要

辣椒病虫害通常表现为小目标比例、形状和大小多样、成像背景复杂以及与背景相似。现有的检测方法在识别同一场景中不同大小和形状的目标方面表现不佳,并且缺乏足够的噪声抑制能力。为了满足在复杂场景中检测辣椒病虫害的实际需求,我们构建了第一个多模态辣椒病虫害目标检测数据集(PDD)。该数据集包含各种病虫害图像,以及对其属性的详细自然语言描述。在具有相似病害症状和叶片遮挡的复杂场景中定位描述的目标是一项重大挑战。为了解决这个问题,我们提出了使用自然语言描述的 PepperNet 模型来进行辣椒病虫害图像中的目标检测。该模型将语言和图像的复杂多模态特征分解为显式属性特征,并采用细粒度的多模态属性对比学习策略。这种方法有效地区分了相似物体之间细微的局部差异,实现了复杂场景中从语言到视觉的精细映射。我们的检测结果表明,mAP@0.5 为 91.93%,检测速度为 121.8 帧/秒。可视化结果表明,该模型在不同噪声水平和遮挡条件下保持了较高的鲁棒性,在各种复杂场景下表现出优异的性能和稳定性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/ddbf3d76e9bd/41598_2024_80675_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/e4d14688c807/41598_2024_80675_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/c83ab06fcafc/41598_2024_80675_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/081a2a3ac857/41598_2024_80675_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/54c23dbc4f53/41598_2024_80675_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/cb8901abc3f6/41598_2024_80675_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/6726511d06ae/41598_2024_80675_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/ba8f6c738c4f/41598_2024_80675_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/646593319ae5/41598_2024_80675_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/a5753e8a8dff/41598_2024_80675_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/93bc16506c8a/41598_2024_80675_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/de83c32e21f3/41598_2024_80675_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/f9b3b6483bb4/41598_2024_80675_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/ddbf3d76e9bd/41598_2024_80675_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/e4d14688c807/41598_2024_80675_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/c83ab06fcafc/41598_2024_80675_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/081a2a3ac857/41598_2024_80675_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/54c23dbc4f53/41598_2024_80675_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/cb8901abc3f6/41598_2024_80675_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/6726511d06ae/41598_2024_80675_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/ba8f6c738c4f/41598_2024_80675_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/646593319ae5/41598_2024_80675_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/a5753e8a8dff/41598_2024_80675_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/93bc16506c8a/41598_2024_80675_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/de83c32e21f3/41598_2024_80675_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/f9b3b6483bb4/41598_2024_80675_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14cf/11584720/ddbf3d76e9bd/41598_2024_80675_Fig13_HTML.jpg

相似文献

1
A multimodal framework for pepper diseases and pests detection.多模态辣椒病虫害检测框架。
Sci Rep. 2024 Nov 22;14(1):28973. doi: 10.1038/s41598-024-80675-w.
2
Early real-time detection algorithm of tomato diseases and pests in the natural environment.自然环境下番茄病虫害的早期实时检测算法
Plant Methods. 2021 Apr 23;17(1):43. doi: 10.1186/s13007-021-00745-2.
3
An Improved YOLOv8-Based Method for Detecting Pests and Diseases on Cucumber Leaves in Natural Backgrounds.一种改进的基于YOLOv8的自然背景下黄瓜叶片病虫害检测方法。
Sensors (Basel). 2025 Mar 2;25(5):1551. doi: 10.3390/s25051551.
4
CTDUNet: A Multimodal CNN-Transformer Dual U-Shaped Network with Coordinate Space Attention for Pests and Diseases Segmentation in Complex Environments.CTDUNet:一种用于复杂环境中病虫害分割的具有坐标空间注意力的多模态卷积神经网络-Transformer双U型网络
Plants (Basel). 2024 Aug 15;13(16):2274. doi: 10.3390/plants13162274.
5
Detecting tea tree pests in complex backgrounds using a hybrid architecture guided by transformers and multi-scale attention mechanism.基于 Transformer 和多尺度注意力机制的混合架构检测复杂背景下的茶树害虫。
J Sci Food Agric. 2024 Apr;104(6):3570-3584. doi: 10.1002/jsfa.13241. Epub 2024 Jan 20.
6
MMAgentRec, a personalized multi-modal recommendation agent with large language model.MMAgentRec,一个带有大语言模型的个性化多模态推荐代理。
Sci Rep. 2025 Apr 8;15(1):12062. doi: 10.1038/s41598-025-96458-w.
7
A High-Precision Identification Method for Maize Leaf Diseases and Pests Based on LFMNet under Complex Backgrounds.一种基于LFMNet的复杂背景下玉米叶病虫害高精度识别方法
Plants (Basel). 2024 Jul 3;13(13):1827. doi: 10.3390/plants13131827.
8
LangLoc: Language-Driven Localization via Formatted Spatial Description Generation.LangLoc:通过格式化空间描述生成实现语言驱动的本地化
IEEE Trans Image Process. 2025;34:1737-1752. doi: 10.1109/TIP.2025.3546853. Epub 2025 Mar 20.
9
A Robust Deep-Learning-Based Detector for Real-Time Tomato Plant Diseases and Pests Recognition.基于深度学习的实时番茄病虫害识别稳健探测器。
Sensors (Basel). 2017 Sep 4;17(9):2022. doi: 10.3390/s17092022.
10
Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.使用卷积神经网络和VGG16在磁共振成像(MRI)中进行脑肿瘤分割与检测
Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.

引用本文的文献

1
Audio signal analysis using a modified forward-forward algorithm with enhanced segmentation for soil pest detection.使用改进的前向算法和增强分割进行音频信号分析以检测土壤害虫。
Sci Rep. 2025 Aug 27;15(1):31542. doi: 10.1038/s41598-025-15770-7.
2
VGG-EffAttnNet: Hybrid Deep Learning Model for Automated Chili Plant Disease Classification Using VGG16 and EfficientNetB0 With Attention Mechanism.VGG-EffAttnNet:使用VGG16和带有注意力机制的EfficientNetB0进行辣椒植物病害自动分类的混合深度学习模型
Food Sci Nutr. 2025 Jul 24;13(7):e70653. doi: 10.1002/fsn3.70653. eCollection 2025 Jul.
3
Fine-grained crop pest classification based on multi-scale feature fusion and mixed attention mechanisms.

本文引用的文献

1
BAF-Net: Bidirectional attention fusion network CNN and transformers for the pepper leaf segmentation.BAF-Net:用于辣椒叶片分割的双向注意力融合网络(结合卷积神经网络和变换器)
Front Plant Sci. 2023 Mar 27;14:1123410. doi: 10.3389/fpls.2023.1123410. eCollection 2023.
2
A Comprehensive Review of Pesticide Residues in Peppers.辣椒中农药残留的综合综述
Foods. 2023 Feb 24;12(5):970. doi: 10.3390/foods12050970.
3
A Vegetable Leaf Disease Identification Model Based on Image-Text Cross-Modal Feature Fusion.基于图像-文本跨模态特征融合的蔬菜叶片病害识别模型
基于多尺度特征融合和混合注意力机制的细粒度农作物害虫分类
Front Plant Sci. 2025 Apr 3;16:1500571. doi: 10.3389/fpls.2025.1500571. eCollection 2025.
Front Plant Sci. 2022 Jun 24;13:918940. doi: 10.3389/fpls.2022.918940. eCollection 2022.