• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CTDUNet:一种用于复杂环境中病虫害分割的具有坐标空间注意力的多模态卷积神经网络-Transformer双U型网络

CTDUNet: A Multimodal CNN-Transformer Dual U-Shaped Network with Coordinate Space Attention for Pests and Diseases Segmentation in Complex Environments.

作者信息

Guo Ruitian, Zhang Ruopeng, Zhou Hao, Xie Tunjun, Peng Yuting, Chen Xili, Yu Guo, Wan Fangying, Li Lin, Zhang Yongzhong, Liu Ruifeng

机构信息

School of Electronic Information and Physics, Central South University of Forestry and Technology, Changsha 410004, China.

School of Business, Central South University of Forestry and Technology, Changsha 410004, China.

出版信息

Plants (Basel). 2024 Aug 15;13(16):2274. doi: 10.3390/plants13162274.

DOI:10.3390/plants13162274
PMID:39204710
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11359422/
Abstract

is a crop of high economic value, yet it is particularly susceptible to various diseases and pests that significantly reduce its yield and quality. Consequently, the precise segmentation and classification of diseased Camellia leaves are vital for managing pests and diseases effectively. Deep learning exhibits significant advantages in the segmentation of plant diseases and pests, particularly in complex image processing and automated feature extraction. However, when employing single-modal models to segment diseases, three critical challenges arise: (A) lesions may closely resemble the colors of the complex background; (B) small sections of diseased leaves overlap; (C) the presence of multiple diseases on a single leaf. These factors considerably hinder segmentation accuracy. A novel multimodal model, CNN-Transformer Dual U-shaped Network (CTDUNet), based on a CNN-Transformer architecture, has been proposed to integrate image and text information. This model first utilizes text data to address the shortcomings of single-modal image features, enhancing its ability to distinguish lesions from environmental characteristics, even under conditions where they closely resemble one another. Additionally, we introduce Coordinate Space Attention (CSA), which focuses on the positional relationships between targets, thereby improving the segmentation of overlapping leaf edges. Furthermore, cross-attention (CA) is employed to align image and text features effectively, preserving local information and enhancing the perception and differentiation of various diseases. The CTDUNet model was evaluated on a self-made multimodal dataset compared against several models, including DeeplabV3+, UNet, PSPNet, Segformer, HrNet, and Language meets Vision Transformer (LViT). The experimental results demonstrate that CTDUNet achieved an mean Intersection over Union (mIoU) of 86.14%, surpassing both multimodal models and the best single-modal model by 3.91% and 5.84%, respectively. Additionally, CTDUNet exhibits high balance in the multi-class segmentation of diseases and pests. These results indicate the successful application of fused image and text multimodal information in the segmentation of Camellia disease, achieving outstanding performance.

摘要

茶树是一种具有高经济价值的作物,但它特别容易受到各种病虫害的影响,这些病虫害会显著降低其产量和品质。因此,对患病茶树叶片进行精确的分割和分类对于有效防治病虫害至关重要。深度学习在植物病虫害分割方面具有显著优势,尤其是在复杂图像处理和自动特征提取方面。然而,在使用单模态模型进行病害分割时,会出现三个关键挑战:(A)病斑颜色可能与复杂背景颜色极为相似;(B)患病叶片的小部分相互重叠;(C)单片叶子上存在多种病害。这些因素极大地阻碍了分割精度。基于卷积神经网络-Transformer架构,提出了一种新颖的多模态模型,即卷积神经网络-Transformer双U型网络(CTDUNet),以整合图像和文本信息。该模型首先利用文本数据来弥补单模态图像特征的不足,增强其在病斑与环境特征极为相似的情况下区分病斑与环境特征之间的能力。此外,我们引入了坐标空间注意力(CSA),它专注于目标之间的位置关系,从而改善重叠叶片边缘的分割。此外,采用交叉注意力(CA)来有效对齐图像和文本特征,保留局部信息并增强对各种病害的感知和区分能力。CTDUNet模型在一个自制的多模态数据集上与包括DeeplabV3 +、UNet、PSPNet、Segformer、HrNet和语言与视觉Transformer(LViT)在内的多个模型进行了比较评估。实验结果表明,CTDUNet的平均交并比(mIoU)达到了86.14%,分别比多模态模型和最佳单模态模型高出3.91%和5.84%。此外,CTDUNet在病虫害的多类分割中表现出高度的平衡性。这些结果表明融合图像和文本多模态信息在茶树病害分割中成功应用,取得了优异的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/718b/11359422/de768b09a29f/plants-13-02274-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/718b/11359422/ea88181a38bb/plants-13-02274-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/718b/11359422/a097e831d929/plants-13-02274-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/718b/11359422/2281e2154106/plants-13-02274-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/718b/11359422/2e5de58d6fdd/plants-13-02274-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/718b/11359422/cc5f55f3be7d/plants-13-02274-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/718b/11359422/dcfc6a82719c/plants-13-02274-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/718b/11359422/977c5b12856b/plants-13-02274-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/718b/11359422/de768b09a29f/plants-13-02274-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/718b/11359422/ea88181a38bb/plants-13-02274-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/718b/11359422/a097e831d929/plants-13-02274-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/718b/11359422/2281e2154106/plants-13-02274-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/718b/11359422/2e5de58d6fdd/plants-13-02274-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/718b/11359422/cc5f55f3be7d/plants-13-02274-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/718b/11359422/dcfc6a82719c/plants-13-02274-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/718b/11359422/977c5b12856b/plants-13-02274-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/718b/11359422/de768b09a29f/plants-13-02274-g008.jpg

相似文献

1
CTDUNet: A Multimodal CNN-Transformer Dual U-Shaped Network with Coordinate Space Attention for Pests and Diseases Segmentation in Complex Environments.CTDUNet:一种用于复杂环境中病虫害分割的具有坐标空间注意力的多模态卷积神经网络-Transformer双U型网络
Plants (Basel). 2024 Aug 15;13(16):2274. doi: 10.3390/plants13162274.
2
ETUNet:Exploring efficient transformer enhanced UNet for 3D brain tumor segmentation.ETUNet:探索高效的基于Transformer 的增强型 UNet 进行 3D 脑肿瘤分割。
Comput Biol Med. 2024 Mar;171:108005. doi: 10.1016/j.compbiomed.2024.108005. Epub 2024 Jan 23.
3
ECA-TFUnet: A U-shaped CNN-Transformer network with efficient channel attention for organ segmentation in anatomical sectional images of canines.ECA-TFUnet:一种具有高效通道注意力的 U 形 CNN-Transformer 网络,用于犬类解剖切片图像中的器官分割。
Math Biosci Eng. 2023 Oct 7;20(10):18650-18669. doi: 10.3934/mbe.2023827.
4
ETU-Net: edge enhancement-guided U-Net with transformer for skin lesion segmentation.ETU-Net:基于边缘增强引导的 U-Net 与 Transformer 的皮肤病变分割。
Phys Med Biol. 2023 Dec 22;69(1). doi: 10.1088/1361-6560/ad13d2.
5
RTC_TongueNet: An improved tongue image segmentation model based on DeepLabV3.RTC_TongueNet:一种基于DeepLabV3的改进型舌图像分割模型。
Digit Health. 2024 Mar 28;10:20552076241242773. doi: 10.1177/20552076241242773. eCollection 2024 Jan-Dec.
6
STC-UNet: renal tumor segmentation based on enhanced feature extraction at different network levels.STC-UNet:基于不同网络层次增强特征提取的肾肿瘤分割。
BMC Med Imaging. 2024 Jul 19;24(1):179. doi: 10.1186/s12880-024-01359-5.
7
Dual encoder network with transformer-CNN for multi-organ segmentation.基于 Transformer-CNN 的双编码器网络的多器官分割。
Med Biol Eng Comput. 2023 Mar;61(3):661-671. doi: 10.1007/s11517-022-02723-9. Epub 2022 Dec 29.
8
CPFTransformer: transformer fusion context pyramid medical image segmentation network.CPFTransformer:变换器融合上下文金字塔医学图像分割网络。
Front Neurosci. 2023 Dec 7;17:1288366. doi: 10.3389/fnins.2023.1288366. eCollection 2023.
9
LViT: Language Meets Vision Transformer in Medical Image Segmentation.LViT:医学图像分割中语言与视觉Transformer的融合
IEEE Trans Med Imaging. 2024 Jan;43(1):96-107. doi: 10.1109/TMI.2023.3291719. Epub 2024 Jan 2.
10
SwinCross: Cross-modal Swin transformer for head-and-neck tumor segmentation in PET/CT images.SwinCross:用于 PET/CT 图像中头颈部肿瘤分割的跨模态 Swin 变换器。
Med Phys. 2024 Mar;51(3):2096-2107. doi: 10.1002/mp.16703. Epub 2023 Sep 30.

引用本文的文献

1
Sparse-MoE-SAM: A Lightweight Framework Integrating MoE and SAM with a Sparse Attention Mechanism for Plant Disease Segmentation in Resource-Constrained Environments.稀疏混合专家-分割注意力模型:一种在资源受限环境下用于植物病害分割的、集成了混合专家和分割注意力模型并采用稀疏注意力机制的轻量级框架。
Plants (Basel). 2025 Aug 24;14(17):2634. doi: 10.3390/plants14172634.
2
Multiclass semantic segmentation for prime disease detection with severity level identification in Citrus plant leaves.用于柑橘植物叶片主要病害检测及严重程度识别的多类语义分割
Sci Rep. 2025 Jul 1;15(1):21208. doi: 10.1038/s41598-025-04758-y.
3
Artificial Intelligence-Assisted Breeding for Plant Disease Resistance.

本文引用的文献

1
NFMPAtt-Unet: Neighborhood Fuzzy C-means Multi-scale Pyramid Hybrid Attention Unet for medical image segmentation.NFMPAtt-Unet:用于医学图像分割的邻域模糊C均值多尺度金字塔混合注意力Unet
Neural Netw. 2024 Oct;178:106489. doi: 10.1016/j.neunet.2024.106489. Epub 2024 Jun 22.
2
Efficient Preparation of Biodiesel Using Sulfonated Shell Biochar as a Catalyst.利用磺化贝壳生物炭作为催化剂高效制备生物柴油。
Molecules. 2024 Jun 9;29(12):2752. doi: 10.3390/molecules29122752.
3
High-Accuracy Tomato Leaf Disease Image-Text Retrieval Method Utilizing LAFANet.
人工智能辅助的植物抗病育种
Int J Mol Sci. 2025 Jun 1;26(11):5324. doi: 10.3390/ijms26115324.
基于LAFANet的高精度番茄叶部病害图像-文本检索方法
Plants (Basel). 2024 Apr 23;13(9):1176. doi: 10.3390/plants13091176.
4
A Precise Framework for Rice Leaf Disease Image-Text Retrieval Using FHTW-Net.一种基于FHTW-Net的水稻叶部病害图像-文本检索精确框架。
Plant Phenomics. 2024 Apr 25;6:0168. doi: 10.34133/plantphenomics.0168. eCollection 2024.
5
Transparent medical image AI via an image-text foundation model grounded in medical literature.基于医学文献的图文基础模型实现透明的医学影像 AI
Nat Med. 2024 Apr;30(4):1154-1165. doi: 10.1038/s41591-024-02887-x. Epub 2024 Apr 16.
6
Enhanced Heterogeneous Graph Attention Network with a Novel Multilabel Focal Loss for Document-Level Relation Extraction.具有新型多标签焦点损失的增强异构图注意力网络用于文档级关系抽取
Entropy (Basel). 2024 Feb 28;26(3):210. doi: 10.3390/e26030210.
7
Adaptive t-vMF dice loss: An effective expansion of dice loss for medical image segmentation.自适应 t-vMF Dice 损失:Dice 损失在医学图像分割中的有效扩展。
Comput Biol Med. 2024 Jan;168:107695. doi: 10.1016/j.compbiomed.2023.107695. Epub 2023 Nov 27.
8
Classification of Diseases in Complex Environments by Attention and Multi-Dimensional Feature Fusion Neural Network.基于注意力和多维度特征融合神经网络的复杂环境下疾病分类
Plants (Basel). 2023 Jul 20;12(14):2701. doi: 10.3390/plants12142701.
9
VGG16 Feature Extractor with Extreme Gradient Boost Classifier for Pancreas Cancer Prediction.用于胰腺癌预测的具有极端梯度提升分类器的VGG16特征提取器
J Imaging. 2023 Jul 7;9(7):138. doi: 10.3390/jimaging9070138.
10
DPAM-PSPNet: ultrasonic image segmentation of thyroid nodule based on dual-path attention mechanism.DPAM-PSPNet:基于双路径注意力机制的甲状腺结节超声图像分割
Phys Med Biol. 2023 Jul 31;68(16). doi: 10.1088/1361-6560/ace6f1.