Suppr超能文献

通过一种新型混合移位视觉Transformer方法增强甘蔗叶病分类:技术见解与方法进展

Enhancing sugarcane leaf disease classification through a novel hybrid shifted-vision transformer approach: technical insights and methodological advancements.

作者信息

Kuppusamy Abirami, Sundaresan Srinivasan Kandasamy, Cingaram Ravichandran

机构信息

Department of Electronics and Communication Engineering, Easwari Engineering College, Chennai, Tamil Nadu, India.

Department of Electronics and Communication Engineering, Mohamed Sathak AJ College of Engineering, Chennai, Tamil Nadu, India.

出版信息

Environ Monit Assess. 2024 Dec 7;197(1):37. doi: 10.1007/s10661-024-13468-3.

Abstract

In the agricultural sector, sugarcane farming is one of the most organized forms of cultivation. India is the second-largest producer of sugarcane in the world. However, sugarcane crops are highly affected by diseases, which significantly affect crop production. Despite development in deep learning techniques, disease detection remains a challenging and time-consuming task. This paper presents a novel Hybrid Shifted-Vision Transformer approach for the automated classification of sugarcane leaf diseases. The model integrates the Vision Transformer architecture with Hybrid Shifted Windows to effectively capture both local and global features, which is crucial for accurately identifying disease patterns at different spatial scales. To improve feature representation and model performance, self-supervised learning is employed using data augmentation techniques like random rotation, flipping, and occlusion, combined with a jigsaw puzzle task that helps the model learn spatial relationships in images. The method addresses class imbalances in the dataset through stratified sampling, ensuring balanced training and testing sets. The approach is fine-tuned on sugarcane leaf disease datasets using categorical cross-entropy loss, minimizing dissimilarity between predicted probabilities and real labels. Experimental results demonstrate that the Hybrid Shifted-Vision Transformer outperforms traditional models, achieving higher accuracy in disease detection of 98.5%, making it crucial for reliable disease diagnosis and decision-making in agriculture. This architecture enables efficient, large-scale automated sugarcane disease monitoring.

摘要

在农业领域,甘蔗种植是最具组织化的种植形式之一。印度是世界上第二大甘蔗生产国。然而,甘蔗作物极易受到病害影响,这严重影响了作物产量。尽管深度学习技术有所发展,但病害检测仍然是一项具有挑战性且耗时的任务。本文提出了一种新颖的混合移位视觉Transformer方法,用于甘蔗叶部病害的自动分类。该模型将视觉Transformer架构与混合移位窗口相结合,以有效捕捉局部和全局特征,这对于在不同空间尺度上准确识别病害模式至关重要。为了提高特征表示和模型性能,使用随机旋转、翻转和遮挡等数据增强技术以及有助于模型学习图像中空间关系的拼图任务进行自监督学习。该方法通过分层采样解决数据集中的类别不平衡问题,确保训练集和测试集的平衡。使用分类交叉熵损失在甘蔗叶部病害数据集上对该方法进行微调,以最小化预测概率与真实标签之间的差异。实验结果表明,混合移位视觉Transformer优于传统模型,在病害检测中达到了98.5%的更高准确率,这对于农业中可靠的病害诊断和决策至关重要。这种架构能够实现高效、大规模的甘蔗病害自动监测。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验