Suppr超能文献

TrIncNet:一种用于植物病害识别的轻量级视觉Transformer网络。

TrIncNet: a lightweight vision transformer network for identification of plant diseases.

作者信息

Gole Pushkar, Bedi Punam, Marwaha Sudeep, Haque Md Ashraful, Deb Chandan Kumar

机构信息

Department of Computer Science, University of Delhi, New Delhi, India.

Indian Council of Agricultural Research (ICAR)-Indian Agricultural Statistics Research Institute, New Delhi, India.

出版信息

Front Plant Sci. 2023 Jul 27;14:1221557. doi: 10.3389/fpls.2023.1221557. eCollection 2023.

Abstract

In the agricultural sector, identifying plant diseases at their earliest possible stage of infestation still remains a huge challenge with respect to the maximization of crop production and farmers' income. In recent years, advanced computer vision techniques like Vision Transformers (ViTs) are being successfully applied to identify plant diseases automatically. However, the MLP module in existing ViTs is computationally expensive as well as inefficient in extracting promising features from diseased images. Therefore, this study proposes a comparatively lightweight and improved vision transformer network, also known as "TrIncNet" for plant disease identification. In the proposed network, we introduced a modified encoder architecture a.k.a. Trans-Inception block in which the MLP block of existing ViT was replaced by a custom inception block. Additionally, each Trans-Inception block is surrounded by a skip connection, making it much more resistant to the vanishing gradient problem. The applicability of the proposed network for identifying plant diseases was assessed using two plant disease image datasets viz: PlantVillage dataset and Maize disease dataset (contains in-field images of Maize diseases). The comparative performance analysis on both datasets reported that the proposed TrIncNet network outperformed the state-of-the-art CNN architectures viz: VGG-19, GoogLeNet, ResNet-50, Xception, InceptionV3, and MobileNet. Moreover, the experimental results also showed that the proposed network had achieved 5.38% and 2.87% higher testing accuracy than the existing ViT network on both datasets, respectively. Therefore, the lightweight nature and improved prediction performance make the proposed network suitable for being integrated with IoT devices to assist the stakeholders in identifying plant diseases at the field level.

摘要

在农业领域,就作物产量最大化和农民收入而言,在病虫害侵染的最早阶段识别植物病害仍然是一项巨大挑战。近年来,诸如视觉Transformer(ViT)等先进的计算机视觉技术已成功应用于自动识别植物病害。然而,现有ViT中的MLP模块计算成本高昂,且在从患病图像中提取有前景的特征方面效率低下。因此,本研究提出了一种相对轻量级且经过改进的视觉Transformer网络,即用于植物病害识别的“TrIncNet”。在所提出的网络中,我们引入了一种经过修改的编码器架构,即Trans-Inception模块,其中现有ViT的MLP模块被一个定制的Inception模块所取代。此外,每个Trans-Inception模块都被一个跳跃连接所环绕,使其对梯度消失问题具有更强的抗性。使用两个植物病害图像数据集,即植物村数据集和玉米病害数据集(包含玉米病害的田间图像),评估了所提出网络在识别植物病害方面的适用性。对这两个数据集的比较性能分析表明,所提出的TrIncNet网络优于当前最先进的CNN架构,即VGG-19、GoogLeNet、ResNet-50、Xception、InceptionV3和MobileNet。此外,实验结果还表明,在所提出的网络在两个数据集上的测试准确率分别比现有ViT网络高5.38%和2.87%。因此,轻量级特性和改进的预测性能使所提出的网络适合与物联网设备集成,以协助利益相关者在田间层面识别植物病害。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6eea/10414585/88b31453e613/fpls-14-1221557-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验