Gao Xinyue, He Lili, Liu Yinchuan, Wu Jiaxin, Cao Yuying, Dong Shoutian, Jia Yinjiang
College of Electrical Engineering and Information, Northeast Agricultural University, Harbin 150030, China.
Key Laboratory of Northeast Smart Agricultural Technology, Ministry of Agriculture and Rural Affairs, Harbin 150030, China.
Sensors (Basel). 2025 Aug 10;25(16):4954. doi: 10.3390/s25164954.
Early and accurate identification of maize diseases is crucial for ensuring sustainable agricultural development. However, existing maize disease identification models face challenges including high inter-class similarity, intra-class variability, and limited capability in identifying early-stage symptoms. To address these limitations, we proposed DSTANet (decomposed spatial token aggregation network), a lightweight and high-performance model for maize leaf disease identification. In this study, we constructed a comprehensive maize leaf image dataset comprising six common disease types and healthy samples, with early and late stages of northern leaf blight and eyespot specifically differentiated. DSTANet employed MobileViT as the backbone architecture, combining the advantages of CNNs for local feature extraction with transformers for global feature modeling. To enhance lesion localization and mitigate interference from complex field backgrounds, DSFM (decomposed spatial fusion module) was introduced. Additionally, the MSTA (multi-scale token aggregator) was designed to leverage hidden-layer feature channels more effectively, improving information flow and preventing gradient vanishing. Experimental results showed that DSTANet achieved an accuracy of 96.11%, precision of 96.17%, recall of 96.11%, and F1-score of 96.14%. With only 1.9M parameters, 0.6 GFLOPs (floating point operations), and an inference speed of 170 images per second, the model meets real-time deployment requirements on edge devices. This study provided a novel and practical approach for fine-grained and early-stage maize disease identification, offering technical support for smart agriculture and precision crop management.
早期准确识别玉米病害对于确保农业可持续发展至关重要。然而,现有的玉米病害识别模型面临诸多挑战,包括类间相似度高、类内变异性大以及识别早期症状的能力有限。为解决这些局限性,我们提出了DSTANet(分解空间令牌聚合网络),这是一种用于玉米叶部病害识别的轻量级高性能模型。在本研究中,我们构建了一个全面的玉米叶图像数据集,包含六种常见病害类型和健康样本,特别区分了大斑病和眼斑病的早期和晚期阶段。DSTANet采用MobileViT作为骨干架构,结合了卷积神经网络(CNN)用于局部特征提取的优势和Transformer用于全局特征建模的优势。为增强病斑定位并减轻复杂田间背景的干扰,引入了DSFM(分解空间融合模块)。此外,设计了MSTA(多尺度令牌聚合器)以更有效地利用隐藏层特征通道,改善信息流并防止梯度消失。实验结果表明,DSTANet的准确率为96.11%,精确率为96.17%,召回率为96.11%,F1分数为96.14%。该模型仅具有190万个参数、0.6 GFLOP(浮点运算),推理速度为每秒170张图像,满足在边缘设备上的实时部署要求。本研究为细粒度和早期玉米病害识别提供了一种新颖实用的方法,为智能农业和精准作物管理提供了技术支持。