Qian Xiufeng, Zhang Chengqi, Chen Li, Li Ke
School of Information and Computer, Anhui Agricultural University, Hefei, China.
Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, China.
Front Plant Sci. 2022 Apr 28;13:864486. doi: 10.3389/fpls.2022.864486. eCollection 2022.
Maize leaf diseases significantly reduce maize yield; therefore, monitoring and identifying the diseases during the growing season are crucial. Some of the current studies are based on images with simple backgrounds, and the realistic field settings are full of background noise, making this task challenging. We collected low-cost red, green, and blue (RGB) images from our experimental fields and public dataset, and they contain a total of four categories, namely, southern corn leaf blight (SCLB), gray leaf spot (GLS), southern corn rust (SR), and healthy (H). This article proposes a model different from convolutional neural networks (CNNs) based on transformer and self-attention. It represents visual information of local regions of images by tokens, calculates the correlation (called attention) of information between local regions with an attention mechanism, and finally integrates global information to make the classification. The results show that our model achieves the best performance compared to five mainstream CNNs at a meager computational cost, and the attention mechanism plays an extremely important role. The disease lesions information was effectively emphasized, and the background noise was suppressed. The proposed model is more suitable for fine-grained maize leaf disease identification in a complex background, and we demonstrated this idea from three perspectives, namely, theoretical, experimental, and visualization.
玉米叶部病害会显著降低玉米产量;因此,在生长季节对病害进行监测和识别至关重要。当前的一些研究基于背景简单的图像,而实际的田间环境充满背景噪声,使得这项任务具有挑战性。我们从实验田和公共数据集中收集了低成本的红、绿、蓝(RGB)图像,它们总共包含四类,即玉米小斑病(SCLB)、灰斑病(GLS)、玉米南方锈病(SR)和健康(H)。本文提出了一种基于Transformer和自注意力机制的不同于卷积神经网络(CNN)的模型。它通过令牌表示图像局部区域的视觉信息,利用注意力机制计算局部区域之间信息的相关性(称为注意力),最后整合全局信息进行分类。结果表明,我们的模型以极低的计算成本与五种主流CNN相比取得了最佳性能,且注意力机制发挥了极其重要的作用。病害病斑信息得到有效强调,背景噪声得到抑制。所提出的模型更适合在复杂背景下进行细粒度的玉米叶部病害识别,我们从理论、实验和可视化三个角度论证了这一观点。