Qian Yiheng, Xiao Zhiyong, Deng Zhaohong
School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi, China.
Front Plant Sci. 2025 Apr 3;16:1500571. doi: 10.3389/fpls.2025.1500571. eCollection 2025.
Pests are a major cause of crop loss globally, and accurate pest identification is crucial for effective prevention and control strategies. This paper proposes a novel deep-learning architecture for crop pest classification, addressing the limitations of existing methods that struggle with distinguishing the fine details of pests and background interference. The proposed model is designed to balance fine-grained feature extraction with deep semantic understanding, utilizing a parallel structure composed of two main components: the Feature Fusion Module (FFM) and the Mixed Attention Module (MAM). FFM focuses on extracting key fine-grained features and fusing them across multiple scales, while MAM leverages an attention mechanism to model long-range dependencies within the channel domain, further enhancing feature representation. Additionally, a Transformer block is integrated to overcome the limitations of traditional convolutional approaches in capturing global contextual information. The proposed architecture is evaluated on three benchmark datasets-IP102, D0, and Li-demonstrating its superior performance over state-of-the-art methods. The model achieves accuracies of 75.74% on IP102, 99.82% on D0, and 98.77% on Li, highlighting its robustness and effectiveness in complex crop pest recognition tasks. These results indicate that the proposed method excels in multi-scale feature fusion and long-range dependency modeling, offering a new competitive approach to pest classification in agricultural settings.
害虫是全球作物损失的主要原因,准确识别害虫对于有效的预防和控制策略至关重要。本文提出了一种用于作物害虫分类的新型深度学习架构,解决了现有方法在区分害虫细微细节和背景干扰方面的局限性。所提出的模型旨在平衡细粒度特征提取与深度语义理解,利用由两个主要组件组成的并行结构:特征融合模块(FFM)和混合注意力模块(MAM)。FFM专注于提取关键的细粒度特征并跨多个尺度进行融合,而MAM利用注意力机制对通道域内的长距离依赖关系进行建模,进一步增强特征表示。此外,还集成了一个Transformer模块,以克服传统卷积方法在捕获全局上下文信息方面的局限性。在所提出的架构在三个基准数据集IP102、D0和Li上进行了评估,证明了其优于现有方法的性能。该模型在IP102上的准确率达到75.74%,在D0上达到99.82%,在Li上达到98.77%,突出了其在复杂作物害虫识别任务中的鲁棒性和有效性。这些结果表明,所提出的方法在多尺度特征融合和长距离依赖建模方面表现出色,为农业环境中的害虫分类提供了一种新的具有竞争力的方法。