Wu Ruizhao, He Feng, Rong Ziyang, Liang Zhixue, Xu Wenxing, Ni Fuchuan, Dong Wenyong
College of Informatics, Huazhong Agricultural University, Wuhan, China.
Engineering Research Center of Intelligent Technology for Agriculture, Ministry of Education, College of Informatics, Huazhong Agricultural University, Wuhan, China.
Front Plant Sci. 2024 Aug 13;15:1411689. doi: 10.3389/fpls.2024.1411689. eCollection 2024.
Detecting and controlling tea pests promptly are crucial for safeguarding tea production quality. Due to the insufficient feature extraction ability of traditional CNN-based methods, they face challenges such as inaccuracy and inefficiency of detecting pests in dense and mimicry scenarios. This study proposes an end-to-end tea pest detection and segmentation framework, TeaPest-Transfiner (TP-Transfiner), based on Mask Transfiner to address the challenge of detecting and segmenting pests in mimicry and dense scenarios. In order to improve the feature extraction inability and weak accuracy of traditional convolution modules, this study proposes three strategies. Firstly, a deformable attention block is integrated into the model, which consists of deformable convolution and self-attention using the key content only term. Secondly, the FPN architecture in the backbone network is improved with a more effective feature-aligned pyramid network (FaPN). Lastly, focal loss is employed to balance positive and negative samples during the training period, and parameters are adapted to the dataset distribution. Furthermore, to address the lack of tea pest images, a dataset called TeaPestDataset is constructed, which contains 1,752 images and 29 species of tea pests. Experimental results on the TeaPestDataset show that the proposed TP-Transfiner model achieves state-of-the-art performance compared with other models, attaining a detection precision (AP50) of 87.211% and segmentation performance of 87.381%. Notably, the model shows a significant improvement in segmentation average precision (mAP) by 9.4% and a reduction in model size by 30% compared to the state-of-the-art CNN-based model Mask R-CNN. Simultaneously, TP-Transfiner's lightweight module fusion maintains fast inference speeds and a compact model size, demonstrating practical potential for pest control in tea gardens, especially in dense and mimicry scenarios.
及时检测和控制茶树害虫对于保障茶叶生产质量至关重要。由于传统基于卷积神经网络(CNN)的方法特征提取能力不足,它们在密集和拟态场景中检测害虫时面临诸如不准确和效率低下等挑战。本研究基于Mask Transfiner提出了一种端到端的茶树害虫检测与分割框架,即TeaPest-Transfiner(TP-Transfiner),以应对在拟态和密集场景中检测与分割害虫的挑战。为了改善传统卷积模块特征提取能力不足和准确性较弱的问题,本研究提出了三种策略。首先,将可变形注意力模块集成到模型中,该模块由可变形卷积和仅使用关键内容项的自注意力组成。其次,使用更有效的特征对齐金字塔网络(FaPN)改进骨干网络中的特征金字塔网络(FPN)架构。最后,在训练期间采用焦点损失来平衡正负样本,并使参数适应数据集分布。此外,为了解决茶树害虫图像不足的问题,构建了一个名为TeaPestDataset的数据集,其中包含1752张图像和29种茶树害虫。在TeaPestDataset上的实验结果表明,与其他模型相比,所提出的TP-Transfiner模型实现了最优性能,检测精度(AP50)达到87.211%,分割性能达到87.381%。值得注意的是,与基于CNN的最优模型Mask R-CNN相比,该模型的分割平均精度(mAP)显著提高了9.4%,模型大小减少了30%。同时,TP-Transfiner的轻量级模块融合保持了快速推理速度和紧凑的模型大小,展示了在茶园害虫防治中的实际应用潜力,特别是在密集和拟态场景中。