Zhao Peirui, Cai Weiwei, Zhou Wenhua, Li Na
College of Computer & Information Engineering, Central South University of Forestry and Technology, Changsha, 410004, China.
Changsha Vocational and Technical College of Commerce and Tourism, Changsha, 410004, China.
Plant Methods. 2024 Nov 4;20(1):167. doi: 10.1186/s13007-024-01287-z.
With the emergence of the new generation vision architecture Vmamba and the further demand for agricultural yield and efficiency, we propose an efficient and high-accuracy target detection network for automated pear picking tasks based on Vmamba, aiming to address the issue of low efficiency in current Transformer architectures. The proposed network, named SRSMamba, employs a Reward and Punishment Mechanism (RPM) to focus on important information while minimizing redundancy interference. It utilizes 3D Selective Scan (SS3D) to extend scanning dimensions and integrates global information across channel dimensions, thereby enhancing the model's robustness in complex agricultural environments and effectively adapting to the extraction of complex features in pear orchards and farmlands. Additionally, a Stacked Feature Pyramid Network (SFPN) is introduced to enhance semantic information during the feature fusion stage, particularly improving the detection capability for small targets. Experimental results show that SRSMamba has a low parameter count of 21.1 M, GFLOPs of 50.4, mAP of 72.0%, mAP50 reaching 94.8%, mAP75 at 68.1%, and FPS at 26.9. Compared with other state-of-the-art (SOTA) object detection methods, it achieves a good trade-off between model efficiency and detection accuracy.
随着新一代视觉架构Vmamba的出现以及对农业产量和效率的进一步需求,我们提出了一种基于Vmamba的用于自动梨采摘任务的高效高精度目标检测网络,旨在解决当前Transformer架构效率低下的问题。所提出的网络名为SRSMamba,采用奖惩机制(RPM)来聚焦重要信息,同时最小化冗余干扰。它利用3D选择性扫描(SS3D)来扩展扫描维度,并跨通道维度整合全局信息,从而增强模型在复杂农业环境中的鲁棒性,并有效适应梨园和农田中复杂特征的提取。此外,引入了堆叠特征金字塔网络(SFPN)以在特征融合阶段增强语义信息,特别是提高对小目标的检测能力。实验结果表明,SRSMamba的参数数量低至2110万个,浮点运算次数为50.4,平均精度均值(mAP)为72.0%,mAP50达到94.8%,mAP75为68.1%,每秒帧数(FPS)为26.9。与其他最新的(SOTA)目标检测方法相比,它在模型效率和检测精度之间实现了良好的权衡。