Suppr超能文献

利用曼巴架构革新自动化梨采摘技术。

Revolutionizing automated pear picking using Mamba architecture.

作者信息

Zhao Peirui, Cai Weiwei, Zhou Wenhua, Li Na

机构信息

College of Computer & Information Engineering, Central South University of Forestry and Technology, Changsha, 410004, China.

Changsha Vocational and Technical College of Commerce and Tourism, Changsha, 410004, China.

出版信息

Plant Methods. 2024 Nov 4;20(1):167. doi: 10.1186/s13007-024-01287-z.

Abstract

With the emergence of the new generation vision architecture Vmamba and the further demand for agricultural yield and efficiency, we propose an efficient and high-accuracy target detection network for automated pear picking tasks based on Vmamba, aiming to address the issue of low efficiency in current Transformer architectures. The proposed network, named SRSMamba, employs a Reward and Punishment Mechanism (RPM) to focus on important information while minimizing redundancy interference. It utilizes 3D Selective Scan (SS3D) to extend scanning dimensions and integrates global information across channel dimensions, thereby enhancing the model's robustness in complex agricultural environments and effectively adapting to the extraction of complex features in pear orchards and farmlands. Additionally, a Stacked Feature Pyramid Network (SFPN) is introduced to enhance semantic information during the feature fusion stage, particularly improving the detection capability for small targets. Experimental results show that SRSMamba has a low parameter count of 21.1 M, GFLOPs of 50.4, mAP of 72.0%, mAP50 reaching 94.8%, mAP75 at 68.1%, and FPS at 26.9. Compared with other state-of-the-art (SOTA) object detection methods, it achieves a good trade-off between model efficiency and detection accuracy.

摘要

随着新一代视觉架构Vmamba的出现以及对农业产量和效率的进一步需求,我们提出了一种基于Vmamba的用于自动梨采摘任务的高效高精度目标检测网络,旨在解决当前Transformer架构效率低下的问题。所提出的网络名为SRSMamba,采用奖惩机制(RPM)来聚焦重要信息,同时最小化冗余干扰。它利用3D选择性扫描(SS3D)来扩展扫描维度,并跨通道维度整合全局信息,从而增强模型在复杂农业环境中的鲁棒性,并有效适应梨园和农田中复杂特征的提取。此外,引入了堆叠特征金字塔网络(SFPN)以在特征融合阶段增强语义信息,特别是提高对小目标的检测能力。实验结果表明,SRSMamba的参数数量低至2110万个,浮点运算次数为50.4,平均精度均值(mAP)为72.0%,mAP50达到94.8%,mAP75为68.1%,每秒帧数(FPS)为26.9。与其他最新的(SOTA)目标检测方法相比,它在模型效率和检测精度之间实现了良好的权衡。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1881/11536875/92d8d9a77beb/13007_2024_1287_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验