Institute of Automation, Qilu University of Technology (Shandong Academy of Sciences), Jinan, 250014, China.
Auckland University of Technology, Auckland, New Zealand.
Sci Rep. 2024 Oct 30;14(1):26086. doi: 10.1038/s41598-024-76662-w.
The accurate recognition of apples in complex orchard environments is a fundamental aspect of the operation of automated picking equipment. This paper aims to investigate the influence of dense targets, occlusion, and the natural environment in practical application scenarios. To this end, it constructs a fruit dataset containing different scenarios and proposes a real-time lightweight detection network, ELD(Efficient Lightweight object Detector). The EGSS(Efficient Ghost-shuffle Slim module) module and MCAttention(Mix channel Attention) are proposed as innovative solutions to the problems of feature extraction and classification. The attention mechanism is employed to construct a novel feature extraction network, which effectively utilizes the low-latitude feature information, significantly enhances the fine-grained feature information and gradient flow of the model, and improves the model's anti-interference ability. Eliminate redundant channels with SlimPAN to further compress the network and optimise functionality. The network as a whole employs the Shape-IOU loss function, which considers the influence of the bounding box itself, thereby enhancing the robustness of the model. Finally, the target detection accuracy is enhanced through the transfer of knowledge from the teacher's network through knowledge distillation, while ensuring that the overall network is sufficiently lightweight. The experimental results demonstrate that the ELD network, designed for fruit detection, achieves an accuracy of 87.4%. It has a relatively low number of parameters ( ), a GLOPs of only 1.7, and a high FPS of 156. This network can achieve high accuracy while consuming fewer computational resources and performing better than other networks.
在复杂的果园环境中准确识别苹果是自动化采摘设备运行的基础。本文旨在研究密集目标、遮挡和实际应用场景中的自然环境对其的影响。为此,构建了一个包含不同场景的水果数据集,并提出了一个实时轻量级检测网络 ELD(Efficient Lightweight object Detector)。EGSS(Efficient Ghost-shuffle Slim module)模块和 MCAttention(Mix channel Attention)被提出作为解决特征提取和分类问题的创新方案。注意力机制被用于构建一个新的特征提取网络,有效地利用了低纬特征信息,显著增强了模型的细粒度特征信息和梯度流,提高了模型的抗干扰能力。使用 SlimPAN 消除冗余通道,进一步压缩网络并优化功能。网络整体采用 Shape-IOU 损失函数,考虑了边界框自身的影响,从而增强了模型的鲁棒性。最后,通过知识蒸馏从教师网络转移知识,提高目标检测精度,同时确保整个网络足够轻量级。实验结果表明,设计用于水果检测的 ELD 网络的准确率为 87.4%。它的参数量相对较少( ),GLOPs 仅为 1.7,FPS 高达 156。该网络在消耗较少计算资源的情况下能够实现高精度,性能优于其他网络。