College of Information and Computer Science, Anhui Agricultural University, Hefei 230036, China.
Key Laboratory of Agricultural Sensors, Ministry of Agriculture and Rural Affairs, Hefei 230036, China.
Sensors (Basel). 2022 Dec 20;23(1):30. doi: 10.3390/s23010030.
Precise pear detection and recognition is an essential step toward modernizing orchard management. However, due to the ubiquitous occlusion in orchards and various locations of image acquisition, the pears in the acquired images may be quite small and occluded, causing high false detection and object loss rate. In this paper, a multi-scale collaborative perception network YOLOv5s-FP (Fusion and Perception) was proposed for pear detection, which coupled local and global features. Specifically, a pear dataset with a high proportion of small and occluded pears was proposed, comprising 3680 images acquired with cameras mounted on a ground tripod and a UAV platform. The cross-stage partial (CSP) module was optimized to extract global features through a transformer encoder, which was then fused with local features by an attentional feature fusion mechanism. Subsequently, a modified path aggregation network oriented to collaboration perception of multi-scale features was proposed by incorporating a transformer encoder, the optimized CSP, and new skip connections. The quantitative results of utilizing the YOLOv5s-FP for pear detection were compared with other typical object detection networks of the YOLO series, recording the highest average precision of 96.12% with less detection time and computational cost. In qualitative experiments, the proposed network achieved superior visual performance with stronger robustness to the changes in occlusion and illumination conditions, particularly providing the ability to detect pears with different sizes in highly dense, overlapping environments and non-normal illumination areas. Therefore, the proposed YOLOv5s-FP network was practicable for detecting in-field pears in a real-time and accurate way, which could be an advantageous component of the technology for monitoring pear growth status and implementing automated harvesting in unmanned orchards.
精准的梨果检测和识别是实现果园管理现代化的重要步骤。然而,由于果园中存在普遍的遮挡和图像采集位置的多样性,采集到的图像中的梨果可能很小且被遮挡,导致高误检和目标丢失率。在本文中,我们提出了一种用于梨果检测的多尺度协同感知网络 YOLOv5s-FP(融合与感知),该网络结合了局部和全局特征。具体来说,我们提出了一个具有高比例小尺寸和遮挡梨果的梨果数据集,该数据集由安装在地面三脚架和无人机平台上的相机采集的 3680 张图像组成。我们优化了交叉阶段局部(CSP)模块,通过变压器编码器提取全局特征,然后通过注意力特征融合机制与局部特征融合。随后,我们通过引入变压器编码器、优化的 CSP 和新的 skip 连接,提出了一种面向多尺度特征协同感知的改进路径聚合网络。我们将利用 YOLOv5s-FP 进行梨果检测的定量结果与其他典型的 YOLO 系列目标检测网络进行了比较,记录了 96.12%的最高平均精度,同时检测时间和计算成本更低。在定性实验中,所提出的网络表现出卓越的视觉性能,对遮挡和光照条件的变化具有更强的鲁棒性,特别是能够在高度密集、重叠的环境和非正态光照区域中检测到不同大小的梨果。因此,所提出的 YOLOv5s-FP 网络可实时、准确地检测田间梨果,这可以成为监测梨果生长状态和在无人果园中实现自动化采摘的技术的一个有利组成部分。