Wang Lele, Zhao Yingjie, Xiong Zhangjun, Wang Shizhou, Li Yuanhong, Lan Yubin
College of Electronic Engineering, College of Artificial Intelligence, South China Agricultural University, Guangzhou, China.
National Center for International Collaboration Research on Precision Agricultural Aviation Pesticides Spraying Technology, Guangzhou, China.
Front Plant Sci. 2022 Aug 9;13:965425. doi: 10.3389/fpls.2022.965425. eCollection 2022.
The fast and precise detection of dense litchi fruits and the determination of their maturity is of great practical significance for yield estimation in litchi orchards and robot harvesting. Factors such as complex growth environment, dense distribution, and random occlusion by leaves, branches, and other litchi fruits easily cause the predicted output based on computer vision deviate from the actual value. This study proposed a fast and precise litchi fruit detection method and application software based on an improved You Only Look Once version 5 (YOLOv5) model, which can be used for the detection and yield estimation of litchi in orchards. First, a dataset of litchi with different maturity levels was established. Second, the YOLOv5s model was chosen as a base version of the improved model. ShuffleNet v2 was used as the improved backbone network, and then the backbone network was fine-tuned to simplify the model structure. In the feature fusion stage, the CBAM module was introduced to further refine litchi's effective feature information. Considering the characteristics of the small size of dense litchi fruits, the 1,280 × 1,280 was used as the improved model input size while we optimized the network structure. To evaluate the performance of the proposed method, we performed ablation experiments and compared it with other models on the test set. The results showed that the improved model's mean average precision (mAP) presented a 3.5% improvement and 62.77% compression in model size compared with the original model. The improved model size is 5.1 MB, and the frame per second (FPS) is 78.13 frames/s at a confidence of 0.5. The model performs well in precision and robustness in different scenarios. In addition, we developed an Android application for litchi counting and yield estimation based on the improved model. It is known from the experiment that the correlation coefficient between the application test and the actual results was 0.9879. In summary, our improved method achieves high precision, lightweight, and fast detection performance at large scales. The method can provide technical means for portable yield estimation and visual recognition of litchi harvesting robots.
快速、精确地检测密集的荔枝果实并确定其成熟度,对于荔枝果园的产量估计和机器人采摘具有重要的实际意义。复杂的生长环境、密集的分布以及叶片、树枝和其他荔枝果实的随机遮挡等因素,容易导致基于计算机视觉的预测输出偏离实际值。本研究提出了一种基于改进的You Only Look Once版本5(YOLOv5)模型的快速、精确的荔枝果实检测方法及应用软件,可用于果园荔枝的检测和产量估计。首先,建立了不同成熟度水平的荔枝数据集。其次,选择YOLOv5s模型作为改进模型的基础版本。使用ShuffleNet v2作为改进的主干网络,然后对主干网络进行微调以简化模型结构。在特征融合阶段,引入CBAM模块以进一步细化荔枝的有效特征信息。考虑到密集荔枝果实尺寸小的特点,在优化网络结构时,将1280×1280用作改进模型的输入尺寸。为了评估所提方法的性能,我们进行了消融实验,并在测试集上与其他模型进行了比较。结果表明,与原始模型相比,改进模型的平均精度均值(mAP)提高了3.5%,模型大小压缩了62.77%。改进模型大小为5.1MB,在置信度为0.5时,每秒帧数(FPS)为78.13帧/秒。该模型在不同场景下的精度和鲁棒性方面表现良好。此外,我们基于改进模型开发了一个用于荔枝计数和产量估计的安卓应用程序。从实验中可知,应用测试与实际结果之间的相关系数为0.9879。综上所述,我们的改进方法在大规模上实现了高精度、轻量级和快速检测性能。该方法可为便携式产量估计和荔枝采摘机器人的视觉识别提供技术手段。