Wang Yan, Rong Qianjie, Hu Chunhua
College of Information Science and Technology & Artificial Intelligence, Nanjing Forestry University, Nanjing 210037, China.
Plants (Basel). 2024 Nov 20;13(22):3253. doi: 10.3390/plants13223253.
Recognizing ripe tomatoes is a crucial aspect of tomato picking. To ensure the accuracy of inspection results, You Only Look Once version 9 (YOLOv9) has been explored as a fruit detection algorithm. To tackle the challenge of identifying tomatoes and the low accuracy of small object detection in complex environments, we propose a ripe tomato recognition algorithm based on an enhanced YOLOv9-C model. After collecting tomato data, we used Mosaic for data augmentation, which improved model robustness and enriched experimental data. Improvements were made to the feature extraction and down-sampling modules, integrating HGBlock and SPD-ADown modules into the YOLOv9 model. These measures resulted in high detection performance with precision and recall rates of 97.2% and 92.3% in horizontal and vertical experimental comparisons, respectively. The module-integrated model improved accuracy and recall by 1.3% and 1.1%, respectively, and also reduced inference time by 1 ms compared to the original model. The inference time of this model was 14.7 ms, which is 16 ms better than the RetinaNet model. This model was tested accurately with mAP@0.5 (%) up to 98%, which is 9.6% higher than RetinaNet. Its increased speed and accuracy make it more suitable for practical applications. Overall, this model provides a reliable technique for recognizing ripe tomatoes during the picking process.
识别成熟番茄是番茄采摘的一个关键环节。为确保检测结果的准确性,已探索将You Only Look Once版本9(YOLOv9)作为一种水果检测算法。为应对复杂环境中识别番茄的挑战以及小目标检测精度较低的问题,我们提出了一种基于增强型YOLOv9 - C模型的成熟番茄识别算法。收集番茄数据后,我们使用Mosaic进行数据增强,这提高了模型的鲁棒性并丰富了实验数据。对特征提取和下采样模块进行了改进,将HGBlock和SPD - ADown模块集成到YOLOv9模型中。这些措施带来了较高的检测性能,在水平和垂直实验比较中,精确率和召回率分别达到97.2%和92.3%。与原始模型相比,集成模块后的模型精确率和召回率分别提高了1.3%和1.1%,推理时间也减少了1毫秒。该模型的推理时间为14.7毫秒,比RetinaNet模型快16毫秒。该模型在mAP@0.5(%)高达98%的情况下进行了准确测试,比RetinaNet高9.6%。其速度和准确性的提升使其更适合实际应用。总体而言,该模型为采摘过程中识别成熟番茄提供了一种可靠的技术。