

An improved YOLO v4 used for grape detection in unstructured environment.

Author information

Guo Canzhi, Zheng Shiwu, Cheng Guanggui, Zhang Yue, Ding Jianning

Affiliations

Institute of Intelligent Flexible Mechatronics, Jiangsu University, Zhenjiang, China.

Jiangsu Collaborative Innovation Center of Photovoltaic Science and Engineering, Changzhou, China.

Publication information

Front Plant Sci. 2023 Jul 13;14:1209910. doi: 10.3389/fpls.2023.1209910. eCollection 2023.

Abstract

Visual recognition is the most critical function of a harvesting robot, and the accuracy of the harvesting action depends on the performance of visual recognition. However, unstructured environments, with severe occlusion, overlapping fruits, illumination changes, complex backgrounds, and even heavy fog, pose a series of serious challenges to the detection accuracy of the recognition algorithm. Hence, this paper proposes an improved YOLO v4 model, called YOLO v4+, to cope with the challenges posed by unstructured environments. The output of each Resblock_body in the backbone is processed by a simple, parameter-free attention mechanism for full-dimensional refinement of the extracted features. Further, to alleviate the problem of feature information loss, a multi-scale feature fusion module with fusion weights and a skip-connection structure is proposed. In addition, the focal loss function is adopted, with the hyperparameters α and γ set to 0.75 and 2, respectively. The experimental results show that the average precision of the YOLO v4+ model is 94.25% and the F1 score is 93%, which are 3.35% and 3% higher than those of the original YOLO v4, respectively. Compared with several state-of-the-art detection models, YOLO v4+ not only achieves the best overall performance but also better generalization ability. Selecting the appropriate augmentation method for a specific working condition can greatly improve detection accuracy. Applying the proposed method to harvesting robots may enhance the applicability and robustness of the robotic system.
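The focal loss adopted here (with α = 0.75 and γ = 2) is the standard formulation FL(p_t) = −α_t (1 − p_t)^γ log(p_t): the (1 − p_t)^γ term down-weights easy, well-classified examples so that training focuses on hard ones. A minimal sketch of the binary case, with the function name chosen here for illustration (the paper's actual implementation is not shown in the abstract):

```python
import math

def focal_loss(p, y, alpha=0.75, gamma=2.0):
    """Binary focal loss for a single prediction.

    p: predicted probability of the positive class (0 < p < 1)
    y: ground-truth label, 1 (positive) or 0 (negative)
    alpha, gamma: the hyperparameter values reported in the abstract
    """
    # p_t is the probability the model assigns to the true class
    p_t = p if y == 1 else 1.0 - p
    # alpha_t balances the positive/negative classes;
    # (1 - p_t)^gamma suppresses the loss of easy examples
    alpha_t = alpha if y == 1 else 1.0 - alpha
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)

# An easy positive (p = 0.9) contributes far less than a hard one (p = 0.3)
easy = focal_loss(0.9, 1)
hard = focal_loss(0.3, 1)
```

With γ = 0 and α = 1 the expression reduces to the ordinary cross-entropy −log(p_t), which is a quick sanity check on any implementation.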


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ce60/10374324/1469d149ada0/fpls-14-1209910-g001.jpg
