Suppr超能文献

基于改进YOLOv8的多阶段番茄果实识别方法

Multi-stage tomato fruit recognition method based on improved YOLOv8.

作者信息

Fu Yuliang, Li Weiheng, Li Gang, Dong Yuanzhi, Wang Songlin, Zhang Qingyang, Li Yanbin, Dai Zhiguang

机构信息

North China University of Water Resources and Electric PowerSchool of Water Conservancy, Zhengzhou, China.

State Key Laboratory of Eco-Hydraulics in the Northwest Arid Region of China, Xi'an University of Technology, Xi'an, Shanxi, China.

出版信息

Front Plant Sci. 2024 Sep 5;15:1447263. doi: 10.3389/fpls.2024.1447263. eCollection 2024.

Abstract

INTRODUCTION

In the field of facility agriculture, the accurate identification of tomatoes at multiple stages has become a significant area of research. However, accurately identifying and localizing tomatoes in complex environments is a formidable challenge. Complex working conditions can impair the performance of conventional detection techniques, underscoring the necessity for more robust methods.

METHODS

To address this issue, we propose a novel model of YOLOv8-EA for the localization and identification of tomato fruit. The model incorporates a number of significant enhancements. Firstly, the EfficientViT network replaces the original YOLOv8 backbone network, which has the effect of reducing the number of model parameters and improving the capability of the network to extract features. Secondly, some of the convolutions were integrated into the C2f module to create the C2f-Faster module, which facilitates the inference process of the model. Third, the bounding box loss function was modified to SIoU, thereby accelerating model convergence and enhancing detection accuracy. Lastly, the Auxiliary Detection Head (Aux-Head) module was incorporated to augment the network's learning capacity.

RESULT

The accuracy, recall, and average precision of the YOLOv8-EA model on the self-constructed dataset were 91.4%, 88.7%, and 93.9%, respectively, with a detection speed of 163.33 frames/s. In comparison to the baseline YOLOv8n network, the model weight was increased by 2.07 MB, and the accuracy, recall, and average precision were enhanced by 10.9, 11.7, and 7.2 percentage points, respectively. The accuracy, recall, and average precision increased by 10.9, 11.7, and 7.2 percentage points, respectively, while the detection speed increased by 42.1%. The detection precision for unripe, semi-ripe, and ripe tomatoes was 97.1%, 91%, and 93.7%, respectively. On the public dataset, the accuracy, recall, and average precision of YOLOv8-EA are 91%, 89.2%, and 95.1%, respectively, and the detection speed is 1.8 ms, which is 4, 4.21, and 3.9 percentage points higher than the baseline YOLOv8n network. This represents an 18.2% improvement in detection speed, which demonstrates good generalization ability.

DISCUSSION

The reliability of YOLOv8-EA in identifying and locating multi-stage tomato fruits in complex environments demonstrates its efficacy in this regard and provides a technical foundation for the development of intelligent tomato picking devices.

摘要

引言

在设施农业领域,对番茄多个生长阶段进行准确识别已成为一个重要的研究领域。然而,在复杂环境中准确识别和定位番茄是一项艰巨的挑战。复杂的工作条件会削弱传统检测技术的性能,这凸显了采用更强大方法的必要性。

方法

为解决这一问题,我们提出了一种用于番茄果实定位和识别的新型YOLOv8-EA模型。该模型包含多项重大改进。首先,EfficientViT网络取代了原始的YOLOv8主干网络,具有减少模型参数数量和提高网络特征提取能力的效果。其次,将部分卷积集成到C2f模块中创建了C2f-Faster模块,这有助于模型的推理过程。第三,将边界框损失函数修改为SIoU,从而加速模型收敛并提高检测精度。最后,加入了辅助检测头(Aux-Head)模块以增强网络的学习能力。

结果

YOLOv8-EA模型在自建数据集上的准确率、召回率和平均精度分别为91.4%、88.7%和93.9%,检测速度为163.33帧/秒。与基线YOLOv8n网络相比,模型权重增加了2.07MB,准确率、召回率和平均精度分别提高了10.9、11.7和7.2个百分点。准确率、召回率和平均精度分别提高了10.9、11.7和7.2个百分点,同时检测速度提高了42.1%。未成熟、半成熟和成熟番茄的检测精度分别为97.1%、91%和93.7%。在公共数据集上,YOLOv8-EA的准确率、召回率和平均精度分别为91%、89.2%和95.1%,检测速度为1.8毫秒,比基线YOLOv8n网络分别高4、4.21和3.9个百分点。这代表检测速度提高了18.2%,表明其具有良好的泛化能力。

讨论

YOLOv8-EA在复杂环境中识别和定位多阶段番茄果实的可靠性证明了其在这方面的有效性,并为智能番茄采摘设备的开发提供了技术基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4d24/11410604/13e3121676b7/fpls-15-1447263-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验