• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于改进YOLOv8的轻量化云南小米辣检测与姿态估计

A lightweight Yunnan Xiaomila detection and pose estimation based on improved YOLOv8.

作者信息

Wang Fenghua, Tang Yuan, Gong Zaipeng, Jiang Jin, Chen Yu, Xu Qiang, Hu Peng, Zhu Hailong

机构信息

Faculty of Modern Agricultural Engineering, Kunming University of Science and Technology, Kunming, Yunnan, China.

Engineering Training Center, Kunming University of Science and Technology, Kunming, Yunnan, China.

出版信息

Front Plant Sci. 2024 Jun 5;15:1421381. doi: 10.3389/fpls.2024.1421381. eCollection 2024.

DOI:10.3389/fpls.2024.1421381
PMID:38903433
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11188358/
Abstract

INTRODUCTION

Yunnan Xiaomila is a pepper variety whose flowers and fruits become mature at the same time and multiple times a year. The distinction between the fruits and the background is low and the background is complex. The targets are small and difficult to identify.

METHODS

This paper aims at the problem of target detection of Yunnan Xiaomila under complex background environment, in order to reduce the impact caused by the small color gradient changes between xiaomila and background and the unclear feature information, an improved PAE-YOLO model is proposed, which combines the EMA attention mechanism and DCNv3 deformable convolution is integrated into the YOLOv8 model, which improves the model's feature extraction capability and inference speed for Xiaomila in complex environments, and achieves a lightweight model. First, the EMA attention mechanism is combined with the C2f module in the YOLOv8 network. The C2f module can well extract local features from the input image, and the EMA attention mechanism can control the global relationship. The two complement each other, thereby enhancing the model's expression ability; Meanwhile, in the backbone network and head network, the DCNv3 convolution module is introduced, which can adaptively adjust the sampling position according to the input feature map, contributing to stronger feature capture capabilities for targets of different scales and a lightweight network. It also uses a depth camera to estimate the posture of Xiaomila, while analyzing and optimizing different occlusion situations. The effectiveness of the proposed method was verified through ablation experiments, model comparison experiments and attitude estimation experiments.

RESULTS

The experimental results indicated that the model obtained an average mean accuracy (mAP) of 88.8%, which was 1.3% higher than that of the original model. Its F1 score reached 83.2, and the GFLOPs and model sizes were 7.6G and 5.7MB respectively. The F1 score ranked the best among several networks, with the model weight and gigabit floating-point operations per second (GFLOPs) being the smallest, which are 6.2% and 8.1% lower than the original model. The loss value was the lowest during training, and the convergence speed was the fastest. Meanwhile, the attitude estimation results of 102 targets showed that the orientation was correctly estimated exceed 85% of the cases, and the average error angle was 15.91°. In the occlusion condition, 86.3% of the attitude estimation error angles were less than 40°, and the average error angle was 23.19°.

DISCUSSION

The results show that the improved detection model can accurately identify Xiaomila targets fruits, has higher model accuracy, less computational complexity, and can better estimate the target posture.

摘要

引言

云南小米辣是一种花和果实同时成熟且一年多次成熟的辣椒品种。果实与背景之间的区分度低且背景复杂。目标物小,难以识别。

方法

本文针对复杂背景环境下云南小米辣的目标检测问题,为减少小米辣与背景之间颜色梯度变化小以及特征信息不清晰所造成的影响,提出了一种改进的PAE-YOLO模型,该模型将EMA注意力机制和DCNv3可变形卷积集成到YOLOv8模型中,提高了模型在复杂环境下对小米辣的特征提取能力和推理速度,并实现了模型轻量化。首先,将EMA注意力机制与YOLOv8网络中的C2f模块相结合。C2f模块能够很好地从输入图像中提取局部特征,而EMA注意力机制可以控制全局关系。二者相辅相成,从而增强了模型的表达能力;同时,在主干网络和头部网络中引入DCNv3卷积模块,其能够根据输入特征图自适应调整采样位置,有助于对不同尺度的目标具有更强的特征捕捉能力以及实现网络轻量化。还使用深度相机估计小米辣的姿态,同时分析和优化不同的遮挡情况。通过消融实验、模型对比实验和姿态估计实验验证了所提方法的有效性。

结果

实验结果表明,该模型的平均精度均值(mAP)达到88.8%,比原模型高1.3%。其F1分数达到83.2,GFLOPs和模型大小分别为7.6G和5.7MB。F1分数在几个网络中排名最佳,模型权重和每秒千兆浮点运算次数(GFLOPs)最小,分别比原模型低6.2%和8.1%。训练期间损失值最低,收敛速度最快。同时,102个目标的姿态估计结果表明,方位估计正确的情况超过85%,平均误差角度为15.91°。在遮挡情况下,86.3%的姿态估计误差角度小于40°,平均误差角度为23.19°。

讨论

结果表明,改进后的检测模型能够准确识别小米辣目标果实,具有更高的模型精度、更低的计算复杂度,并且能够更好地估计目标姿态。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/61b57720a1ff/fpls-15-1421381-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/6560c1217700/fpls-15-1421381-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/ac4fa582142b/fpls-15-1421381-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/8d0244743585/fpls-15-1421381-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/b38e07ec3e9c/fpls-15-1421381-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/8f8fd38c1793/fpls-15-1421381-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/f2e527e8c8a0/fpls-15-1421381-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/bdcf6e914bde/fpls-15-1421381-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/72adaa703f1c/fpls-15-1421381-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/df4ed7cc668b/fpls-15-1421381-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/8f7df0a9baf9/fpls-15-1421381-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/252eb47c6151/fpls-15-1421381-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/bf8e15aa887f/fpls-15-1421381-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/c51974f5f98c/fpls-15-1421381-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/fc97e3509acb/fpls-15-1421381-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/61b57720a1ff/fpls-15-1421381-g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/6560c1217700/fpls-15-1421381-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/ac4fa582142b/fpls-15-1421381-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/8d0244743585/fpls-15-1421381-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/b38e07ec3e9c/fpls-15-1421381-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/8f8fd38c1793/fpls-15-1421381-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/f2e527e8c8a0/fpls-15-1421381-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/bdcf6e914bde/fpls-15-1421381-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/72adaa703f1c/fpls-15-1421381-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/df4ed7cc668b/fpls-15-1421381-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/8f7df0a9baf9/fpls-15-1421381-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/252eb47c6151/fpls-15-1421381-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/bf8e15aa887f/fpls-15-1421381-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/c51974f5f98c/fpls-15-1421381-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/fc97e3509acb/fpls-15-1421381-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7211/11188358/61b57720a1ff/fpls-15-1421381-g015.jpg

相似文献

1
A lightweight Yunnan Xiaomila detection and pose estimation based on improved YOLOv8.一种基于改进YOLOv8的轻量化云南小米辣检测与姿态估计
Front Plant Sci. 2024 Jun 5;15:1421381. doi: 10.3389/fpls.2024.1421381. eCollection 2024.
2
Rapid detection of Yunnan Xiaomila based on lightweight YOLOv7 algorithm.基于轻量级YOLOv7算法的云南小米辣快速检测
Front Plant Sci. 2023 Jun 5;14:1200144. doi: 10.3389/fpls.2023.1200144. eCollection 2023.
3
A Lightweight Strip Steel Surface Defect Detection Network Based on Improved YOLOv8.一种基于改进YOLOv8的轻质带钢表面缺陷检测网络。
Sensors (Basel). 2024 Oct 9;24(19):6495. doi: 10.3390/s24196495.
4
YOLOv8-RMDA: Lightweight YOLOv8 Network for Early Detection of Small Target Diseases in Tea.YOLOv8-RMDA:用于茶中早期检测小目标疾病的轻量级 YOLOv8 网络。
Sensors (Basel). 2024 May 1;24(9):2896. doi: 10.3390/s24092896.
5
YOLOv8-Peas: a lightweight drought tolerance method for peas based on seed germination vigor.YOLOv8-豌豆:一种基于种子萌发活力的豌豆轻量级耐旱方法。
Front Plant Sci. 2023 Sep 28;14:1257947. doi: 10.3389/fpls.2023.1257947. eCollection 2023.
6
A Lightweight and Efficient Multi-Type Defect Detection Method for Transmission Lines Based on DCP-YOLOv8.一种基于DCP-YOLOv8的轻量级高效输电线路多类型缺陷检测方法
Sensors (Basel). 2024 Jul 11;24(14):4491. doi: 10.3390/s24144491.
7
GFI-YOLOv8: Sika Deer Posture Recognition Target Detection Method Based on YOLOv8.GFI-YOLOv8:基于YOLOv8的梅花鹿姿态识别目标检测方法
Animals (Basel). 2024 Sep 11;14(18):2640. doi: 10.3390/ani14182640.
8
Wheat Seed Detection and Counting Method Based on Improved YOLOv8 Model.基于改进 YOLOv8 模型的小麦种子检测与计数方法。
Sensors (Basel). 2024 Mar 3;24(5):1654. doi: 10.3390/s24051654.
9
Improved YOLOv8 for Gas-Flame State Recognition under Low-Pressure Conditions.用于低压条件下气体火焰状态识别的改进型YOLOv8
Sensors (Basel). 2024 Oct 2;24(19):6383. doi: 10.3390/s24196383.
10
Chili Pepper Object Detection Method Based on Improved YOLOv8n.基于改进YOLOv8n的辣椒目标检测方法
Plants (Basel). 2024 Aug 28;13(17):2402. doi: 10.3390/plants13172402.

引用本文的文献

1
SGSNet: a lightweight deep learning model for strawberry growth stage detection.SGSNet:一种用于草莓生长阶段检测的轻量级深度学习模型。
Front Plant Sci. 2024 Dec 2;15:1491706. doi: 10.3389/fpls.2024.1491706. eCollection 2024.

本文引用的文献

1
SGR-YOLO: a method for detecting seed germination rate in wild rice.SGR-YOLO:一种检测野生稻种子发芽率的方法。
Front Plant Sci. 2024 Jan 23;14:1305081. doi: 10.3389/fpls.2023.1305081. eCollection 2023.
2
An effective deep learning approach for the classification of Bacteriosis in peach leave.一种用于桃树叶细菌性病害分类的有效深度学习方法。
Front Plant Sci. 2022 Nov 24;13:1064854. doi: 10.3389/fpls.2022.1064854. eCollection 2022.
3
Fruit Detection and Pose Estimation for Grape Cluster-Harvesting Robot Using Binocular Imagery Based on Deep Neural Networks.
基于深度神经网络的双目图像葡萄串采摘机器人果实检测与姿态估计
Front Robot AI. 2021 Jun 22;8:626989. doi: 10.3389/frobt.2021.626989. eCollection 2021.
4
Real-Time Fruit Recognition and Grasping Estimation for Robotic Apple Harvesting.实时水果识别与机器人采摘苹果的抓取估计。
Sensors (Basel). 2020 Oct 4;20(19):5670. doi: 10.3390/s20195670.
5
Guava Detection and Pose Estimation Using a Low-Cost RGB-D Sensor in the Field.使用低成本 RGB-D 传感器进行田间番石榴检测和姿态估计。
Sensors (Basel). 2019 Jan 21;19(2):428. doi: 10.3390/s19020428.
6
Pose Estimation of Sweet Pepper through Symmetry Axis Detection.通过对称轴检测实现甜椒的姿态估计。
Sensors (Basel). 2018 Sep 13;18(9):3083. doi: 10.3390/s18093083.