Suppr超能文献

BHC-YOLOV8:一种改进的基于YOLOv8的BHC目标检测模型,用于实际场景中的茶叶病害和缺陷检测。

BHC-YOLOV8 : improved YOLOv8-based BHC target detection model for tea leaf disease and defect in real-world scenarios.

作者信息

Zhan BaiShao, Xiong Xi, Li Xiaoli, Luo Wei

机构信息

School of Electrical and Automation Engineering, East China Jiaotong University, Nanchang, China.

College of Biosystems Engineering and Food Science, Zhejiang University, Hangzhou, China.

出版信息

Front Plant Sci. 2024 Dec 2;15:1492504. doi: 10.3389/fpls.2024.1492504. eCollection 2024.

Abstract

INTRODUCTION

The detection efficiency of tea diseases and defects ensures the quality and yield of tea. However, in actual production, on the one hand, the tea plantation has high mountains and long roads, and the safety of inspection personnel cannot be guaranteed; on the other hand, the inspection personnel have factors such as lack of experience and fatigue, resulting in incomplete and slow testing results. Introducing visual inspection technology can avoid the above problems.

METHODS

Firstly, a dynamic sparse attention mechanism (Bi Former) is introduced into the model backbone. It filters out irrelevant key value pairs at the coarse region level, utilizing sparsity to save computation and memory; jointly apply fine region token to token attention in the remaining candidate regions. Secondly, Haar wavelets are introduced to improve the down sampling module. By processing the input information flow horizontally, vertically, and diagonally, the original image is reconstructed. Finally, a new feature fusion network is designed using a multi-head attention mechanism to decompose the main network into several cascaded stages, each stage comprising a sub-backbone for parallel processing of different features. Simultaneously, skip connections are performed on features from the same layer, and unbounded fusion weight normalization is introduced to constrain the range of each weight value.

RESULTS

After the above improvements, the confidence level of the current mainstream models increased by 7.1%, mAP0.5 increased by 8%, and reached 94.5%. After conducting ablation experiments and comparing with mainstream models, the feature fusion network proposed in this paper reduced computational complexity by 10.6 GFlops, increased confidence by 2.7%, and increased mAP0.5 by 3.2%.

DISCUSSION

This paper developed a new network based on YOLOv8 to overcome the difficulties of tea diseases and defects such as small target, multiple occlusion and complex background.

摘要

引言

茶叶病虫害及缺陷的检测效率关乎茶叶的品质与产量。然而,在实际生产中,一方面茶园地势起伏、道路漫长,巡检人员的安全难以保障;另一方面,巡检人员存在经验不足、疲劳等因素,导致检测结果不完整且速度缓慢。引入视觉检测技术可避免上述问题。

方法

首先,在模型主干中引入动态稀疏注意力机制(Bi Former)。它在粗粒度区域层面过滤掉无关的键值对,利用稀疏性节省计算量和内存;在剩余候选区域联合应用细粒度区域的令牌到令牌注意力。其次,引入哈尔小波改进下采样模块。通过对输入信息流进行水平、垂直和对角线处理,重建原始图像。最后,使用多头注意力机制设计一种新的特征融合网络,将主网络分解为几个级联阶段,每个阶段包含一个用于并行处理不同特征的子主干。同时,对来自同一层的特征进行跳跃连接,并引入无界融合权重归一化来约束每个权重值的范围。

结果

经过上述改进,当前主流模型的置信度提升了7.1%,mAP0.5提升了8%,达到了94.5%。在进行消融实验并与主流模型比较后,本文提出的特征融合网络将计算复杂度降低了10.6 GFlops,置信度提高了2.7%,mAP0.5提高了3.2%。

讨论

本文基于YOLOv8开发了一种新网络,以克服茶叶病虫害及缺陷检测中存在的小目标、多遮挡和复杂背景等难题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/851b/11649061/dbff1c056a88/fpls-15-1492504-g009.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验