• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多模态茶梢的实时密集小目标检测算法

Real-time dense small object detection algorithm based on multi-modal tea shoots.

作者信息

Shuai Luyu, Chen Ziao, Li Zhiyong, Li Hongdan, Zhang Boda, Wang Yuchao, Mu Jiong

机构信息

College of Information Engineering, Sichuan Agricultural University, Ya'an, China.

Ya'an Digital Agricultural Engineering Technology Research Center, Sichuan Agricultural University, Ya'an, China.

出版信息

Front Plant Sci. 2023 Jul 18;14:1224884. doi: 10.3389/fpls.2023.1224884. eCollection 2023.

DOI:10.3389/fpls.2023.1224884
PMID:37534292
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10391178/
Abstract

INTRODUCTION

The difficulties in tea shoot recognition are that the recognition is affected by lighting conditions, it is challenging to segment images with similar backgrounds to the shoot color, and the occlusion and overlap between leaves.

METHODS

To solve the problem of low accuracy of dense small object detection of tea shoots, this paper proposes a real-time dense small object detection algorithm based on multimodal optimization. First, RGB, depth, and infrared images are collected form a multimodal image set, and a complete shoot object labeling is performed. Then, the YOLOv5 model is improved and applied to dense and tiny tea shoot detection. Secondly, based on the improved YOLOv5 model, this paper designs two data layer-based multimodal image fusion methods and a feature layerbased multimodal image fusion method; meanwhile, a cross-modal fusion module (FFA) based on frequency domain and attention mechanisms is designed for the feature layer fusion method to adaptively align and focus critical regions in intra- and inter-modal channel and frequency domain dimensions. Finally, an objective-based scale matching method is developed to further improve the detection performance of small dense objects in natural environments with the assistance of transfer learning techniques.

RESULTS AND DISCUSSION

The experimental results indicate that the improved YOLOv5 model increases the mAP50 value by 1.7% compared to the benchmark model with fewer parameters and less computational effort. Compared with the single modality, the multimodal image fusion method increases the mAP50 value in all cases, with the method introducing the FFA module obtaining the highest mAP50 value of 0.827. After the pre-training strategy is used after scale matching, the mAP values can be improved by 1% and 1.4% on the two datasets. The research idea of multimodal optimization in this paper can provide a basis and technical support for dense small object detection.

摘要

引言

茶梢识别的困难在于识别受光照条件影响,分割与茶梢颜色背景相似的图像具有挑战性,以及叶片之间的遮挡和重叠。

方法

为解决茶梢密集小目标检测准确率低的问题,本文提出一种基于多模态优化的实时密集小目标检测算法。首先,从多模态图像集中采集RGB、深度和红外图像,并进行完整的梢目标标注。然后,对YOLOv5模型进行改进并应用于密集微小茶梢检测。其次,基于改进的YOLOv5模型,本文设计了两种基于数据层的多模态图像融合方法和一种基于特征层的多模态图像融合方法;同时,为特征层融合方法设计了一种基于频域和注意力机制的跨模态融合模块(FFA),以在模态内和模态间的通道和频域维度上自适应地对齐和聚焦关键区域。最后,开发了一种基于目标的尺度匹配方法,借助迁移学习技术进一步提高自然环境中密集小目标的检测性能。

结果与讨论

实验结果表明,改进后的YOLOv5模型与基准模型相比,mAP50值提高了1.7%,且参数更少、计算量更小。与单模态相比,多模态图像融合方法在所有情况下均提高了mAP50值,引入FFA模块的方法获得了最高的mAP50值0.827。在尺度匹配后使用预训练策略,两个数据集上的mAP值可分别提高1%和1.4%。本文的多模态优化研究思路可为密集小目标检测提供依据和技术支持。

相似文献

1
Real-time dense small object detection algorithm based on multi-modal tea shoots.基于多模态茶梢的实时密集小目标检测算法
Front Plant Sci. 2023 Jul 18;14:1224884. doi: 10.3389/fpls.2023.1224884. eCollection 2023.
2
CCGL-YOLOV5:A cross-modal cross-scale global-local attention YOLOV5 lung tumor detection model.CCGL-YOLOV5:一种跨模态跨尺度全局-局部注意力 YOLOV5 肺肿瘤检测模型。
Comput Biol Med. 2023 Oct;165:107387. doi: 10.1016/j.compbiomed.2023.107387. Epub 2023 Aug 28.
3
SF-YOLOv5: A Lightweight Small Object Detection Algorithm Based on Improved Feature Fusion Mode.SF-YOLOv5:一种基于改进特征融合模式的轻量级小目标检测算法。
Sensors (Basel). 2022 Aug 4;22(15):5817. doi: 10.3390/s22155817.
4
ASG-YOLOv5: Improved YOLOv5 unmanned aerial vehicle remote sensing aerial images scenario for small object detection based on attention and spatial gating.ASG-YOLOv5:基于注意力和空间门控的改进型 YOLOv5 无人机遥感航空图像场景的小目标检测
PLoS One. 2024 Jun 3;19(6):e0298698. doi: 10.1371/journal.pone.0298698. eCollection 2024.
5
MC-YOLOv5: A Multi-Class Small Object Detection Algorithm.MC-YOLOv5:一种多类小目标检测算法。
Biomimetics (Basel). 2023 Aug 2;8(4):342. doi: 10.3390/biomimetics8040342.
6
Weakly Aligned Feature Fusion for Multimodal Object Detection.用于多模态目标检测的弱对齐特征融合
IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4145-4159. doi: 10.1109/TNNLS.2021.3105143. Epub 2025 Feb 28.
7
An Improved YOLOv5-Based Underwater Object-Detection Framework.一种改进的基于 YOLOv5 的水下目标检测框架。
Sensors (Basel). 2023 Apr 3;23(7):3693. doi: 10.3390/s23073693.
8
YOLOv5_mamba: unmanned aerial vehicle object detection based on bidirectional dense feedback network and adaptive gate feature fusion.YOLOv5_mamba:基于双向密集反馈网络和自适应门特征融合的无人机目标检测
Sci Rep. 2024 Sep 27;14(1):22396. doi: 10.1038/s41598-024-73241-x.
9
MSA-YOLOv5: Multi-scale attention-based YOLOv5 for automatic detection of acute ischemic stroke from multi-modality MRI images.MSA-YOLOv5:基于多尺度注意力的 YOLOv5,用于从多模态 MRI 图像中自动检测急性缺血性脑卒中。
Comput Biol Med. 2023 Oct;165:107471. doi: 10.1016/j.compbiomed.2023.107471. Epub 2023 Sep 6.
10
Lightweight tea bud recognition network integrating GhostNet and YOLOv5.融合GhostNet与YOLOv5的轻量级茶芽识别网络
Math Biosci Eng. 2022 Sep 5;19(12):12897-12914. doi: 10.3934/mbe.2022602.

引用本文的文献

1
Small object detection algorithm incorporating swin transformer for tea buds.用于茶芽的融合 Swin 变换小目标检测算法。
PLoS One. 2024 Mar 21;19(3):e0299902. doi: 10.1371/journal.pone.0299902. eCollection 2024.
2
SFHG-YOLO: A Simple Real-Time Small-Object-Detection Method for Estimating Pineapple Yield from Unmanned Aerial Vehicles.SFHG-YOLO:一种用于从无人机估计菠萝产量的简单实时小目标检测方法。
Sensors (Basel). 2023 Nov 17;23(22):9242. doi: 10.3390/s23229242.

本文引用的文献

1
UIU-Net: U-Net in U-Net for Infrared Small Object Detection.UIU-Net:用于红外小目标检测的U-Net嵌套U-Net结构
IEEE Trans Image Process. 2023;32:364-376. doi: 10.1109/TIP.2022.3228497. Epub 2022 Dec 21.
2
Lightweight tea bud recognition network integrating GhostNet and YOLOv5.融合GhostNet与YOLOv5的轻量级茶芽识别网络
Math Biosci Eng. 2022 Sep 5;19(12):12897-12914. doi: 10.3934/mbe.2022602.
3
Automatic monitoring of lettuce fresh weight by multi-modal fusion based deep learning.基于多模态融合深度学习的生菜鲜重自动监测
Front Plant Sci. 2022 Aug 25;13:980581. doi: 10.3389/fpls.2022.980581. eCollection 2022.
4
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.空间金字塔池化在深度卷积网络中的视觉识别。
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.