• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于调制变形卷积和大核卷积的小物体检测算法。

A Small Object Detection Algorithm Based on Modulated Deformable Convolution and Large Kernel Convolution.

机构信息

College of Information, Yunnan Normal University, Kunming 650500, Yunnan, China.

Yunnan Province Key Laboratory of Opto-Electronic Information Technology, Yunnan Normal University, Kunming, Yunnan 650500, China.

出版信息

Comput Intell Neurosci. 2023 Jan 24;2023:2506274. doi: 10.1155/2023/2506274. eCollection 2023.

DOI:10.1155/2023/2506274
PMID:36733786
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9889137/
Abstract

Object detection is one of the most critical areas in computer vision, and it plays an essential role in a variety of practice scenarios. However, small object detection has always been a key and difficult problem in the field of object detection. Therefore, considering the balance between the effectiveness and efficiency of the small object detection algorithm, this study proposes an improved YOLOX detection algorithm (BGD-YOLOX) to improve the detection effect of small objects. We present the BigGhost module, which combines the Ghost model with a modulated deformable convolution to optimize the YOLOX for greater accuracy. At the same time, it can reduce the inference time by reducing the number of parameters and the amount of computation. The experimental results show that BGD-YOLOX has a higher average accuracy rate in terms of small target detection, with mAP0.5 up to 88.3% and mAP0.95 up to 56.7%, which surpasses the most advanced object detection algorithms such as EfficientDet, CenterNet, and YOLOv4.

摘要

目标检测是计算机视觉中最关键的领域之一,它在各种实际场景中都起着至关重要的作用。然而,小目标检测一直是目标检测领域的一个关键和难题。因此,考虑到小目标检测算法的有效性和效率之间的平衡,本研究提出了一种改进的 YOLOX 检测算法(BGD-YOLOX),以提高小物体的检测效果。我们提出了 BigGhost 模块,它将 Ghost 模型与调制变形卷积相结合,以优化 YOLOX 以获得更高的准确性。同时,它可以通过减少参数数量和计算量来减少推理时间。实验结果表明,BGD-YOLOX 在小目标检测方面具有更高的平均准确率,mAP0.5 高达 88.3%,mAP0.95 高达 56.7%,超过了最先进的目标检测算法,如 EfficientDet、CenterNet 和 YOLOv4。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/b7edddbdbbc0/CIN2023-2506274.009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/412f1daa0751/CIN2023-2506274.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/bc32e4b125bc/CIN2023-2506274.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/e9865c24eb0d/CIN2023-2506274.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/8d964c2ef9bc/CIN2023-2506274.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/4180597264e4/CIN2023-2506274.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/b08fce99f34c/CIN2023-2506274.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/047c693991b1/CIN2023-2506274.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/fb1ff8ab0c2b/CIN2023-2506274.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/b7edddbdbbc0/CIN2023-2506274.009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/412f1daa0751/CIN2023-2506274.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/bc32e4b125bc/CIN2023-2506274.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/e9865c24eb0d/CIN2023-2506274.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/8d964c2ef9bc/CIN2023-2506274.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/4180597264e4/CIN2023-2506274.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/b08fce99f34c/CIN2023-2506274.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/047c693991b1/CIN2023-2506274.007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/fb1ff8ab0c2b/CIN2023-2506274.008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f2aa/9889137/b7edddbdbbc0/CIN2023-2506274.009.jpg

相似文献

1
A Small Object Detection Algorithm Based on Modulated Deformable Convolution and Large Kernel Convolution.基于调制变形卷积和大核卷积的小物体检测算法。
Comput Intell Neurosci. 2023 Jan 24;2023:2506274. doi: 10.1155/2023/2506274. eCollection 2023.
2
CF-YOLOX: An Autonomous Driving Detection Model for Multi-Scale Object Detection.CF-YOLOX:一种用于多尺度目标检测的自动驾驶检测模型。
Sensors (Basel). 2023 Apr 7;23(8):3794. doi: 10.3390/s23083794.
3
CDYL for infrared and visible light image dense small object detection.用于红外和可见光图像密集小目标检测的CDYL
Sci Rep. 2024 Feb 12;14(1):3510. doi: 10.1038/s41598-024-54146-1.
4
MobileYOLO: Real-Time Object Detection Algorithm in Autonomous Driving Scenarios.移动端 YOLO:自动驾驶场景中的实时目标检测算法。
Sensors (Basel). 2022 Apr 27;22(9):3349. doi: 10.3390/s22093349.
5
Online Detection of Fabric Defects Based on Improved CenterNet with Deformable Convolution.基于改进的中心网络与可变形卷积的织物缺陷在线检测。
Sensors (Basel). 2022 Jun 22;22(13):4718. doi: 10.3390/s22134718.
6
Research on application of helmet wearing detection improved by YOLOv4 algorithm.基于 YOLOv4 算法改进的头盔佩戴检测应用研究。
Math Biosci Eng. 2023 Mar 6;20(5):8685-8707. doi: 10.3934/mbe.2023381.
7
A Domestic Trash Detection Model Based on Improved YOLOX.基于改进 YOLOX 的家庭垃圾检测模型。
Sensors (Basel). 2022 Sep 15;22(18):6974. doi: 10.3390/s22186974.
8
ISA: Ingenious Siamese Attention for object detection algorithms towards complex scenes.ISA:面向复杂场景的目标检测算法的精巧暹罗注意力机制
ISA Trans. 2023 Dec;143:205-220. doi: 10.1016/j.isatra.2023.09.001. Epub 2023 Sep 4.
9
Object Detection Based on Swin Deformable Transformer-BiPAFPN-YOLOX.基于 Swin 变形 Transformer-BiPAFPN-YOLOX 的目标检测。
Comput Intell Neurosci. 2023 Mar 9;2023:4228610. doi: 10.1155/2023/4228610. eCollection 2023.
10
Multi-Object Detection Method in Construction Machinery Swarm Operations Based on the Improved YOLOv4 Model.基于改进YOLOv4模型的工程机械集群作业多目标检测方法
Sensors (Basel). 2022 Sep 26;22(19):7294. doi: 10.3390/s22197294.

本文引用的文献

1
DBGC: Dimension-Based Generic Convolution Block for Object Recognition.DBGC:面向目标识别的基于维度的泛化卷积块。
Sensors (Basel). 2022 Feb 24;22(5):1780. doi: 10.3390/s22051780.
2
Histogram of Oriented Gradient-Based Fusion of Features for Human Action Recognition in Action Video Sequences.基于方向梯度直方图的动作视频序列中人体动作识别特征融合直方图
Sensors (Basel). 2020 Dec 18;20(24):7299. doi: 10.3390/s20247299.
3
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
4
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.空间金字塔池化在深度卷积网络中的视觉识别。
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.