Improved Small Object Detection Algorithm CRL-YOLOv5.

Author information

Wang Zhiyuan, Men Shujun, Bai Yuntian, Yuan Yutong, Wang Jiamin, Wang Kanglei, Zhang Lei

Affiliations

School of Information Science and Engineering, Yanshan University, Qinhuangdao 066004, China.

Silesian College of Intelligent Science and Engineering, Yanshan University, Qinhuangdao 066004, China.

Publication information

Sensors (Basel). 2024 Oct 4;24(19):6437. doi: 10.3390/s24196437.

Abstract

Detecting small objects in images is challenging because they occupy few pixels and yield few extractable features, which often leads to missed or false detections. To address these challenges and improve detection accuracy, this paper presents an improved small object detection algorithm, CRL-YOLOv5. The approach integrates the Convolutional Block Attention Module (CBAM) into the C3 module of the backbone network to improve the localization accuracy of small objects. The Receptive Field Block (RFB) module is introduced to expand the model's receptive field and make fuller use of contextual information. The network architecture is also restructured to include an additional detection layer dedicated to small objects, allowing for deeper feature extraction from shallow layers. When tested on the VisDrone2019 small object dataset, CRL-YOLOv5 achieved an mAP50 of 39.2%, a 5.4% improvement over the original YOLOv5.
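The abstract does not give layer strides or dilation rates, so the following is only a rough sketch of the arithmetic behind two of the ideas it names: an extra detection layer taken from a shallower feature map, and a wider receptive field from dilated convolutions as used in RFB-style blocks. The 640-pixel input, the stride of 4 for the added head, and the dilation rates 1, 3, and 5 are assumptions chosen for illustration, not values reported by the paper. The strides 8, 16, and 32 are those of the standard YOLOv5 heads.

```python
# Illustrative arithmetic only. IMAGE_SIZE, EXTRA_STRIDE and the dilation
# rates below are assumptions, not values taken from the paper.

IMAGE_SIZE = 640                 # assumed square input size
BASELINE_STRIDES = [8, 16, 32]   # standard YOLOv5 detection heads
EXTRA_STRIDE = 4                 # hypothetical stride of the added small-object head


def grid_size(image_size: int, stride: int) -> int:
    """Number of cells along one side of a detection head's feature map."""
    return image_size // stride


def effective_kernel(kernel: int, dilation: int) -> int:
    """Extent in pixels, along one axis, covered by a dilated kernel."""
    return kernel + (kernel - 1) * (dilation - 1)


def describe_heads(image_size: int, strides: list[int]) -> list[tuple[int, int]]:
    """Pair each stride with the grid size of its detection head."""
    return [(s, grid_size(image_size, s)) for s in strides]


if __name__ == "__main__":
    for stride, cells in describe_heads(IMAGE_SIZE, [EXTRA_STRIDE] + BASELINE_STRIDES):
        print(f"stride {stride:>2}: {cells}x{cells} grid, "
              f"each cell covers {stride}x{stride} px")
    for dilation in (1, 3, 5):
        print(f"3x3 conv, dilation {dilation}: "
              f"covers {effective_kernel(3, dilation)} px per axis")
```

Under these assumptions, an object 10 pixels wide spans about a third of a cell at stride 32 but 2.5 cells at stride 4, which is one intuition for why a shallower detection layer can help with small objects.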

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e88/11479313/161630a147ad/sensors-24-06437-g001.jpg
