高效轻量级YOLO：改进用于航空图像的YOLO中的小目标检测

Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images.

作者信息

Hu Mengzi, Li Ziyang, Yu Jiong, Wan Xueqiang, Tan Haotian, Lin Zeyu

机构信息

School of Software, Xinjiang University, Urumqi 830091, China.

College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China.

出版信息

Sensors (Basel). 2023 Jul 15;23(14):6423. doi: 10.3390/s23146423.

DOI:10.3390/s23146423

PMID:37514717

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10385816/

Abstract

The most significant technical challenges of current aerial image object-detection tasks are the extremely low accuracy for detecting small objects that are densely distributed within a scene and the lack of semantic information. Moreover, existing detectors with large parameter scales are unsuitable for aerial image object-detection scenarios oriented toward low-end GPUs. To address this technical challenge, we propose efficient-lightweight You Only Look Once (EL-YOLO), an innovative model that overcomes the limitations of existing detectors and low-end GPU orientation. EL-YOLO surpasses the baseline models in three key areas. Firstly, we design and scrutinize three model architectures to intensify the model's focus on small objects and identify the most effective network structure. Secondly, we design efficient spatial pyramid pooling (ESPP) to augment the representation of small-object features in aerial images. Lastly, we introduce the alpha-complete intersection over union (α-CIoU) loss function to tackle the imbalance between positive and negative samples in aerial images. Our proposed EL-YOLO method demonstrates a strong generalization and robustness for the small-object detection problem in aerial images. The experimental results show that, with the model parameters maintained below 10 M while the input image size was unified at 640 × 640 pixels, the of the EL-YOLOv5 reached 10.8% and 10.7% and enhanced the by 1.9% and 2.2% compared to YOLOv5 on two challenging aerial image datasets, DIOR and VisDrone, respectively.

摘要

当前航空图像目标检测任务最显著的技术挑战在于，检测场景中密集分布的小目标时精度极低，且缺乏语义信息。此外，现有参数规模较大的检测器不适用于面向低端GPU的航空图像目标检测场景。为应对这一技术挑战，我们提出了高效轻量级的单阶段多框检测（EL-YOLO），这是一种创新模型，克服了现有检测器的局限性，并针对低端GPU进行了优化。EL-YOLO在三个关键领域超越了基线模型。首先，我们设计并仔细研究了三种模型架构，以增强模型对小目标的关注，并确定最有效的网络结构。其次，我们设计了高效空间金字塔池化（ESPP），以增强航空图像中小目标特征的表示。最后，我们引入了α-完全交并比（α-CIoU）损失函数，以解决航空图像中正负样本之间的不平衡问题。我们提出的EL-YOLO方法在航空图像小目标检测问题上展现出了强大的泛化能力和鲁棒性。实验结果表明，在将模型参数保持在10M以下且输入图像大小统一为640×640像素的情况下，与YOLOv5相比，EL-YOLOv5在两个具有挑战性的航空图像数据集DIOR和VisDrone上的平均精度均值分别达到了10.8%和10.7%，平均精度提升了1.9%和2.2%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/969a/10385816/a3480987c134/sensors-23-06423-g001.jpg

相似文献

Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images.高效轻量级YOLO：改进用于航空图像的YOLO中的小目标检测

Sensors (Basel). 2023 Jul 15;23(14):6423. doi: 10.3390/s23146423.

OD-YOLO: Robust Small Object Detection Model in Remote Sensing Image with a Novel Multi-Scale Feature Fusion.OD-YOLO：基于新型多尺度特征融合的遥感图像稳健小目标检测模型

Sensors (Basel). 2024 Jun 3;24(11):3596. doi: 10.3390/s24113596.

IV-YOLO: A Lightweight Dual-Branch Object Detection Network.IV-YOLO：一种轻量级双分支目标检测网络。

Sensors (Basel). 2024 Sep 24;24(19):6181. doi: 10.3390/s24196181.

Lightweight Object Detection Algorithm for UAV Aerial Imagery.用于无人机航空影像的轻量级目标检测算法。

Sensors (Basel). 2023 Jun 21;23(13):5786. doi: 10.3390/s23135786.

A New Approach for Super Resolution Object Detection Using an Image Slicing Algorithm and the Segment Anything Model.一种使用图像切片算法和“分割一切”模型的超分辨率目标检测新方法。

Sensors (Basel). 2024 Jul 12;24(14):4526. doi: 10.3390/s24144526.

MSA-YOLO: A Remote Sensing Object Detection Model Based on Multi-Scale Strip Attention.MSA-YOLO：一种基于多尺度带状注意力的遥感目标检测模型。

Sensors (Basel). 2023 Jul 30;23(15):6811. doi: 10.3390/s23156811.

ASG-YOLOv5: Improved YOLOv5 unmanned aerial vehicle remote sensing aerial images scenario for small object detection based on attention and spatial gating.ASG-YOLOv5：基于注意力和空间门控的改进型 YOLOv5 无人机遥感航空图像场景的小目标检测

PLoS One. 2024 Jun 3;19(6):e0298698. doi: 10.1371/journal.pone.0298698. eCollection 2024.

LAG: Layered Objects to Generate Better Anchors for Object Detection in Aerial Images.LAG：用于生成航空图像中目标检测更好的锚点的分层对象。

Sensors (Basel). 2022 May 20;22(10):3891. doi: 10.3390/s22103891.

SenseLite: A YOLO-Based Lightweight Model for Small Object Detection in Aerial Imagery.SenseLite：一种基于YOLO的用于航空影像中小目标检测的轻量级模型。

Sensors (Basel). 2023 Sep 27;23(19):8118. doi: 10.3390/s23198118.

A streamlined approach for intelligent ship object detection using EL-YOLO algorithm.一种使用EL-YOLO算法的智能船舶目标检测的简化方法。

Sci Rep. 2024 Jul 2;14(1):15254. doi: 10.1038/s41598-024-64225-y.

引用本文的文献

Automated Classification of Dental Caries in Bitewing Radiographs Using Machine Learning and the ICCMS Framework.使用机器学习和ICCMS框架对咬合翼片X线照片中的龋齿进行自动分类

Int J Dent. 2025 Aug 21;2025:6644310. doi: 10.1155/ijod/6644310. eCollection 2025.

Partial feature reparameterization and shallow-level interaction for remote sensing object detection.用于遥感目标检测的部分特征重新参数化与浅层交互

Sci Rep. 2025 Aug 5;15(1):28629. doi: 10.1038/s41598-025-14035-7.

LDDP-Net: A Lightweight Neural Network with Dual Decoding Paths for Defect Segmentation of LED Chips.LDDP-Net：一种具有双解码路径的轻量级神经网络，用于LED芯片缺陷分割

Sensors (Basel). 2025 Jan 13;25(2):425. doi: 10.3390/s25020425.

Efficient Small Object Detection You Only Look Once: A Small Object Detection Algorithm for Aerial Images.高效小目标检测：你只需看一次——一种用于航空图像的小目标检测算法

Sensors (Basel). 2024 Nov 2;24(21):7067. doi: 10.3390/s24217067.

SOD-YOLO: A lightweight small object detection framework.SOD-YOLO：一种轻量级小目标检测框架。

Sci Rep. 2024 Oct 27;14(1):25624. doi: 10.1038/s41598-024-77513-4.

FocusDet: an efficient object detector for small object.FocusDet：一种用于小目标的高效目标检测器。

Sci Rep. 2024 May 10;14(1):10697. doi: 10.1038/s41598-024-61136-w.

Post-secondary classroom teaching quality evaluation using small object detection model.基于小目标检测模型的高等院校课堂教学质量评估

Sci Rep. 2024 Mar 9;14(1):5816. doi: 10.1038/s41598-024-56505-4.

本文引用的文献

Enhancing Geometric Factors in Model Learning and Inference for Object Detection and Instance Segmentation.增强目标检测与实例分割模型学习与推理中的几何因素

IEEE Trans Cybern. 2022 Aug;52(8):8574-8586. doi: 10.1109/TCYB.2021.3095305. Epub 2022 Jul 19.

Cascade R-CNN: High Quality Object Detection and Instance Segmentation.级联 R-CNN：高质量目标检测和实例分割。

IEEE Trans Pattern Anal Mach Intell. 2021 May;43(5):1483-1498. doi: 10.1109/TPAMI.2019.2956516. Epub 2021 Apr 1.

Focal Loss for Dense Object Detection.用于密集目标检测的焦散损失

IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):318-327. doi: 10.1109/TPAMI.2018.2858826. Epub 2018 Jul 23.

Mask R-CNN.Mask R-CNN。

IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):386-397. doi: 10.1109/TPAMI.2018.2844175. Epub 2018 Jun 5.

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.DeepLab：基于深度卷积网络、空洞卷积和全连接条件随机场的语义图像分割。

IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848. doi: 10.1109/TPAMI.2017.2699184. Epub 2017 Apr 27.

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN：基于区域建议网络的实时目标检测。

IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.

Fully Convolutional Networks for Semantic Segmentation.全卷积网络用于语义分割。

IEEE Trans Pattern Anal Mach Intell. 2017 Apr;39(4):640-651. doi: 10.1109/TPAMI.2016.2572683. Epub 2016 May 24.

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.空间金字塔池化在深度卷积网络中的视觉识别。

IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

高效轻量级YOLO：改进用于航空图像的YOLO中的小目标检测

Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献