• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

RFAG-YOLO:一种用于无人机图像中小目标检测的感受野注意力引导YOLO网络。

RFAG-YOLO: A Receptive Field Attention-Guided YOLO Network for Small-Object Detection in UAV Images.

作者信息

Wei Chengmeng, Wang Wenhong

机构信息

College of Computer Science, Liaocheng University, Liaocheng 252059, China.

出版信息

Sensors (Basel). 2025 Mar 30;25(7):2193. doi: 10.3390/s25072193.

DOI:10.3390/s25072193
PMID:40218706
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11991089/
Abstract

The YOLO series of object detection methods have achieved significant success in a wide range of computer vision tasks due to their efficiency and accuracy. However, detecting small objects in UAV images remains a formidable challenge due to factors such as a low resolution, complex background interference, and significant scale variations, which collectively degrade the quality of feature extraction and limit detection performance. To address these challenges, we propose the receptive field attention-guided YOLO (RFAG-YOLO) method, an advanced adaptation of YOLOv8 tailored for small-object detection in UAV imagery, with a focus on improving feature representation and detection robustness. To this end, we introduce a novel network building block, termed the receptive field network block (RFN block), which leverages dynamic kernel parameter adjustments to enhance the model's ability to capture fine-grained local details. To effectively harness multi-scale features, we designed an enhanced FasterNet module based on RFN blocks as the core component of the backbone network in RFAG-YOLO, enabling robust feature extraction across varying resolutions. This approach achieves a balance of semantic information by employing staged downsampling and a hierarchical arrangement of RFN blocks, ensuring optimal feature representation across different resolutions. Additionally, we introduced a Scale-Aware Feature Amalgamation (SAF) component prior to the detection head of RFAG-YOLO. This component employs a scale attention mechanism to dynamically weight features from both higher and lower layers, facilitating richer information flow and significantly improving the model's robustness to complex backgrounds and scale variations. Experimental results on the VisDrone2019 dataset demonstrated that RFAG-YOLO outperformed state-of-the-art models, including YOLOv7, YOLOv8, YOLOv10, and YOLOv11, in terms of detection accuracy and efficiency. In particular, RFAG-YOLO achieved an mAP50 of 38.9%, representing substantial improvements over multiple baseline models: a 12.43% increase over YOLOv7, a 5.99% improvement over YOLOv10, and significant gains of 16.12% compared to YOLOv8n and YOLOv11. Moreover, compared to the larger YOLOv8s model, RFAG-YOLO achieved 97.98% of its mAP50 performance while utilizing only 53.51% of the parameters, highlighting its exceptional efficiency in terms of the performance-to-parameter ratio and making it highly suitable for resource-constrained UAV applications. These results underscore the substantial potential of RFAG-YOLO for real-world UAV applications, particularly in scenarios demanding accurate detection of small objects under challenging conditions such as varying lighting, complex backgrounds, and diverse scales.

摘要

YOLO系列目标检测方法因其高效性和准确性,在广泛的计算机视觉任务中取得了显著成功。然而,由于无人机图像分辨率低、背景干扰复杂以及尺度变化大等因素,检测其中的小目标仍然是一项艰巨的挑战,这些因素共同降低了特征提取的质量并限制了检测性能。为应对这些挑战,我们提出了感受野注意力引导的YOLO(RFAG-YOLO)方法,这是一种针对无人机图像中小目标检测对YOLOv8进行的先进改进,重点在于提高特征表示和检测鲁棒性。为此,我们引入了一种新颖的网络构建块,称为感受野网络块(RFN块),它利用动态内核参数调整来增强模型捕捉细粒度局部细节的能力。为有效利用多尺度特征,我们设计了一种基于RFN块的增强型FasterNet模块,作为RFAG-YOLO主干网络的核心组件,能够在不同分辨率下进行稳健的特征提取。这种方法通过采用分阶段下采样和RFN块的分层排列实现了语义信息的平衡,确保在不同分辨率下都能实现最佳特征表示。此外,我们在RFAG-YOLO的检测头之前引入了尺度感知特征融合(SAF)组件。该组件采用尺度注意力机制对来自较高层和较低层的特征进行动态加权,促进更丰富的信息流,并显著提高模型对复杂背景和尺度变化的鲁棒性。在VisDrone2019数据集上的实验结果表明,RFAG-YOLO在检测精度和效率方面优于包括YOLOv7、YOLOv8、YOLOv10和YOLOv11在内的现有最先进模型。特别是,RFAG-YOLO实现了38.9%的mAP50,相对于多个基线模型有显著提升:比YOLOv7提高了12.43%,比YOLOv10提高了5.99%,与YOLOv8n和YOLOv11相比分别有16.12%的显著提升。此外,与更大的YOLOv8s模型相比,RFAG-YOLO在仅使用53.51%参数的情况下实现了其mAP50性能的97.98%,突出了其在性能与参数比方面的卓越效率,使其非常适合资源受限的无人机应用。这些结果强调了RFAG-YOLO在实际无人机应用中的巨大潜力,特别是在诸如光照变化、复杂背景和多样尺度等具有挑战性条件下需要精确检测小目标的场景中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/828d0174ed5f/sensors-25-02193-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/e0062f30bf44/sensors-25-02193-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/1a2994ce44b0/sensors-25-02193-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/64ba1008e1aa/sensors-25-02193-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/a0c521a721ed/sensors-25-02193-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/dc8b961c4282/sensors-25-02193-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/b78699db42fc/sensors-25-02193-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/9b0d6c8237f7/sensors-25-02193-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/ea88b93b3b67/sensors-25-02193-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/0412d21de37b/sensors-25-02193-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/5a49e3b30c63/sensors-25-02193-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/828d0174ed5f/sensors-25-02193-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/e0062f30bf44/sensors-25-02193-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/1a2994ce44b0/sensors-25-02193-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/64ba1008e1aa/sensors-25-02193-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/a0c521a721ed/sensors-25-02193-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/dc8b961c4282/sensors-25-02193-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/b78699db42fc/sensors-25-02193-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/9b0d6c8237f7/sensors-25-02193-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/ea88b93b3b67/sensors-25-02193-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/0412d21de37b/sensors-25-02193-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/5a49e3b30c63/sensors-25-02193-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6b72/11991089/828d0174ed5f/sensors-25-02193-g011.jpg

相似文献

1
RFAG-YOLO: A Receptive Field Attention-Guided YOLO Network for Small-Object Detection in UAV Images.RFAG-YOLO:一种用于无人机图像中小目标检测的感受野注意力引导YOLO网络。
Sensors (Basel). 2025 Mar 30;25(7):2193. doi: 10.3390/s25072193.
2
A multi-scale small object detection algorithm SMA-YOLO for UAV remote sensing images.一种用于无人机遥感图像的多尺度小目标检测算法SMA-YOLO
Sci Rep. 2025 Mar 18;15(1):9255. doi: 10.1038/s41598-025-92344-7.
3
Fusion of multi-scale attention for aerial images small-target detection model based on PARE-YOLO.基于PARE-YOLO的航空图像小目标检测模型的多尺度注意力融合
Sci Rep. 2025 Feb 8;15(1):4753. doi: 10.1038/s41598-025-88857-w.
4
IV-YOLO: A Lightweight Dual-Branch Object Detection Network.IV-YOLO:一种轻量级双分支目标检测网络。
Sensors (Basel). 2024 Sep 24;24(19):6181. doi: 10.3390/s24196181.
5
GFI-YOLOv8: Sika Deer Posture Recognition Target Detection Method Based on YOLOv8.GFI-YOLOv8:基于YOLOv8的梅花鹿姿态识别目标检测方法
Animals (Basel). 2024 Sep 11;14(18):2640. doi: 10.3390/ani14182640.
6
YOLO-MARS: An Enhanced YOLOv8n for Small Object Detection in UAV Aerial Imagery.YOLO-MARS:一种用于无人机航空图像中小目标检测的增强型YOLOv8n。
Sensors (Basel). 2025 Apr 17;25(8):2534. doi: 10.3390/s25082534.
7
Accurate UAV Small Object Detection Based on HRFPN and EfficentVMamba.基于HRFPN和高效VMamba的无人机小目标精确检测
Sensors (Basel). 2024 Jul 31;24(15):4966. doi: 10.3390/s24154966.
8
Efficient Optimized YOLOv8 Model with Extended Vision.具有扩展视觉的高效优化YOLOv8模型
Sensors (Basel). 2024 Oct 10;24(20):6506. doi: 10.3390/s24206506.
9
A lightweight wheat ear counting model in UAV images based on improved YOLOv8.一种基于改进YOLOv8的无人机图像中小麦穗计数轻量级模型。
Front Plant Sci. 2025 Feb 11;16:1536017. doi: 10.3389/fpls.2025.1536017. eCollection 2025.
10
Efficient Small Object Detection You Only Look Once: A Small Object Detection Algorithm for Aerial Images.高效小目标检测:你只需看一次——一种用于航空图像的小目标检测算法
Sensors (Basel). 2024 Nov 2;24(21):7067. doi: 10.3390/s24217067.

引用本文的文献

1
AeroLight: A Lightweight Architecture with Dynamic Feature Fusion for High-Fidelity Small-Target Detection in Aerial Imagery.AeroLight:一种用于航空图像中高保真小目标检测的具有动态特征融合的轻量级架构。
Sensors (Basel). 2025 Aug 30;25(17):5369. doi: 10.3390/s25175369.
2
UAV-DETR: An Enhanced RT-DETR Architecture for Efficient Small Object Detection in UAV Imagery.无人机检测Transformer(UAV-DETR):一种用于无人机图像中高效小目标检测的增强型实时检测Transformer(RT-DETR)架构。
Sensors (Basel). 2025 Jul 24;25(15):4582. doi: 10.3390/s25154582.
3
YOLO-MARS: An Enhanced YOLOv8n for Small Object Detection in UAV Aerial Imagery.

本文引用的文献

1
Drone-DETR: Efficient Small Object Detection for Remote Sensing Image Using Enhanced RT-DETR Model.无人机DETR:使用增强型RT-DETR模型对遥感图像进行高效小目标检测
Sensors (Basel). 2024 Aug 24;24(17):5496. doi: 10.3390/s24175496.
2
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.
3
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition.
YOLO-MARS:一种用于无人机航空图像中小目标检测的增强型YOLOv8n。
Sensors (Basel). 2025 Apr 17;25(8):2534. doi: 10.3390/s25082534.
空间金字塔池化在深度卷积网络中的视觉识别。
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16. doi: 10.1109/TPAMI.2015.2389824.