• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

无人机检测Transformer(UAV-DETR):一种用于无人机图像中高效小目标检测的增强型实时检测Transformer(RT-DETR)架构。

UAV-DETR: An Enhanced RT-DETR Architecture for Efficient Small Object Detection in UAV Imagery.

作者信息

Zhou Yu, Wei Yan

机构信息

College of Computer and Information Science, Chongqing Normal University, Chongqing 401331, China.

出版信息

Sensors (Basel). 2025 Jul 24;25(15):4582. doi: 10.3390/s25154582.

DOI:10.3390/s25154582
PMID:40807753
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12349633/
Abstract

To mitigate the technical challenges associated with small-object detection, feature degradation, and spatial-contextual misalignment in UAV-acquired imagery, this paper proposes UAV-DETR, an enhanced Transformer-based object detection model designed for aerial scenarios. Specifically, UAV imagery often suffers from feature degradation due to low resolution and complex backgrounds and from semantic-spatial misalignment caused by dynamic shooting conditions. This work addresses these challenges by enhancing feature perception, semantic representation, and spatial alignment. Architecturally extending the RT-DETR framework, UAV-DETR incorporates three novel modules: the Channel-Aware Sensing Module (CAS), the Scale-Optimized Enhancement Pyramid Module (SOEP), and the newly designed Context-Spatial Alignment Module (CSAM), which integrates the functionalities of contextual and spatial calibration. These components collaboratively strengthen multi-scale feature extraction, semantic representation, and spatial-contextual alignment. The CAS module refines the backbone to improve multi-scale feature perception, while SOEP enhances semantic richness in shallow layers through lightweight channel-weighted fusion. CSAM further optimizes the hybrid encoder by simultaneously correcting contextual inconsistencies and spatial misalignments during feature fusion, enabling more precise cross-scale integration. Comprehensive comparisons with mainstream detectors, including Faster R-CNN and YOLOv5, demonstrate that UAV-DETR achieves superior small-object detection performance in complex aerial scenarios. The performance is thoroughly evaluated in terms of mAP@0.5, parameter count, and computational complexity (GFLOPs). Experiments on the VisDrone2019 dataset benchmark demonstrate that UAV-DETR achieves an mAP@0.5 of 51.6%, surpassing RT-DETR by 3.5% while reducing the number of model parameters from 19.8 million to 16.8 million.

摘要

为了缓解无人机获取图像中与小目标检测、特征退化和空间上下文错位相关的技术挑战,本文提出了UAV-DETR,这是一种专为航空场景设计的基于Transformer的增强型目标检测模型。具体而言,无人机图像常常因分辨率低和背景复杂而出现特征退化,以及因动态拍摄条件导致语义空间错位。这项工作通过增强特征感知、语义表示和空间对齐来应对这些挑战。UAV-DETR在架构上扩展了RT-DETR框架,包含三个新颖的模块:通道感知传感模块(CAS)、尺度优化增强金字塔模块(SOEP)和新设计的上下文空间对齐模块(CSAM),后者整合了上下文和空间校准的功能。这些组件协同加强多尺度特征提取、语义表示和空间上下文对齐。CAS模块优化主干以改善多尺度特征感知,而SOEP通过轻量级通道加权融合增强浅层的语义丰富度。CSAM在特征融合期间通过同时校正上下文不一致和空间错位进一步优化混合编码器,实现更精确的跨尺度整合。与主流检测器(包括Faster R-CNN和YOLOv5)的全面比较表明,UAV-DETR在复杂航空场景中实现了卓越的小目标检测性能。在mAP@0.5、参数数量和计算复杂度(GFLOPs)方面对性能进行了全面评估。在VisDrone2019数据集基准上的实验表明,UAV-DETR实现了51.6%的mAP@0.5,比RT-DETR高出3.5%,同时将模型参数数量从1980万减少到1680万。

相似文献

1
UAV-DETR: An Enhanced RT-DETR Architecture for Efficient Small Object Detection in UAV Imagery.无人机检测Transformer(UAV-DETR):一种用于无人机图像中高效小目标检测的增强型实时检测Transformer(RT-DETR)架构。
Sensors (Basel). 2025 Jul 24;25(15):4582. doi: 10.3390/s25154582.
2
Integrated neural network framework for multi-object detection and recognition using UAV imagery.用于使用无人机图像进行多目标检测与识别的集成神经网络框架。
Front Neurorobot. 2025 Jul 30;19:1643011. doi: 10.3389/fnbot.2025.1643011. eCollection 2025.
3
A dual-domain perception gate-controlled adaptive fusion algorithm for road crack detection.一种用于道路裂缝检测的双域感知门控自适应融合算法。
Sci Rep. 2025 Jul 1;15(1):21485. doi: 10.1038/s41598-025-07610-5.
4
I-YOLOv11n: A Lightweight and Efficient Small Target Detection Framework for UAV Aerial Images.I-YOLOv11n:一种用于无人机航空图像的轻量级高效小目标检测框架。
Sensors (Basel). 2025 Aug 7;25(15):4857. doi: 10.3390/s25154857.
5
LRDS-YOLO enhances small object detection in UAV aerial images with a lightweight and efficient design.LRDS-YOLO通过轻量级且高效的设计增强了无人机航拍图像中的小目标检测能力。
Sci Rep. 2025 Jul 2;15(1):22627. doi: 10.1038/s41598-025-07021-6.
6
BCRT-DETR: A Lightweight Blood Cell Detection Transformer Model for Enhanced Medical Diagnostics.
Biotechnol Appl Biochem. 2025 Jul 21. doi: 10.1002/bab.70030.
7
Multi-level channel-spatial attention and light-weight scale-fusion network (MCSLF-Net): multi-level channel-spatial attention and light-weight scale-fusion transformer for 3D brain tumor segmentation.多级通道空间注意力与轻量级尺度融合网络(MCSLF-Net):用于3D脑肿瘤分割的多级通道空间注意力与轻量级尺度融合变换器
Quant Imaging Med Surg. 2025 Jul 1;15(7):6301-6325. doi: 10.21037/qims-2025-354. Epub 2025 Jun 30.
8
HSF-DETR: A Special Vehicle Detection Algorithm Based on Hypergraph Spatial Features and Bipolar Attention.HSF-DETR:一种基于超图空间特征和双极注意力的特殊车辆检测算法。
Sensors (Basel). 2025 Jul 13;25(14):4381. doi: 10.3390/s25144381.
9
An improved EAE-DETR model for defect detection of server motherboard.一种用于服务器主板缺陷检测的改进型EAE-DETR模型。
Sci Rep. 2025 Aug 8;15(1):29063. doi: 10.1038/s41598-025-14957-2.
10
LEAD-YOLO: A Lightweight and Accurate Network for Small Object Detection in Autonomous Driving.LEAD-YOLO:一种用于自动驾驶中小目标检测的轻量级且精确的网络。
Sensors (Basel). 2025 Aug 4;25(15):4800. doi: 10.3390/s25154800.

本文引用的文献

1
RFAG-YOLO: A Receptive Field Attention-Guided YOLO Network for Small-Object Detection in UAV Images.RFAG-YOLO:一种用于无人机图像中小目标检测的感受野注意力引导YOLO网络。
Sensors (Basel). 2025 Mar 30;25(7):2193. doi: 10.3390/s25072193.
2
YOLO-PEL: The Efficient and Lightweight Vehicle Detection Method Based on YOLO Algorithm.YOLO-PEL:基于YOLO算法的高效轻量级车辆检测方法。
Sensors (Basel). 2025 Mar 21;25(7):1959. doi: 10.3390/s25071959.
3
DV-DETR: Improved UAV Aerial Small Target Detection Algorithm Based on RT-DETR.DV-DETR:基于RT-DETR的改进型无人机航空小目标检测算法
Sensors (Basel). 2024 Nov 19;24(22):7376. doi: 10.3390/s24227376.
4
Drone-DETR: Efficient Small Object Detection for Remote Sensing Image Using Enhanced RT-DETR Model.无人机DETR:使用增强型RT-DETR模型对遥感图像进行高效小目标检测
Sensors (Basel). 2024 Aug 24;24(17):5496. doi: 10.3390/s24175496.
5
Accurate UAV Small Object Detection Based on HRFPN and EfficentVMamba.基于HRFPN和高效VMamba的无人机小目标精确检测
Sensors (Basel). 2024 Jul 31;24(15):4966. doi: 10.3390/s24154966.
6
Context and Spatial Feature Calibration for Real-Time Semantic Segmentation.用于实时语义分割的上下文和空间特征校准
IEEE Trans Image Process. 2023;32:5465-5477. doi: 10.1109/TIP.2023.3318967. Epub 2023 Oct 25.
7
YOLOv5 with ConvMixer Prediction Heads for Precise Object Detection in Drone Imagery.YOLOv5 与 ConvMixer 预测头在无人机图像中的精确目标检测。
Sensors (Basel). 2022 Nov 2;22(21):8424. doi: 10.3390/s22218424.
8
Detection and Tracking Meet Drones Challenge.检测与跟踪遭遇无人机挑战。
IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):7380-7399. doi: 10.1109/TPAMI.2021.3119563. Epub 2022 Oct 4.
9
AlignSeg: Feature-Aligned Segmentation Networks.AlignSeg:特征对齐分割网络。
IEEE Trans Pattern Anal Mach Intell. 2022 Jan;44(1):550-557. doi: 10.1109/TPAMI.2021.3062772. Epub 2021 Dec 7.
10
UAV-YOLO: Small Object Detection on Unmanned Aerial Vehicle Perspective.UAV-YOLO:无人机视角下的小目标检测。
Sensors (Basel). 2020 Apr 15;20(8):2238. doi: 10.3390/s20082238.