• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ACD-DETR:用于无人机图像中小目标检测的自适应跨尺度检测变压器

ACD-DETR: Adaptive Cross-Scale Detection Transformer for Small Object Detection in UAV Imagery.

作者信息

Tong Yang, Ye Hui, Yang Jishen, Yang Xiulong

机构信息

School of Computer Science, Central China Normal University, Wuhan 430079, China.

Department of Computer Science, Georgia State University, Atlanta, GA 30303, USA.

出版信息

Sensors (Basel). 2025 Sep 5;25(17):5556. doi: 10.3390/s25175556.

DOI:10.3390/s25175556
PMID:40942984
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12431001/
Abstract

Small object detection in UAV imagery remains challenging due to complex aerial perspectives and the presence of dense, small targets with blurred boundaries. To address these challenges, we propose ACD-DETR, an adaptive end-to-end Transformer detector tailored for UAV-based small object detection. The framework introduces three core modules: the Multi-Scale Edge-Enhanced Feature Fusion Module (MSEFM) to preserve fine-grained details; the Omni-Grained Boundary Calibrator (OG-BC) for boundary-aware semantic fusion; and the Dynamic Position Bias Attention-based Intra-scale Feature Interaction (DPB-AIFI) to enhance spatial reasoning. Furthermore, we introduce ACD-DETR-SBA+, a fusion-enhanced variant that removes OG-BC and DPB-AIFI while deploying densely connected Semantic-Boundary Aggregation (SBA) modules to intensify boundary-semantic fusion. This design sacrifices computational efficiency in exchange for higher detection precision, making it suitable for resource-rich deployment scenarios. On the VisDrone2019 dataset, ACD-DETR achieves 50.9% mAP@0.5, outperforming the RT-DETR-R18 baseline by 3.6 percentage points, while reducing parameters by 18.5%. ACD-DETR-SBA+ further improves accuracy to 52.0% mAP@0.5, demonstrating the benefit of SBA-based fusion. Extensive experiments on the VisDrone2019 and DOTA datasets demonstrate that ACD-DETR achieves a state-of-the-art trade-off between accuracy and efficiency, while ACD-DETR-SBA+ achieves further performance improvements at higher computational cost. Ablation studies and visual analyses validate the effectiveness of the proposed modules and design strategies.

摘要

由于复杂的空中视角以及存在密集、边界模糊的小目标,无人机图像中的小目标检测仍然具有挑战性。为应对这些挑战,我们提出了ACD-DETR,这是一种专为基于无人机的小目标检测量身定制的自适应端到端Transformer检测器。该框架引入了三个核心模块:用于保留细粒度细节的多尺度边缘增强特征融合模块(MSEFM);用于边界感知语义融合的全粒度边界校准器(OG-BC);以及用于增强空间推理的基于动态位置偏差注意力的尺度内特征交互(DPB-AIFI)。此外,我们引入了ACD-DETR-SBA+,这是一种融合增强变体,它在部署密集连接的语义边界聚合(SBA)模块以强化边界语义融合时,去除了OG-BC和DPB-AIFI。这种设计牺牲了计算效率以换取更高的检测精度,使其适用于资源丰富的部署场景。在VisDrone2019数据集上,ACD-DETR实现了50.9%的mAP@0.5,比RT-DETR-R18基线高出3.6个百分点,同时参数减少了18.5%。ACD-DETR-SBA+进一步将准确率提高到52.0%的mAP@0.5,证明了基于SBA融合的优势。在VisDrone2019和DOTA数据集上进行的大量实验表明,ACD-DETR在准确性和效率之间实现了最优平衡,而ACD-DETR-SBA+在更高的计算成本下实现了进一步的性能提升。消融研究和可视化分析验证了所提出模块和设计策略的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/e247a8e6b465/sensors-25-05556-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/34b4890f0bf0/sensors-25-05556-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/ea31e7ee22fb/sensors-25-05556-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/31bca1f7033e/sensors-25-05556-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/91666050a1b6/sensors-25-05556-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/6fdc5543562b/sensors-25-05556-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/5684b3666fc8/sensors-25-05556-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/abfe69d6dc40/sensors-25-05556-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/b8ee08b3a22d/sensors-25-05556-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/2e359acea636/sensors-25-05556-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/899b670e53ea/sensors-25-05556-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/e247a8e6b465/sensors-25-05556-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/34b4890f0bf0/sensors-25-05556-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/ea31e7ee22fb/sensors-25-05556-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/31bca1f7033e/sensors-25-05556-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/91666050a1b6/sensors-25-05556-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/6fdc5543562b/sensors-25-05556-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/5684b3666fc8/sensors-25-05556-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/abfe69d6dc40/sensors-25-05556-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/b8ee08b3a22d/sensors-25-05556-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/2e359acea636/sensors-25-05556-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/899b670e53ea/sensors-25-05556-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8ba6/12431001/e247a8e6b465/sensors-25-05556-g011.jpg

相似文献

1
ACD-DETR: Adaptive Cross-Scale Detection Transformer for Small Object Detection in UAV Imagery.ACD-DETR:用于无人机图像中小目标检测的自适应跨尺度检测变压器
Sensors (Basel). 2025 Sep 5;25(17):5556. doi: 10.3390/s25175556.
2
UAV-DETR: An Enhanced RT-DETR Architecture for Efficient Small Object Detection in UAV Imagery.无人机检测Transformer(UAV-DETR):一种用于无人机图像中高效小目标检测的增强型实时检测Transformer(RT-DETR)架构。
Sensors (Basel). 2025 Jul 24;25(15):4582. doi: 10.3390/s25154582.
3
FDA-DETR: A frequency-aware DETR with dynamic query and adaptive multi-task optimization for oriented small object detection.FDA-DETR:一种用于定向小目标检测的具有动态查询和自适应多任务优化的频率感知DETR。
PLoS One. 2025 Aug 29;20(8):e0330929. doi: 10.1371/journal.pone.0330929. eCollection 2025.
4
Multi-level channel-spatial attention and light-weight scale-fusion network (MCSLF-Net): multi-level channel-spatial attention and light-weight scale-fusion transformer for 3D brain tumor segmentation.多级通道空间注意力与轻量级尺度融合网络(MCSLF-Net):用于3D脑肿瘤分割的多级通道空间注意力与轻量级尺度融合变换器
Quant Imaging Med Surg. 2025 Jul 1;15(7):6301-6325. doi: 10.21037/qims-2025-354. Epub 2025 Jun 30.
5
A dual-domain perception gate-controlled adaptive fusion algorithm for road crack detection.一种用于道路裂缝检测的双域感知门控自适应融合算法。
Sci Rep. 2025 Jul 1;15(1):21485. doi: 10.1038/s41598-025-07610-5.
6
Integrated neural network framework for multi-object detection and recognition using UAV imagery.用于使用无人机图像进行多目标检测与识别的集成神经网络框架。
Front Neurorobot. 2025 Jul 30;19:1643011. doi: 10.3389/fnbot.2025.1643011. eCollection 2025.
7
An improved EAE-DETR model for defect detection of server motherboard.一种用于服务器主板缺陷检测的改进型EAE-DETR模型。
Sci Rep. 2025 Aug 8;15(1):29063. doi: 10.1038/s41598-025-14957-2.
8
Lightweight underwater debris detection model based on improved RT-DETR.基于改进型RT-DETR的轻量级水下碎片检测模型
Mar Pollut Bull. 2025 Aug 28;222(Pt 1):118537. doi: 10.1016/j.marpolbul.2025.118537.
9
BCRT-DETR: A Lightweight Blood Cell Detection Transformer Model for Enhanced Medical Diagnostics.
Biotechnol Appl Biochem. 2025 Jul 21. doi: 10.1002/bab.70030.
10
Enhanced PCB defect detection via HSA-RTDETR on RT-DETR.通过基于RT-DETR的HSA-RTDETR实现增强的印刷电路板缺陷检测
Sci Rep. 2025 Aug 28;15(1):31783. doi: 10.1038/s41598-025-11394-z.

本文引用的文献

1
Drone-DETR: Efficient Small Object Detection for Remote Sensing Image Using Enhanced RT-DETR Model.无人机DETR:使用增强型RT-DETR模型对遥感图像进行高效小目标检测
Sensors (Basel). 2024 Aug 24;24(17):5496. doi: 10.3390/s24175496.
2
CrossFormer++: A Versatile Vision Transformer Hinging on Cross-Scale Attention.CrossFormer++:一种基于跨尺度注意力的通用视觉Transformer
IEEE Trans Pattern Anal Mach Intell. 2024 May;46(5):3123-3136. doi: 10.1109/TPAMI.2023.3341806. Epub 2024 Apr 3.
3
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.
更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.