• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

NAN-DETR:噪声多锚点使DETR在目标检测方面表现更优。

NAN-DETR: noising multi-anchor makes DETR better for object detection.

作者信息

Huang Zixin, Tao Xuesong, Liu Xinyuan

机构信息

School of Computer Science, Beijing Institute of Technology, Beijing, China.

出版信息

Front Neurorobot. 2024 Oct 14;18:1484088. doi: 10.3389/fnbot.2024.1484088. eCollection 2024.

DOI:10.3389/fnbot.2024.1484088
PMID:39468993
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11513373/
Abstract

Object detection plays a crucial role in robotic vision, focusing on accurately identifying and localizing objects within images. However, many existing methods encounter limitations, particularly when it comes to effectively implementing a one-to-many matching strategy. To address these challenges, we propose NAN-DETR (Noising Multi-Anchor Detection Transformer), an innovative framework based on DETR (Detection Transformer). NAN-DETR introduces three key improvements to transformer-based object detection: a decoder-based multi-anchor strategy, a centralization noising mechanism, and the integration of Complete Intersection over Union (CIoU) loss. The multi-anchor strategy leverages multiple anchors per object, significantly enhancing detection accuracy by improving the one-to-many matching process. The centralization noising mechanism mitigates conflicts among anchors by injecting controlled noise into the detection boxes, thereby increasing the robustness of the model. Additionally, CIoU loss, which incorporates both aspect ratio and spatial distance in its calculations, results in more precise bounding box predictions compared to the conventional IoU loss. Although NAN-DETR may not drastically improve real-time processing capabilities, its exceptional performance positions it as a highly reliable solution for diverse object detection scenarios.

摘要

目标检测在机器人视觉中起着至关重要的作用,专注于在图像中准确识别和定位目标。然而,许多现有方法存在局限性,特别是在有效实施一对多匹配策略方面。为应对这些挑战,我们提出了NAN-DETR(噪声多锚点检测变压器),这是一种基于DETR(检测变压器)的创新框架。NAN-DETR对基于变压器的目标检测进行了三项关键改进:基于解码器的多锚点策略、中心化噪声机制以及完全交并比(CIoU)损失的整合。多锚点策略为每个目标利用多个锚点,通过改进一对多匹配过程显著提高检测精度。中心化噪声机制通过向检测框注入可控噪声来减轻锚点之间的冲突,从而提高模型的鲁棒性。此外,CIoU损失在计算中同时纳入了宽高比和空间距离,与传统的IoU损失相比,能产生更精确的边界框预测。尽管NAN-DETR可能不会大幅提高实时处理能力,但其卓越性能使其成为各种目标检测场景的高度可靠解决方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e79/11513373/5ab60a049a6f/fnbot-18-1484088-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e79/11513373/0a548e879fda/fnbot-18-1484088-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e79/11513373/bb78ef32f389/fnbot-18-1484088-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e79/11513373/c0327e1e543b/fnbot-18-1484088-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e79/11513373/4ec5650b63af/fnbot-18-1484088-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e79/11513373/fde2a86dacfb/fnbot-18-1484088-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e79/11513373/b49f069fc179/fnbot-18-1484088-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e79/11513373/44c4d177895c/fnbot-18-1484088-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e79/11513373/5ab60a049a6f/fnbot-18-1484088-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e79/11513373/0a548e879fda/fnbot-18-1484088-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e79/11513373/bb78ef32f389/fnbot-18-1484088-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e79/11513373/c0327e1e543b/fnbot-18-1484088-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e79/11513373/4ec5650b63af/fnbot-18-1484088-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e79/11513373/fde2a86dacfb/fnbot-18-1484088-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e79/11513373/b49f069fc179/fnbot-18-1484088-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e79/11513373/44c4d177895c/fnbot-18-1484088-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e79/11513373/5ab60a049a6f/fnbot-18-1484088-g0008.jpg

相似文献

1
NAN-DETR: noising multi-anchor makes DETR better for object detection.NAN-DETR:噪声多锚点使DETR在目标检测方面表现更优。
Front Neurorobot. 2024 Oct 14;18:1484088. doi: 10.3389/fnbot.2024.1484088. eCollection 2024.
2
Drone-DETR: Efficient Small Object Detection for Remote Sensing Image Using Enhanced RT-DETR Model.无人机DETR:使用增强型RT-DETR模型对遥感图像进行高效小目标检测
Sensors (Basel). 2024 Aug 24;24(17):5496. doi: 10.3390/s24175496.
3
CCDN-DETR: A Detection Transformer Based on Constrained Contrast Denoising for Multi-Class Synthetic Aperture Radar Object Detection.CCDN-DETR:一种基于约束对比去噪的检测Transformer,用于多类合成孔径雷达目标检测。
Sensors (Basel). 2024 Mar 11;24(6):1793. doi: 10.3390/s24061793.
4
FSH-DETR: An Efficient End-to-End Fire Smoke and Human Detection Based on a Deformable DEtection TRansformer (DETR).FSH-DETR:一种基于可变形检测变压器(DETR)的高效端到端火灾烟雾和人体检测方法。
Sensors (Basel). 2024 Jun 23;24(13):4077. doi: 10.3390/s24134077.
5
Skip DETR: end-to-end Skip connection model for small object detection in forestry pest dataset.跳过DETR:用于林业害虫数据集中小目标检测的端到端跳过连接模型。
Front Plant Sci. 2023 Aug 15;14:1219474. doi: 10.3389/fpls.2023.1219474. eCollection 2023.
6
SOD-YOLOv8-Enhancing YOLOv8 for Small Object Detection in Aerial Imagery and Traffic Scenes.SOD-YOLOv8——增强YOLOv8以用于航空图像和交通场景中的小目标检测
Sensors (Basel). 2024 Sep 25;24(19):6209. doi: 10.3390/s24196209.
7
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising.DN-DETR:通过引入查询去噪加速DETR训练。
IEEE Trans Pattern Anal Mach Intell. 2024 Apr;46(4):2239-2251. doi: 10.1109/TPAMI.2023.3335410. Epub 2024 Mar 6.
8
Identifying rice field weeds from unmanned aerial vehicle remote sensing imagery using deep learning.利用深度学习从无人机遥感影像中识别稻田杂草
Plant Methods. 2024 Jul 16;20(1):105. doi: 10.1186/s13007-024-01232-0.
9
AD-DETR: DETR with asymmetrical relation and decoupled attention in crowded scenes.AD-DETR:拥挤场景中具有不对称关系和解耦注意力的DETR
Math Biosci Eng. 2023 Jun 26;20(8):14158-14179. doi: 10.3934/mbe.2023633.
10
Corner-Point and Foreground-Area IoU Loss: Better Localization of Small Objects in Bounding Box Regression.角点和前景面积交并比损失:在边界框回归中更好地定位小目标。
Sensors (Basel). 2023 May 22;23(10):4961. doi: 10.3390/s23104961.

引用本文的文献

1
A Study on Systematic Improvement of Transformer Models for Object Pose Estimation.用于目标姿态估计的变压器模型的系统改进研究。
Sensors (Basel). 2025 Feb 18;25(4):1227. doi: 10.3390/s25041227.

本文引用的文献

1
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising.DN-DETR:通过引入查询去噪加速DETR训练。
IEEE Trans Pattern Anal Mach Intell. 2024 Apr;46(4):2239-2251. doi: 10.1109/TPAMI.2023.3335410. Epub 2024 Mar 6.
2
Unsupervised Pre-Training for Detection Transformers.用于检测变压器的无监督预训练
IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):12772-12782. doi: 10.1109/TPAMI.2022.3216514. Epub 2023 Oct 3.