• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PDNet:通过预测解耦实现更好的单阶段目标检测

PDNet: Towards Better One-stage Object Detection with Prediction Decoupling.

作者信息

Yang Li, Xu Yan, Wang Shaoru, Yuan Chunfeng, Zhang Ziqi, Li Bing, Hu Weiming

出版信息

IEEE Trans Image Process. 2022 Jul 28;PP. doi: 10.1109/TIP.2022.3193223.

DOI:10.1109/TIP.2022.3193223
PMID:35901002
Abstract

Recent one-stage object detectors follow a per-pixel prediction approach that predicts both the object category scores and boundary positions from every single grid location. However, the most suitable positions for inferring different targets, i.e., the object category and boundaries, are generally different. Predicting all these targets from the same grid location thus may lead to sub-optimal results. In this paper, we analyze the suitable inference positions for object category and boundaries, and propose a prediction-target-decoupled detector named PDNet to establish a more flexible detection paradigm. Our PDNet with the prediction decoupling mechanism encodes different targets separately in different locations. A learnable prediction collection module is devised with two sets of dynamic points, i.e., dynamic boundary points and semantic points, to collect and aggregate the predictions from the favorable regions for localization and classification. We adopt a two-step strategy to learn these dynamic point positions, where the prior positions are estimated for different targets first, and the network further predicts residual offsets to the positions with better perceptions of the object properties. Extensive experiments on the MS COCO benchmark demonstrate the effectiveness and efficiency of our method. With a single ResNeXt-64x4d-101-DCN as the backbone, our detector achieves 50.1 AP with single-scale testing, which outperforms the state-of-the-art methods by an appreciable margin under the same experimental settings. Moreover, our detector is highly efficient as a one-stage framework. Our code will be public.

摘要

近期的单阶段目标检测器采用逐像素预测方法,从每个网格位置预测目标类别分数和边界位置。然而,推断不同目标(即目标类别和边界)最合适的位置通常是不同的。因此,从同一网格位置预测所有这些目标可能会导致次优结果。在本文中,我们分析了目标类别和边界的合适推断位置,并提出了一种名为PDNet的预测目标解耦检测器,以建立更灵活的检测范式。我们具有预测解耦机制的PDNet在不同位置分别对不同目标进行编码。设计了一个可学习的预测收集模块,该模块有两组动态点,即动态边界点和语义点,用于从定位和分类的有利区域收集和聚合预测。我们采用两步策略来学习这些动态点的位置,首先为不同目标估计先验位置,然后网络进一步预测到对目标属性有更好感知的位置的残差偏移。在MS COCO基准上进行的大量实验证明了我们方法的有效性和效率。以单个ResNeXt-64x4d-101-DCN作为主干,我们的检测器在单尺度测试下达到了50.1 AP,在相同实验设置下比当前最先进的方法有显著优势。此外,作为一个单阶段框架,我们的检测器效率很高。我们的代码将公开。

相似文献

1
PDNet: Towards Better One-stage Object Detection with Prediction Decoupling.PDNet:通过预测解耦实现更好的单阶段目标检测
IEEE Trans Image Process. 2022 Jul 28;PP. doi: 10.1109/TIP.2022.3193223.
2
HTD: Heterogeneous Task Decoupling for Two-Stage Object Detection.HTD:用于两阶段目标检测的异构任务解耦
IEEE Trans Image Process. 2021;30:9456-9469. doi: 10.1109/TIP.2021.3126423. Epub 2021 Nov 18.
3
Mutual-Assistance Learning for Object Detection.用于目标检测的互助学习
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):15171-15184. doi: 10.1109/TPAMI.2023.3319634. Epub 2023 Nov 3.
4
Interactive Regression and Classification for Dense Object Detector.密集目标检测器的交互式回归与分类
IEEE Trans Image Process. 2022;31:3684-3696. doi: 10.1109/TIP.2022.3174391. Epub 2022 May 26.
5
Mask SSD: An Effective Single-Stage Approach to Object Instance Segmentation.Mask SSD:一种有效的单阶段目标实例分割方法。
IEEE Trans Image Process. 2020;29(1):2078-2093. doi: 10.1109/TIP.2019.2947806. Epub 2019 Oct 22.
6
Generalized Focal Loss: Towards Efficient Representation Learning for Dense Object Detection.广义焦点损失:面向密集目标检测的有效表示学习。
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3139-3153. doi: 10.1109/TPAMI.2022.3180392. Epub 2023 Feb 3.
7
CBNet: A Composite Backbone Network Architecture for Object Detection.CBNet:一种用于目标检测的复合主干网络架构
IEEE Trans Image Process. 2022 Oct 28;PP. doi: 10.1109/TIP.2022.3216771.
8
Keypoint-Aware Single-Stage 3D Object Detector for Autonomous Driving.用于自动驾驶的关键点感知单阶段 3D 目标检测器。
Sensors (Basel). 2022 Feb 14;22(4):1451. doi: 10.3390/s22041451.
9
Edge Preserving and Multi-Scale Contextual Neural Network for Salient Object Detection.边缘保持和多尺度上下文神经网络的显著目标检测。
IEEE Trans Image Process. 2018;27(1):121-134. doi: 10.1109/TIP.2017.2756825.
10
Hierarchical Regression and Classification for Accurate Object Detection.用于精确目标检测的分层回归与分类
IEEE Trans Neural Netw Learn Syst. 2023 May;34(5):2425-2439. doi: 10.1109/TNNLS.2021.3106641. Epub 2023 May 2.