• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

HTD:用于两阶段目标检测的异构任务解耦

HTD: Heterogeneous Task Decoupling for Two-Stage Object Detection.

作者信息

Li Wuyang, Chen Zhen, Li Baopu, Zhang Dingwen, Yuan Yixuan

出版信息

IEEE Trans Image Process. 2021;30:9456-9469. doi: 10.1109/TIP.2021.3126423. Epub 2021 Nov 18.

DOI:10.1109/TIP.2021.3126423
PMID:34780326
Abstract

Decoupling the sibling head has recently shown great potential in relieving the inherent task-misalignment problem in two-stage object detectors. However, existing works design similar structures for the classification and regression, ignoring task-specific characteristics and feature demands. Besides, the shared knowledge that may benefit the two branches is neglected, leading to potential excessive decoupling and semantic inconsistency. To address these two issues, we propose Heterogeneous task decoupling (HTD) framework for object detection, which utilizes a Progressive Graph (PGraph) module and a Border-aware Adaptation (BA) module for task-decoupling. Specifically, we first devise a Semantic Feature Aggregation (SFA) module to aggregate global semantics with image-level supervision, serving as the shared knowledge for the task-decoupled framework. Then, the PGraph module performs progressive graph reasoning, including local spatial aggregation and global semantic interaction, to enhance semantic representations of region proposals for classification. The proposed BA module integrates multi-level features adaptively, focusing on the low-level border activation to obtain representations with spatial and border perception for regression. Finally, we utilize the aggregated knowledge from SFA to keep the instance-level semantic consistency (ISC) of decoupled frameworks. Extensive experiments demonstrate that HTD outperforms existing detection works by a large margin, and achieves single-model 50.4%AP and 33.2% AP on COCO test-dev set using ResNet-101-DCN backbone, which is the best entry among state-of-the-arts under the same configuration. Our code is available at https://github.com/CityU-AIM-Group/HTD.

摘要

最近,解耦兄弟头部在缓解两阶段目标检测器中固有的任务错位问题方面显示出巨大潜力。然而,现有工作为分类和回归设计了相似的结构,忽略了任务特定的特征和特征需求。此外,可能有益于两个分支的共享知识被忽视,导致潜在的过度解耦和语义不一致。为了解决这两个问题,我们提出了用于目标检测的异构任务解耦(HTD)框架,该框架利用一个渐进图(PGraph)模块和一个边界感知自适应(BA)模块进行任务解耦。具体来说,我们首先设计了一个语义特征聚合(SFA)模块,通过图像级监督聚合全局语义,作为任务解耦框架的共享知识。然后,PGraph模块执行渐进图推理,包括局部空间聚合和全局语义交互,以增强用于分类的区域提议的语义表示。所提出的BA模块自适应地整合多级特征,专注于低级边界激活,以获得具有空间和边界感知的回归表示。最后,我们利用来自SFA的聚合知识来保持解耦框架的实例级语义一致性(ISC)。大量实验表明,HTD在很大程度上优于现有的检测工作,并且使用ResNet-101-DCN主干在COCO测试开发集上实现了单模型50.4%的AP和33.2%的AP,这是相同配置下最先进方法中的最佳成绩。我们的代码可在https://github.com/CityU-AIM-Group/HTD获取。

相似文献

1
HTD: Heterogeneous Task Decoupling for Two-Stage Object Detection.HTD:用于两阶段目标检测的异构任务解耦
IEEE Trans Image Process. 2021;30:9456-9469. doi: 10.1109/TIP.2021.3126423. Epub 2021 Nov 18.
2
Human-Object Interaction detection via Global Context and Pairwise-level Fusion Features Integration.基于全局上下文和对级别融合特征集成的人与对象交互检测。
Neural Netw. 2024 Feb;170:242-253. doi: 10.1016/j.neunet.2023.11.002. Epub 2023 Nov 13.
3
PDNet: Towards Better One-stage Object Detection with Prediction Decoupling.PDNet:通过预测解耦实现更好的单阶段目标检测
IEEE Trans Image Process. 2022 Jul 28;PP. doi: 10.1109/TIP.2022.3193223.
4
Graph-Based Region and Boundary Aggregation for Biomedical Image Segmentation.基于图的区域和边界聚合的生物医学图像分割。
IEEE Trans Med Imaging. 2022 Mar;41(3):690-701. doi: 10.1109/TMI.2021.3123567. Epub 2022 Mar 2.
5
Hierarchical matching and reasoning for multi-query image retrieval.多层次匹配与推理的多查询图像检索。
Neural Netw. 2024 May;173:106200. doi: 10.1016/j.neunet.2024.106200. Epub 2024 Feb 22.
6
Semantic and Correlation Disentangled Graph Convolutions for Multilabel Image Recognition.用于多标签图像识别的语义与相关性解缠图卷积
IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):1609-1621. doi: 10.1109/TNNLS.2023.3333542. Epub 2025 Jan 7.
7
Object and spatial discrimination makes weakly supervised local feature better.目标和空间辨别能力使弱监督局部特征更优。
Neural Netw. 2024 Dec;180:106697. doi: 10.1016/j.neunet.2024.106697. Epub 2024 Sep 12.
8
IPGN: Interactiveness Proposal Graph Network for Human-Object Interaction Detection.IPGN:用于人机交互检测的交互提案图网络。
IEEE Trans Image Process. 2021;30:6583-6593. doi: 10.1109/TIP.2021.3096333. Epub 2021 Jul 21.
9
SipMaskv2: Enhanced Fast Image and Video Instance Segmentation.SipMaskv2:增强型快速图像与视频实例分割
IEEE Trans Pattern Anal Mach Intell. 2023 Mar;45(3):3798-3812. doi: 10.1109/TPAMI.2022.3180564.
10
Interactive Regression and Classification for Dense Object Detector.密集目标检测器的交互式回归与分类
IEEE Trans Image Process. 2022;31:3684-3696. doi: 10.1109/TIP.2022.3174391. Epub 2022 May 26.

引用本文的文献

1
SSCI: Self-Supervised Deep Learning Improves Network Structure for Cancer Driver Gene Identification.SSCI:自监督深度学习改进癌症驱动基因识别的网络结构。
Int J Mol Sci. 2024 Sep 26;25(19):10351. doi: 10.3390/ijms251910351.
2
GOI-YOLOv8 Grouping Offset and Isolated GiraffeDet Low-Light Target Detection.基于GOI-YOLOv8的分组偏移与孤立的GiraffeDet低光目标检测
Sensors (Basel). 2024 Sep 5;24(17):5787. doi: 10.3390/s24175787.