• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于弱监督目标检测的增强空间特征学习

Enhanced Spatial Feature Learning for Weakly Supervised Object Detection.

作者信息

Wu Zhihao, Wen Jie, Xu Yong, Yang Jian, Li Xuelong, Zhang David

出版信息

IEEE Trans Neural Netw Learn Syst. 2022 Jun 8;PP. doi: 10.1109/TNNLS.2022.3178180.

DOI:10.1109/TNNLS.2022.3178180
PMID:35675239
Abstract

Weakly supervised object detection (WSOD) has become an effective paradigm, which requires only class labels to train object detectors. However, WSOD detectors are prone to learn highly discriminative features corresponding to local objects rather than complete objects, resulting in imprecise object localization. To address the issue, designing backbones specifically for WSOD is a feasible solution. However, the redesigned backbone generally needs to be pretrained on large-scale ImageNet or trained from scratch, both of which require much more time and computational costs than fine-tuning. In this article, we explore to optimize the backbone without losing the availability of the original pretrained model. Since the pooling layer summarizes neighborhood features, it is crucial to spatial feature learning. In addition, it has no learnable parameters, so its modification will not change the pretrained model. Based on the above analysis, we further propose enhanced spatial feature learning (ESFL) for WSOD, which first takes full advantage of multiple kernels in a single pooling layer to handle multiscale objects and then enhances above-average activations within the rectangular neighborhood to alleviate the problem of ignoring unsalient object parts. The experimental results on the PASCAL VOC and the MS COCO benchmarks demonstrate that ESFL can bring significant performance improvement for the WSOD method and achieve state-of-the-art results.

摘要

弱监督目标检测(WSOD)已成为一种有效的范式,它仅需要类别标签来训练目标检测器。然而,WSOD检测器容易学习到与局部对象而非完整对象相对应的高度判别性特征,导致目标定位不准确。为了解决这个问题,专门为WSOD设计主干网络是一种可行的解决方案。然而,重新设计的主干网络通常需要在大规模的ImageNet上进行预训练或从头开始训练,这两者都比微调需要更多的时间和计算成本。在本文中,我们探索在不损失原始预训练模型可用性的情况下优化主干网络。由于池化层汇总邻域特征,因此对空间特征学习至关重要。此外,它没有可学习的参数,因此对其进行修改不会改变预训练模型。基于上述分析,我们进一步提出了用于WSOD的增强空间特征学习(ESFL),它首先在单个池化层中充分利用多个内核来处理多尺度对象,然后增强矩形邻域内高于平均水平的激活,以缓解忽略不显著对象部分的问题。在PASCAL VOC和MS COCO基准上的实验结果表明,ESFL可以为WSOD方法带来显著的性能提升,并取得了当前最优的结果。

相似文献

1
Enhanced Spatial Feature Learning for Weakly Supervised Object Detection.用于弱监督目标检测的增强空间特征学习
IEEE Trans Neural Netw Learn Syst. 2022 Jun 8;PP. doi: 10.1109/TNNLS.2022.3178180.
2
PCL: Proposal Cluster Learning for Weakly Supervised Object Detection.PCL:用于弱监督目标检测的提议聚类学习
IEEE Trans Pattern Anal Mach Intell. 2020 Jan;42(1):176-191. doi: 10.1109/TPAMI.2018.2876304. Epub 2018 Oct 16.
3
Salvage of Supervision in Weakly Supervised Object Detection and Segmentation.弱监督目标检测和分割中的监控恢复。
IEEE Trans Pattern Anal Mach Intell. 2023 Aug;45(8):10394-10408. doi: 10.1109/TPAMI.2023.3243054. Epub 2023 Jun 30.
4
Object Detection from Scratch with Deep Supervision.基于深度监督的目标从头检测。
IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):398-412. doi: 10.1109/TPAMI.2019.2922181. Epub 2019 Jun 11.
5
Contrastive Proposal Extension With LSTM Network for Weakly Supervised Object Detection.基于 LSTM 网络的对比提案扩展的弱监督目标检测。
IEEE Trans Image Process. 2022;31:6879-6892. doi: 10.1109/TIP.2022.3216772. Epub 2022 Nov 3.
6
Selecting High-Quality Proposals for Weakly Supervised Object Detection With Bottom-Up Aggregated Attention and Phase-Aware Loss.通过自底向上聚合注意力和相位感知损失为弱监督目标检测选择高质量提议。
IEEE Trans Image Process. 2023;32:682-693. doi: 10.1109/TIP.2022.3231744. Epub 2023 Jan 6.
7
Attention-Based Dropout Layer for Weakly Supervised Single Object Localization and Semantic Segmentation.用于弱监督单目标定位和语义分割的基于注意力的随机失活层
IEEE Trans Pattern Anal Mach Intell. 2021 Dec;43(12):4256-4271. doi: 10.1109/TPAMI.2020.2999099. Epub 2021 Nov 3.
8
Pyramidal Multiple Instance Detection Network With Mask Guided Self-Correction for Weakly Supervised Object Detection.用于弱监督目标检测的带掩码引导自校正的金字塔多实例检测网络
IEEE Trans Image Process. 2021;30:3029-3040. doi: 10.1109/TIP.2021.3056887. Epub 2021 Feb 18.
9
Misclassification in Weakly Supervised Object Detection.弱监督目标检测中的误分类
IEEE Trans Image Process. 2024;33:3413-3427. doi: 10.1109/TIP.2024.3402981. Epub 2024 May 31.
10
Continuation Multiple Instance Learning for Weakly and Fully Supervised Object Detection.用于弱监督和全监督目标检测的连续多实例学习
IEEE Trans Neural Netw Learn Syst. 2022 Oct;33(10):5452-5466. doi: 10.1109/TNNLS.2021.3070801. Epub 2022 Oct 5.

引用本文的文献

1
A lightweight network for traffic sign detection via multiple scale context awareness and semantic information guidance.一种通过多尺度上下文感知和语义信息引导实现交通标志检测的轻量级网络。
Sci Rep. 2025 Mar 24;15(1):10110. doi: 10.1038/s41598-025-94610-0.
2
Instance-Level Contrastive Learning for Weakly Supervised Object Detection.基于实例对比的弱监督目标检测。
Sensors (Basel). 2022 Oct 4;22(19):7525. doi: 10.3390/s22197525.