从判别式到完备式：用于弱监督目标检测的强化搜索智能体学习

From Discriminant to Complete: Reinforcement Searching-Agent Learning for Weakly Supervised Object Detection.

作者信息

Zhang Dingwen, Han Junwei, Zhao Long, Zhao Tao

出版信息

IEEE Trans Neural Netw Learn Syst. 2020 Dec;31(12):5549-5560. doi: 10.1109/TNNLS.2020.2969483. Epub 2020 Nov 30.

DOI:10.1109/TNNLS.2020.2969483

PMID:32092016

Abstract

Weakly supervised object detection (WSOD) is an interesting yet challenging task in the computer vision community. The core is to discover the image regions that contain the complete object instances under the image-level supervision. Existing works usually solve this problem via a proposal selection strategy, which selects the most discriminative box regions from the weakly labeled training images. However, these regions usually only contain the discriminative object parts rather than the complete object instances. To address this problem, this article proposes to learn a searching-agent to gradually mine desirable object regions under a region searching paradigm, where we formulate the searching process as a Markov decision process and learn the searching-agent under a deep reinforcement learning framework. To learn such a searching-agent under the weak supervision, we extract the pseudo-complete object regions and the corresponding local discriminative object parts and introduce the obtained pseudo-target-part training pairs into the reinforcement learning process of the search-agent. This learning strategy has twofold advantages: 1) it can mimic the searching process to reveal complete object regions from a certain discriminative part of the object under the weak supervision and 2) it will not suffer from the learning difficulty arise from the long-action sequence that happens when searching from the entire image range. Comprehensive experiments on benchmark data sets demonstrate that by integrating the learned searching-agent with the existing WSOD method, we can achieve better performance than the other state-of-the-art and baseline methods.

摘要

弱监督目标检测（WSOD）是计算机视觉领域中一项有趣但具有挑战性的任务。其核心在于在图像级监督下发现包含完整目标实例的图像区域。现有工作通常通过提议选择策略来解决此问题，该策略从弱标注的训练图像中选择最具判别力的框区域。然而，这些区域通常只包含有判别力的目标部分，而非完整的目标实例。为解决这个问题，本文提出学习一个搜索代理，在区域搜索范式下逐步挖掘理想的目标区域，我们将搜索过程表述为马尔可夫决策过程，并在深度强化学习框架下学习搜索代理。为在弱监督下学习这样一个搜索代理，我们提取伪完整目标区域和相应的局部有判别力的目标部分，并将得到的伪目标 - 部分训练对引入搜索代理的强化学习过程。这种学习策略有两个优点：1）它可以模仿搜索过程，在弱监督下从目标的某个有判别力的部分揭示完整的目标区域；2）它不会遭受从整个图像范围进行搜索时出现的长动作序列所带来的学习困难。在基准数据集上的综合实验表明，通过将学习到的搜索代理与现有的WSOD方法相结合，我们可以取得比其他现有最先进方法和基线方法更好的性能。

相似文献

From Discriminant to Complete: Reinforcement Searching-Agent Learning for Weakly Supervised Object Detection.从判别式到完备式：用于弱监督目标检测的强化搜索智能体学习

IEEE Trans Neural Netw Learn Syst. 2020 Dec;31(12):5549-5560. doi: 10.1109/TNNLS.2020.2969483. Epub 2020 Nov 30.

SPFTN: A Joint Learning Framework for Localizing and Segmenting Objects in Weakly Labeled Videos.SPFTN：一种用于在弱标注视频中定位和分割对象的联合学习框架。

IEEE Trans Pattern Anal Mach Intell. 2020 Feb;42(2):475-489. doi: 10.1109/TPAMI.2018.2881114. Epub 2018 Nov 13.

Min-Entropy Latent Model for Weakly Supervised Object Detection.用于弱监督目标检测的最小熵潜在模型

IEEE Trans Pattern Anal Mach Intell. 2019 Oct;41(10):2395-2409. doi: 10.1109/TPAMI.2019.2898858. Epub 2019 Feb 12.

PCL: Proposal Cluster Learning for Weakly Supervised Object Detection.PCL：用于弱监督目标检测的提议聚类学习

IEEE Trans Pattern Anal Mach Intell. 2020 Jan;42(1):176-191. doi: 10.1109/TPAMI.2018.2876304. Epub 2018 Oct 16.

Salvage of Supervision in Weakly Supervised Object Detection and Segmentation.弱监督目标检测和分割中的监控恢复。

IEEE Trans Pattern Anal Mach Intell. 2023 Aug;45(8):10394-10408. doi: 10.1109/TPAMI.2023.3243054. Epub 2023 Jun 30.

Synthesizing Supervision for Learning Deep Saliency Network without Human Annotation.无人工标注的深度学习显著图网络合成监督学习

IEEE Trans Pattern Anal Mach Intell. 2020 Jul;42(7):1755-1769. doi: 10.1109/TPAMI.2019.2900649. Epub 2019 Feb 20.

Instance-Level Contrastive Learning for Weakly Supervised Object Detection.基于实例对比的弱监督目标检测。

Sensors (Basel). 2022 Oct 4;22(19):7525. doi: 10.3390/s22197525.

Online Attention Accumulation for Weakly Supervised Semantic Segmentation.用于弱监督语义分割的在线注意力积累

IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):7062-7077. doi: 10.1109/TPAMI.2021.3092573. Epub 2022 Sep 14.

Enhanced Spatial Feature Learning for Weakly Supervised Object Detection.用于弱监督目标检测的增强空间特征学习

IEEE Trans Neural Netw Learn Syst. 2022 Jun 8;PP. doi: 10.1109/TNNLS.2022.3178180.

Continuation Multiple Instance Learning for Weakly and Fully Supervised Object Detection.用于弱监督和全监督目标检测的连续多实例学习

IEEE Trans Neural Netw Learn Syst. 2022 Oct;33(10):5452-5466. doi: 10.1109/TNNLS.2021.3070801. Epub 2022 Oct 5.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

从判别式到完备式：用于弱监督目标检测的强化搜索智能体学习

From Discriminant to Complete: Reinforcement Searching-Agent Learning for Weakly Supervised Object Detection.

作者信息

出版信息

相似文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献