• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于弱监督视频目标检测的渐进式帧提议挖掘

Progressive Frame-Proposal Mining for Weakly Supervised Video Object Detection.

作者信息

Han Mingfei, Wang Yali, Li Mingjie, Chang Xiaojun, Yang Yi, Qiao Yu

出版信息

IEEE Trans Image Process. 2024;33:1560-1573. doi: 10.1109/TIP.2024.3364536. Epub 2024 Feb 27.

DOI:10.1109/TIP.2024.3364536
PMID:38358874
Abstract

In this paper, we focus on the weakly supervised video object detection problem, where each training video is only tagged with object labels, without any bounding box annotations of objects. To effectively train object detectors from such weakly-annotated videos, we propose a Progressive Frame-Proposal Mining (PFPM) framework by exploiting discriminative proposals in a coarse-to-fine manner. First, we design a flexible Multi-Level Selection (MLS) scheme, with explicit guidance of video tags. By selecting object-relevant frames and mining important proposals from these frames, the proposed MLS can effectively reduce frame redundancy as well as improve proposal effectiveness to boost weakly-supervised detectors. Moreover, we develop a novel Holistic-View Refinement (HVR) scheme, which can globally evaluate importance of proposals among frames, and thus correctly refine pseudo ground truth boxes for training video detectors in a self-supervised manner. Finally, we evaluate the proposed PFPM on a large-scale benchmark for video object detection, on ImageNet VID, under the setting of weak annotations. The experimental results demonstrate that our PFPM significantly outperforms the state-of-the-art weakly-supervised detectors.

摘要

在本文中,我们聚焦于弱监督视频目标检测问题,其中每个训练视频仅用目标标签进行标注,而没有任何目标的边界框注释。为了从这类弱标注视频中有效地训练目标检测器,我们提出了一种渐进式帧提议挖掘(PFPM)框架,通过从粗到细的方式利用有区分性的提议。首先,我们设计了一种灵活的多级选择(MLS)方案,并在视频标签的明确指导下。通过选择与目标相关的帧并从这些帧中挖掘重要提议,所提出的 MLS 可以有效减少帧冗余并提高提议有效性,以提升弱监督检测器。此外,我们开发了一种新颖的整体视图细化(HVR)方案,它可以全局评估帧之间提议的重要性,从而以自监督的方式正确地细化用于训练视频检测器的伪真实框。最后,我们在 ImageNet VID 上弱注释设置下的大规模视频目标检测基准上评估所提出的 PFPM。实验结果表明,我们的 PFPM 显著优于当前最先进的弱监督检测器。

相似文献

1
Progressive Frame-Proposal Mining for Weakly Supervised Video Object Detection.用于弱监督视频目标检测的渐进式帧提议挖掘
IEEE Trans Image Process. 2024;33:1560-1573. doi: 10.1109/TIP.2024.3364536. Epub 2024 Feb 27.
2
Progressive Representation Adaptation for Weakly Supervised Object Localization.用于弱监督目标定位的渐进式表示适应
IEEE Trans Pattern Anal Mach Intell. 2020 Jun;42(6):1424-1438. doi: 10.1109/TPAMI.2019.2899839. Epub 2019 Feb 15.
3
Cyclic Self-Training With Proposal Weight Modulation for Cross-Supervised Object Detection.用于交叉监督目标检测的带提议权重调制的循环自训练
IEEE Trans Image Process. 2023;32:1992-2002. doi: 10.1109/TIP.2023.3261752. Epub 2023 Apr 4.
4
Weakly-Supervised Salient Object Detection With Saliency Bounding Boxes.基于显著性边界框的弱监督显著目标检测
IEEE Trans Image Process. 2021;30:4423-4435. doi: 10.1109/TIP.2021.3071691. Epub 2021 Apr 21.
5
Deep Graph Metric Learning for Weakly Supervised Person Re-Identification.深度图度量学习在弱监督行人再识别中的应用。
IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):6074-6093. doi: 10.1109/TPAMI.2021.3084613. Epub 2022 Sep 14.
6
PCL: Proposal Cluster Learning for Weakly Supervised Object Detection.PCL:用于弱监督目标检测的提议聚类学习
IEEE Trans Pattern Anal Mach Intell. 2020 Jan;42(1):176-191. doi: 10.1109/TPAMI.2018.2876304. Epub 2018 Oct 16.
7
Semantic Object Segmentation in Tagged Videos via Detection.基于检测的标记视频中的语义对象分割。
IEEE Trans Pattern Anal Mach Intell. 2018 Jul;40(7):1741-1754. doi: 10.1109/TPAMI.2017.2727049. Epub 2017 Jul 20.
8
Weakly Supervised Object Detection via Object-Specific Pixel Gradient.基于特定对象像素梯度的弱监督目标检测
IEEE Trans Neural Netw Learn Syst. 2018 Dec;29(12):5960-5970. doi: 10.1109/TNNLS.2018.2816021. Epub 2018 Apr 9.
9
Segmentation in Weakly Labeled Videos via a Semantic Ranking and Optical Warping Network.通过语义排序和光流变形网络对弱标注视频进行分割
IEEE Trans Image Process. 2018 May 16. doi: 10.1109/TIP.2018.2834221.
10
Weakly-Supervised Salient Object Detection on Light Fields.光场的弱监督显著目标检测
IEEE Trans Image Process. 2022;31:6295-6305. doi: 10.1109/TIP.2022.3207605. Epub 2022 Oct 10.

引用本文的文献

1
Hierarchical Active Learning with Label Proportions on Data Regions.基于数据区域标签比例的分层主动学习
IEEE Trans Knowl Data Eng. 2024 Dec;36(12):8434-8446. doi: 10.1109/tkde.2024.3419588.