• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于极弱监督的视频目标发现与协同分割。

Video Object Discovery and Co-Segmentation with Extremely Weak Supervision.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2017 Oct;39(10):2074-2088. doi: 10.1109/TPAMI.2016.2612187. Epub 2016 Oct 26.

DOI:10.1109/TPAMI.2016.2612187
PMID:28113741
Abstract

We present a spatio-temporal energy minimization formulation for simultaneous video object discovery and co-segmentation across multiple videos containing irrelevant frames. Our approach overcomes a limitation that most existing video co-segmentation methods possess, i.e., they perform poorly when dealing with practical videos in which the target objects are not present in many frames. Our formulation incorporates a spatio-temporal auto-context model, which is combined with appearance modeling for superpixel labeling. The superpixel-level labels are propagated to the frame level through a multiple instance boosting algorithm with spatial reasoning, based on which frames containing the target object are identified. Our method only needs to be bootstrapped with the frame-level labels for a few video frames (e.g., usually 1 to 3) to indicate if they contain the target objects or not. Extensive experiments on four datasets validate the efficacy of our proposed method: 1) object segmentation from a single video on the SegTrack dataset, 2) object co-segmentation from multiple videos on a video co-segmentation dataset, and 3) joint object discovery and co-segmentation from multiple videos containing irrelevant frames on the MOViCS dataset and XJTU-Stevens, a new dataset that we introduce in this paper. The proposed method compares favorably with the state-of-the-art in all of these experiments.

摘要

我们提出了一种用于跨多个包含不相关帧的视频进行同时视频对象发现和共同分割的时空能量最小化公式。我们的方法克服了大多数现有的视频共同分割方法的一个限制,即当处理实际视频时,它们的性能较差,因为目标对象在许多帧中都不存在。我们的公式结合了时空自上下文模型,该模型与超像素标记的外观建模相结合。通过基于空间推理的多实例提升算法将超像素级别的标签传播到帧级别,根据该算法可以识别包含目标对象的帧。我们的方法只需要用几个视频帧(例如,通常为 1 到 3 个)的帧级标签进行引导,以指示它们是否包含目标对象。在四个数据集上的广泛实验验证了我们提出的方法的有效性:1)在 SegTrack 数据集上的单个视频中的对象分割,2)在视频共同分割数据集上的多个视频中的对象共同分割,以及 3)在包含不相关帧的多个视频上的联合对象发现和共同分割 MOViCS 数据集和我们在本文中引入的新数据集 XJTU-Stevens。该方法在所有这些实验中都优于最先进的方法。

相似文献

1
Video Object Discovery and Co-Segmentation with Extremely Weak Supervision.基于极弱监督的视频目标发现与协同分割。
IEEE Trans Pattern Anal Mach Intell. 2017 Oct;39(10):2074-2088. doi: 10.1109/TPAMI.2016.2612187. Epub 2016 Oct 26.
2
Joint Video Object Discovery and Segmentation by Coupled Dynamic Markov Networks.联合视频对象发现和分割的耦合动态马尔可夫网络。
IEEE Trans Image Process. 2018 Dec;27(12):5840-5853. doi: 10.1109/TIP.2018.2859622. Epub 2018 Jul 30.
3
Joint Segmentation and Recognition of Categorized Objects from Noisy Web Image Collection.从噪声网络图像集中对分类对象进行联合分割与识别
IEEE Trans Image Process. 2014 Sep;23(9):4070-4086. doi: 10.1109/TIP.2014.2339196. Epub 2014 Jul 14.
4
Object-Based Multiple Foreground Video Co-Segmentation via Multi-State Selection Graph.基于对象的多前景视频协同分割方法:基于多状态选择图的方法。
IEEE Trans Image Process. 2015 Nov;24(11):3415-24. doi: 10.1109/TIP.2015.2442915. Epub 2015 Jun 9.
5
Segmentation in Weakly Labeled Videos via a Semantic Ranking and Optical Warping Network.通过语义排序和光流变形网络对弱标注视频进行分割
IEEE Trans Image Process. 2018 May 16. doi: 10.1109/TIP.2018.2834221.
6
Video Object Segmentation without Temporal Information.无时间信息的视频对象分割
IEEE Trans Pattern Anal Mach Intell. 2019 Jun;41(6):1515-1530. doi: 10.1109/TPAMI.2018.2838670. Epub 2018 May 23.
7
Exploring Weakly Labeled Images for Video Object Segmentation With Submodular Proposal Selection.基于子模提案选择的视频对象分割中弱标注图像的探索。
IEEE Trans Image Process. 2018 Sep;27(9):4245-4259. doi: 10.1109/TIP.2018.2806995.
8
Adaptive Selection of Reference Frames for Video Object Segmentation.用于视频对象分割的参考帧自适应选择
IEEE Trans Image Process. 2022;31:1057-1071. doi: 10.1109/TIP.2021.3137660. Epub 2022 Jan 19.
9
Segment-Tube: Spatio-Temporal Action Localization in Untrimmed Videos with Per-Frame Segmentation.Segment-Tube:基于逐帧分割的非修剪视频中的时空动作定位。
Sensors (Basel). 2018 May 22;18(5):1657. doi: 10.3390/s18051657.
10
Beyond Appearance: Multi-Frame Spatio-Temporal Context Memory Networks for Efficient and Robust Video Object Segmentation.超越表象:用于高效且稳健视频对象分割的多帧时空上下文记忆网络
IEEE Trans Image Process. 2024;33:4853-4866. doi: 10.1109/TIP.2024.3423390. Epub 2024 Sep 5.

引用本文的文献

1
Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network.基于注意力感知时间加权卷积神经网络的动作识别。
Sensors (Basel). 2018 Jun 21;18(7):1979. doi: 10.3390/s18071979.
2
Segment-Tube: Spatio-Temporal Action Localization in Untrimmed Videos with Per-Frame Segmentation.Segment-Tube:基于逐帧分割的非修剪视频中的时空动作定位。
Sensors (Basel). 2018 May 22;18(5):1657. doi: 10.3390/s18051657.