• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于高性能视频对象分割的自监督渐进网络

Self Supervised Progressive Network for High Performance Video Object Segmentation.

作者信息

Li Guorong, Hong Dexiang, Xu Kai, Zhong Bineng, Su Li, Han Zhenjun, Huang Qingming

出版信息

IEEE Trans Neural Netw Learn Syst. 2024 Jun;35(6):7671-7684. doi: 10.1109/TNNLS.2022.3219936. Epub 2024 Jun 3.

DOI:10.1109/TNNLS.2022.3219936
PMID:36383579
Abstract

Recently, self-supervised video object segmentation (VOS) has attracted much interest. However, most proxy tasks are proposed to train only a single backbone, which relies on a point-to-point correspondence strategy to propagate masks through a video sequence. Due to its simple pipeline, the performance of the single backbone paradigm is still unsatisfactory. Instead of following the previous literature, we propose our self-supervised progressive network (SSPNet) which consists of a memory retrieval module (MRM) and collaborative refinement module (CRM). The MRM can perform point-to-point correspondence and produce a propagated coarse mask for a query frame through self-supervised pixel-level and frame-level similarity learning. The CRM, which is trained via cycle consistency region tracking, aggregates the reference & query information and learns the collaborative relationship among them implicitly to refine the coarse mask. Furthermore, to learn semantic knowledge from unlabeled data, we also design two novel mask-generation strategies to provide the training data with meaningful semantic information for the CRM. Extensive experiments conducted on DAVIS-17, YouTube- VOS and SegTrack v2 demonstrate that our method surpasses the state-of-the-art self-supervised methods and narrows the gap with the fully supervised methods.

摘要

最近,自监督视频对象分割(VOS)引起了广泛关注。然而,大多数代理任务仅用于训练单个主干网络,该主干网络依靠点对点对应策略在视频序列中传播掩码。由于其简单的流程,单个主干网络范式的性能仍然不尽人意。与以往文献不同,我们提出了自监督渐进网络(SSPNet),它由一个内存检索模块(MRM)和协作细化模块(CRM)组成。MRM可以执行点对点对应,并通过自监督像素级和帧级相似性学习为查询帧生成传播的粗掩码。CRM通过循环一致性区域跟踪进行训练,聚合参考信息和查询信息,并隐式学习它们之间的协作关系以细化粗掩码。此外,为了从未标记数据中学习语义知识,我们还设计了两种新颖的掩码生成策略,为CRM提供具有有意义语义信息的训练数据。在DAVIS-17、YouTube-VOS和SegTrack v2上进行的大量实验表明,我们的方法超越了当前最先进的自监督方法,并缩小了与全监督方法的差距。

相似文献

1
Self Supervised Progressive Network for High Performance Video Object Segmentation.用于高性能视频对象分割的自监督渐进网络
IEEE Trans Neural Netw Learn Syst. 2024 Jun;35(6):7671-7684. doi: 10.1109/TNNLS.2022.3219936. Epub 2024 Jun 3.
2
From Pixels to Semantics: Self-Supervised Video Object Segmentation With Multiperspective Feature Mining.
IEEE Trans Image Process. 2022;31:5801-5812. doi: 10.1109/TIP.2022.3201603. Epub 2022 Sep 8.
3
Video Object Segmentation Using Kernelized Memory Network With Multiple Kernels.
IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):2595-2612. doi: 10.1109/TPAMI.2022.3163375. Epub 2023 Jan 6.
4
Directional Deep Embedding and Appearance Learning for Fast Video Object Segmentation.用于快速视频对象分割的定向深度嵌入与外观学习
IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3884-3894. doi: 10.1109/TNNLS.2021.3054769. Epub 2022 Aug 3.
5
Segmentation in Weakly Labeled Videos via a Semantic Ranking and Optical Warping Network.通过语义排序和光流变形网络对弱标注视频进行分割
IEEE Trans Image Process. 2018 May 16. doi: 10.1109/TIP.2018.2834221.
6
SpVOS: Efficient Video Object Segmentation With Triple Sparse Convolution.SpVOS:基于三重稀疏卷积的高效视频对象分割
IEEE Trans Image Process. 2023;32:5977-5991. doi: 10.1109/TIP.2023.3327588. Epub 2023 Nov 7.
7
Collaborative Video Object Segmentation by Multi-Scale Foreground-Background Integration.基于多尺度前景-背景融合的协同视频对象分割
IEEE Trans Pattern Anal Mach Intell. 2022 Sep;44(9):4701-4712. doi: 10.1109/TPAMI.2021.3081597. Epub 2022 Aug 4.
8
Region Aware Video Object Segmentation With Deep Motion Modeling.基于深度运动建模的区域感知视频对象分割
IEEE Trans Image Process. 2024;33:2639-2651. doi: 10.1109/TIP.2024.3381445. Epub 2024 Apr 3.
9
Coarse Mask Guided Interactive Object Segmentation.粗掩码引导的交互式目标分割
IEEE Trans Image Process. 2023;32:5808-5822. doi: 10.1109/TIP.2023.3322564. Epub 2023 Oct 26.
10
Local contrastive loss with pseudo-label based self-training for semi-supervised medical image segmentation.基于伪标签自训练的局部对比损失的半监督医学图像分割。
Med Image Anal. 2023 Jul;87:102792. doi: 10.1016/j.media.2023.102792. Epub 2023 Mar 11.