Iterative Knowledge Exchange Between Deep Learning and Space-Time Spectral Clustering for Unsupervised Segmentation in Videos.

Authors

Haller Emanuela, Florea Adina Magda, Leordeanu Marius

Publication information

IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):7638-7656. doi: 10.1109/TPAMI.2021.3120228. Epub 2022 Oct 4.

DOI:10.1109/TPAMI.2021.3120228
PMID:34648435
Abstract

We propose a dual system for unsupervised object segmentation in video, which brings together two modules with complementary properties: a space-time graph that discovers objects in videos and a deep network that learns powerful object features. The system uses an iterative knowledge exchange policy. A novel spectral space-time clustering process on the graph produces unsupervised segmentation masks, which are passed to the network as pseudo-labels. The network learns to segment in single frames what the graph discovers in video, and passes back to the graph strong image-level features that improve its node-level features in the next iteration. Knowledge is exchanged for several cycles until convergence. The graph has one node per video pixel, yet object discovery is fast: a novel power iteration algorithm computes the main space-time cluster as the principal eigenvector of a special Feature-Motion matrix without explicitly forming the matrix. A thorough experimental analysis validates our theoretical claims and demonstrates the effectiveness of the cyclical knowledge exchange. We also perform experiments in the supervised scenario, incorporating features pretrained with human supervision. We achieve state-of-the-art results in both the unsupervised and supervised scenarios on four challenging datasets: DAVIS, SegTrack, YouTube-Objects, and DAVSOD. We will make our code publicly available.
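
The abstract states that the main cluster is found as a principal eigenvector by power iteration without computing the matrix. The sketch below illustrates that general idea only. The abstract does not define the Feature-Motion matrix, so the example assumes a generic stand-in, M = F Fᵀ, where F is a hypothetical table with one short feature row per node. The names `matvec_implicit`, `power_iteration`, and `features` are illustrative and do not come from the paper.

```python
import math


def matvec_implicit(features, v):
    """Return (F @ F.T) @ v computed as F @ (F.T @ v).

    Assumption: M = F F^T is only a stand-in for a large structured matrix.
    The n x n matrix is never built; the cost is O(n * d) per product.
    """
    d = len(features[0])
    # t = F^T v, a vector of length d
    t = [sum(row[k] * x for row, x in zip(features, v)) for k in range(d)]
    # F t, a vector of length n
    return [sum(row[k] * t[k] for k in range(d)) for row in features]


def power_iteration(matvec, n, iters=100, tol=1e-10):
    """Approximate the principal eigenvector using only matrix-vector products."""
    v = [1.0 / math.sqrt(n)] * n  # uniform unit-norm starting vector
    for _ in range(iters):
        w = matvec(v)
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            break  # v lies in the null space; nothing more to do
        w = [x / norm for x in w]
        delta = max(abs(a - b) for a, b in zip(w, v))
        v = w
        if delta < tol:
            break
    return v


# Hypothetical toy data: nodes 0-2 share similar features, node 3 does not.
features = [[1.0, 0.0], [1.0, 0.1], [0.9, 0.0], [0.0, 0.2]]
v = power_iteration(lambda x: matvec_implicit(features, x), len(features))
print([round(x, 3) for x in v])
```

In this toy setting the entries of `v` act as soft membership scores: the three similar nodes receive large values and the outlier a value near zero. Each iteration touches only the n × d feature table, which is why a graph with one node per pixel can remain tractable when the matrix has this kind of structure.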
