通过动态图嵌入学习用于图像序列时间分割的事件表示。

Learning Event Representations for Temporal Segmentation of Image Sequences by Dynamic Graph Embedding.

作者信息

Dimiccoli Mariella, Wendt Herwig

出版信息

IEEE Trans Image Process. 2021;30:1476-1486. doi: 10.1109/TIP.2020.3044448. Epub 2020 Dec 31.

DOI:10.1109/TIP.2020.3044448

Abstract

Recently, self-supervised learning has proved to be effective to learn representations of events suitable for temporal segmentation in image sequences, where events are understood as sets of temporally adjacent images that are semantically perceived as a whole. However, although this approach does not require expensive manual annotations, it is data hungry and suffers from domain adaptation problems. As an alternative, in this work, we propose a novel approach for learning event representations named Dynamic Graph Embedding (DGE). The assumption underlying our model is that a sequence of images can be represented by a graph that encodes both semantic and temporal similarity. The key novelty of DGE is to learn jointly the graph and its graph embedding. At its core, DGE works by iterating over two steps: 1) updating the graph representing the semantic and temporal similarity of the data based on the current data representation, and 2) updating the data representation to take into account the current data graph structure. The main advantage of DGE over state-of-the-art self-supervised approaches is that it does not require any training set, but instead learns iteratively from the data itself a low-dimensional embedding that reflects their temporal and semantic similarity. Experimental results on two benchmark datasets of real image sequences captured at regular time intervals demonstrate that the proposed DGE leads to event representations effective for temporal segmentation. In particular, it achieves robust temporal segmentation on the EDUBSeg and EDUBSeg-Desc benchmark datasets, outperforming the state of the art. Additional experiments on two Human Motion Segmentation benchmark datasets demonstrate the generalization capabilities of the proposed DGE.

摘要

最近，自监督学习已被证明在学习适用于图像序列中时间分割的事件表示方面是有效的，其中事件被理解为在时间上相邻且在语义上被视为一个整体的图像集合。然而，尽管这种方法不需要昂贵的人工标注，但它对数据要求很高，并且存在领域适应问题。作为一种替代方法，在这项工作中，我们提出了一种名为动态图嵌入（DGE）的学习事件表示的新方法。我们模型的基本假设是，图像序列可以由一个编码语义和时间相似性的图来表示。DGE的关键新颖之处在于联合学习图及其图嵌入。其核心是，DGE通过迭代两个步骤来工作：1）基于当前的数据表示更新表示数据语义和时间相似性的图，2）更新数据表示以考虑当前的数据图结构。DGE相对于现有自监督方法的主要优势在于它不需要任何训练集，而是从数据本身迭代学习一个反映其时间和语义相似性的低维嵌入。在以固定时间间隔捕获的真实图像序列的两个基准数据集上的实验结果表明，所提出的DGE能够生成对时间分割有效的事件表示。特别是，它在EDUBSeg和EDUBSeg-Desc基准数据集上实现了稳健的时间分割，性能优于现有技术。在两个人体运动分割基准数据集上的额外实验证明了所提出的DGE的泛化能力。

相似文献

Learning Event Representations for Temporal Segmentation of Image Sequences by Dynamic Graph Embedding.通过动态图嵌入学习用于图像序列时间分割的事件表示。

IEEE Trans Image Process. 2021;30:1476-1486. doi: 10.1109/TIP.2020.3044448. Epub 2020 Dec 31.

Graph based multi-scale neighboring topology deep learning for kidney and tumor segmentation.基于图的多尺度邻域拓扑深度学习用于肾脏和肿瘤分割

Phys Med Biol. 2022 Nov 18;67(22). doi: 10.1088/1361-6560/ac9e3f.

Semi Supervised Learning with Deep Embedded Clustering for Image Classification and Segmentation.用于图像分类和分割的深度嵌入聚类半监督学习

IEEE Access. 2019;7:11093-11104. doi: 10.1109/ACCESS.2019.2891970. Epub 2019 Jan 9.

TigeCMN: On exploration of temporal interaction graph embedding via Coupled Memory Neural Networks.TigeCMN：基于耦合记忆神经网络的时间交互图嵌入探索。

Neural Netw. 2021 Aug;140:13-26. doi: 10.1016/j.neunet.2021.02.016. Epub 2021 Mar 4.

Learning representations for gene ontology terms by jointly encoding graph structure and textual node descriptors.通过联合编码图结构和文本节点描述符来学习基因本体论术语的表示。

Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac318.

Group-Wise Learning for Weakly Supervised Semantic Segmentation.基于群体学习的弱监督语义分割。

IEEE Trans Image Process. 2022;31:799-811. doi: 10.1109/TIP.2021.3132834. Epub 2022 Jan 4.

Unsupervised Event Graph Representation and Similarity Learning on Biomedical Literature.基于生物医学文献的无监督事件图表示和相似性学习。

Sensors (Basel). 2021 Dec 21;22(1):3. doi: 10.3390/s22010003.

A unified semi-supervised model with joint estimation of graph, soft labels and latent subspace.具有图、软标签和潜在子空间联合估计的统一半监督模型。

Neural Netw. 2023 Sep;166:248-259. doi: 10.1016/j.neunet.2023.07.014. Epub 2023 Jul 17.

Text-Graph Enhanced Knowledge Graph Representation Learning.文本-图增强的知识图谱表示学习

Front Artif Intell. 2021 Aug 17;4:697856. doi: 10.3389/frai.2021.697856. eCollection 2021.

Learning Relationship-Enhanced Semantic Graph for Fine-Grained Image-Text Matching.用于细粒度图像-文本匹配的学习关系增强语义图

IEEE Trans Cybern. 2024 Feb;54(2):948-961. doi: 10.1109/TCYB.2022.3179020. Epub 2024 Jan 17.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

通过动态图嵌入学习用于图像序列时间分割的事件表示。

Learning Event Representations for Temporal Segmentation of Image Sequences by Dynamic Graph Embedding.

作者信息

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献