Wang Le, Liu Hongzhen, Zhou Sanping, Tang Wei, Hua Gang
IEEE Trans Image Process. 2023;32:764-778. doi: 10.1109/TIP.2022.3226414. Epub 2023 Jan 18.
Video panoptic segmentation is an important but challenging task in computer vision: it not only performs panoptic segmentation of each frame, but also associates the same instances across adjacent frames. Due to the lack of temporal coherence modeling, most existing approaches suffer from identity switches during instance association and cannot handle the ambiguous segmentation boundaries caused by motion blur. To address these issues, we introduce a simple yet effective Instance Motion Tendency Network (IMTNet) for video panoptic segmentation. It learns a global motion tendency map for instance association and a hierarchical classifier for motion boundary refinement. Specifically, a Global Motion Tendency Module (GMTM) is designed to learn robust motion features from optical flow, which directly associate each instance in the previous frame with its corresponding instance in the current frame. In addition, we propose a Motion Boundary Refinement Module (MBRM) that learns a hierarchical classifier for the boundary pixels of moving targets, effectively revising inaccurate segmentation predictions. Experimental results on both the Cityscapes and Cityscapes-VPS datasets show that our IMTNet outperforms most state-of-the-art approaches.
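To make the association idea concrete, the sketch below shows one simple way optical flow can link instances across frames: warp each previous-frame instance mask forward by the flow and match it to the current-frame mask with the highest overlap. This is a hypothetical, greatly simplified illustration of flow-based association, not the paper's actual GMTM; the nearest-neighbour warping, the greedy IoU matching, and the `iou_thresh` parameter are all assumptions made for this example.

```python
import numpy as np

def warp_mask(mask, flow):
    """Warp a binary instance mask forward by a per-pixel flow field.
    Nearest-neighbour scatter; assumes flow[..., 0] = dx, flow[..., 1] = dy.
    (A simplification for illustration, not the paper's GMTM.)"""
    h, w = mask.shape
    warped = np.zeros_like(mask)
    ys, xs = np.nonzero(mask)
    nx = np.clip((xs + flow[ys, xs, 0]).round().astype(int), 0, w - 1)
    ny = np.clip((ys + flow[ys, xs, 1]).round().astype(int), 0, h - 1)
    warped[ny, nx] = 1
    return warped

def associate(prev_masks, curr_masks, flow, iou_thresh=0.3):
    """Greedily match each previous-frame instance to the current-frame
    instance whose mask best overlaps its flow-warped prediction."""
    matches = {}
    for i, pm in enumerate(prev_masks):
        wm = warp_mask(pm, flow)
        best_j, best_iou = -1, iou_thresh
        for j, cm in enumerate(curr_masks):
            inter = np.logical_and(wm, cm).sum()
            union = np.logical_or(wm, cm).sum()
            iou = inter / union if union else 0.0
            if iou > best_iou:
                best_j, best_iou = j, iou
        if best_j >= 0:
            matches[i] = best_j  # previous instance i -> current instance j
    return matches
```

In this toy scheme, identity switches arise exactly when the warped mask overlaps the wrong instance more than the right one, which is the failure mode a learned global motion tendency map is meant to suppress.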