Beyond Appearance: Multi-Frame Spatio-Temporal Context Memory Networks for Efficient and Robust Video Object Segmentation.

Authors

Dang Jisheng, Zheng Huicheng, Xu Xiaohao, Wang Longguang, Guo Yulan

Publication information

IEEE Trans Image Process. 2024;33:4853-4866. doi: 10.1109/TIP.2024.3423390. Epub 2024 Sep 5.

DOI: 10.1109/TIP.2024.3423390
PMID: 39208058
Abstract

Current video object segmentation approaches primarily rely on frame-wise appearance information to perform matching. Despite significant progress, reliable matching becomes challenging due to rapid changes of the object's appearance over time. Moreover, previous matching mechanisms suffer from redundant computation and noise interference as the number of accumulated frames increases. In this paper, we introduce a multi-frame spatio-temporal context memory (STCM) network to exploit discriminative spatio-temporal cues in multiple adjacent frames by utilizing a multi-frame context interaction module (MCI) for memory construction. Based on the proposed MCI module, a sparse group memory reader is developed to enable efficient sparse matching during memory reading. Our proposed method is generic and achieves state-of-the-art performance with real-time speed on benchmark datasets such as DAVIS and YouTube-VOS. In addition, our model exhibits robustness to sparse videos with low frame rates.
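
The abstract mentions sparse matching during memory reading but gives no implementation details. As a rough illustration of the general idea only, and not the paper's sparse group memory reader, a top-k memory read could be sketched as follows. The function name `sparse_memory_read`, the parameter `k`, and the scaled dot-product affinity are all assumptions made for this example.

```python
import math


def dot(a, b):
    """Plain dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def sparse_memory_read(query, mem_keys, mem_values, k):
    """Read from memory using only the k best-matching entries.

    Hypothetical helper, not taken from the paper.
    query:      feature vector for one query location
    mem_keys:   N key vectors stored from earlier frames
    mem_values: N value vectors paired with mem_keys
    k:          number of memory entries kept for the softmax (assumed knob)
    """
    if not mem_keys or len(mem_keys) != len(mem_values):
        raise ValueError("memory must be non-empty with matching keys and values")
    k = max(1, min(k, len(mem_keys)))

    # Scaled dot-product affinity between the query and every memory key.
    scale = math.sqrt(len(query))
    scores = [dot(query, key) / scale for key in mem_keys]

    # Keep the k highest affinities; all other entries are dropped entirely.
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]

    # Numerically stable softmax over the surviving entries only.
    peak = max(scores[i] for i in top)
    exps = {i: math.exp(scores[i] - peak) for i in top}
    total = sum(exps.values())
    weights = {i: e / total for i, e in exps.items()}

    # Weighted sum of the selected value vectors.
    dim = len(mem_values[0])
    readout = [sum(weights[i] * mem_values[i][d] for i in top) for d in range(dim)]
    return readout, weights


# Toy memory: two entries resemble the query, two do not.
keys = [[1.0, 0.0], [0.9, 0.1], [-1.0, 0.0], [0.0, 1.0]]
values = [[1.0], [1.0], [0.0], [0.0]]
readout, weights = sparse_memory_read([1.0, 0.0], keys, values, k=2)
# With k=2, only entries 0 and 1 receive any weight.
```

Setting `k` to the memory size recovers an ordinary dense softmax read, so in this sketch the sparsity is a single knob that trades matching cost and noise suppression against coverage of the memory.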


Similar articles

1. Beyond Appearance: Multi-Frame Spatio-Temporal Context Memory Networks for Efficient and Robust Video Object Segmentation.
   IEEE Trans Image Process. 2024;33:4853-4866. doi: 10.1109/TIP.2024.3423390. Epub 2024 Sep 5.
2. Adaptive Sparse Memory Networks for Efficient and Robust Video Object Segmentation.
   IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):3820-3833. doi: 10.1109/TNNLS.2024.3357118. Epub 2025 Feb 6.
3. SpVOS: Efficient Video Object Segmentation With Triple Sparse Convolution.
   IEEE Trans Image Process. 2023;32:5977-5991. doi: 10.1109/TIP.2023.3327588. Epub 2023 Nov 7.
4. Efficient and Robust Video Object Segmentation Through Isogenous Memory Sampling and Frame Relation Mining.
   IEEE Trans Image Process. 2023;32:3924-3938. doi: 10.1109/TIP.2023.3280389. Epub 2023 Jul 17.
5. Adaptive Selection of Reference Frames for Video Object Segmentation.
   IEEE Trans Image Process. 2022;31:1057-1071. doi: 10.1109/TIP.2021.3137660. Epub 2022 Jan 19.
6. Directional Deep Embedding and Appearance Learning for Fast Video Object Segmentation.
   IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3884-3894. doi: 10.1109/TNNLS.2021.3054769. Epub 2022 Aug 3.
7. Region Aware Video Object Segmentation With Deep Motion Modeling.
   IEEE Trans Image Process. 2024;33:2639-2651. doi: 10.1109/TIP.2024.3381445. Epub 2024 Apr 3.
8. Video Object Discovery and Co-Segmentation with Extremely Weak Supervision.
   IEEE Trans Pattern Anal Mach Intell. 2017 Oct;39(10):2074-2088. doi: 10.1109/TPAMI.2016.2612187. Epub 2016 Oct 26.
9. Video Object Segmentation Using Kernelized Memory Network With Multiple Kernels.
   IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):2595-2612. doi: 10.1109/TPAMI.2022.3163375. Epub 2023 Jan 6.
10. Online Meta Adaptation for Fast Video Object Segmentation.
    IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1205-1217. doi: 10.1109/TPAMI.2018.2890659. Epub 2019 Jan 14.