• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于视频对象分割的参考帧自适应选择

Adaptive Selection of Reference Frames for Video Object Segmentation.

作者信息

Hong Lingyi, Zhang Wei, Chen Liangyu, Zhang Wenqiang, Fan Jianping

出版信息

IEEE Trans Image Process. 2022;31:1057-1071. doi: 10.1109/TIP.2021.3137660. Epub 2022 Jan 19.

DOI:10.1109/TIP.2021.3137660
PMID:34965210
Abstract

Video object segmentation is a challenging task in computer vision because the appearances of target objects might change drastically along the time in the video. To solve this problem, space-time memory (STM) networks are exploited to make use of the information from all the intermediate frames between the first frame and the current frame in the video. However, fully using the information from all the memory frames may make STM not practical for long videos. To overcome this issue, a novel method is developed in this paper to select the reference frames adaptively. First, an adaptive selection criterion is introduced to choose the reference frames with similar appearance and precise mask estimation, which can efficiently capture the rich information of the target object and overcome the challenges of appearance changes, occlusion, and model drift. Secondly, bi-matching (bi-scale and bi-direction) is conducted to obtain more robust correlations for objects of various scales and prevents multiple similar objects in the current frame from being mismatched with the same target object in the reference frame. Thirdly, a novel edge refinement technique is designed by using an edge detection network to obtain smooth edges from the outputs of edge confidence maps, where the edge confidence is quantized into ten sub-intervals to generate smooth edges step by step. Experimental results on the challenging benchmark datasets DAVIS-2016, DAVIS-2017, YouTube-VOS, and a Long-Video dataset have demonstrated the effectiveness of our proposed approach to video object segmentation.

摘要

视频目标分割是计算机视觉中的一项具有挑战性的任务,因为目标物体的外观可能会在视频中随时间发生剧烈变化。为了解决这个问题,人们利用时空记忆(STM)网络来利用视频中第一帧和当前帧之间所有中间帧的信息。然而,充分利用所有记忆帧的信息可能会使STM在处理长视频时不实用。为了克服这个问题,本文开发了一种新颖的方法来自适应地选择参考帧。首先,引入了一种自适应选择标准,以选择具有相似外观和精确掩码估计的参考帧,这可以有效地捕捉目标物体的丰富信息,并克服外观变化、遮挡和模型漂移等挑战。其次,进行双匹配(双尺度和双向),以获得各种尺度物体更稳健的相关性,并防止当前帧中的多个相似物体与参考帧中的同一目标物体不匹配。第三,设计了一种新颖的边缘细化技术,通过使用边缘检测网络从边缘置信度图的输出中获得平滑边缘,其中边缘置信度被量化为十个子区间,以逐步生成平滑边缘。在具有挑战性的基准数据集DAVIS-2016、DAVIS-2017、YouTube-VOS和一个长视频数据集上的实验结果证明了我们提出的视频目标分割方法的有效性。

相似文献

1
Adaptive Selection of Reference Frames for Video Object Segmentation.用于视频对象分割的参考帧自适应选择
IEEE Trans Image Process. 2022;31:1057-1071. doi: 10.1109/TIP.2021.3137660. Epub 2022 Jan 19.
2
Adaptive Online Mutual Learning Bi-Decoders for Video Object Segmentation.用于视频对象分割的自适应在线互学习双解码器
IEEE Trans Image Process. 2022;31:7063-7077. doi: 10.1109/TIP.2022.3219230. Epub 2022 Nov 15.
3
Segmentation in Weakly Labeled Videos via a Semantic Ranking and Optical Warping Network.通过语义排序和光流变形网络对弱标注视频进行分割
IEEE Trans Image Process. 2018 May 16. doi: 10.1109/TIP.2018.2834221.
4
Beyond Appearance: Multi-Frame Spatio-Temporal Context Memory Networks for Efficient and Robust Video Object Segmentation.超越表象:用于高效且稳健视频对象分割的多帧时空上下文记忆网络
IEEE Trans Image Process. 2024;33:4853-4866. doi: 10.1109/TIP.2024.3423390. Epub 2024 Sep 5.
5
Joint Video Object Discovery and Segmentation by Coupled Dynamic Markov Networks.联合视频对象发现和分割的耦合动态马尔可夫网络。
IEEE Trans Image Process. 2018 Dec;27(12):5840-5853. doi: 10.1109/TIP.2018.2859622. Epub 2018 Jul 30.
6
Exploring Weakly Labeled Images for Video Object Segmentation With Submodular Proposal Selection.基于子模提案选择的视频对象分割中弱标注图像的探索。
IEEE Trans Image Process. 2018 Sep;27(9):4245-4259. doi: 10.1109/TIP.2018.2806995.
7
Video Object Segmentation Using Kernelized Memory Network With Multiple Kernels.
IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):2595-2612. doi: 10.1109/TPAMI.2022.3163375. Epub 2023 Jan 6.
8
Meta-VOS: Learning to Adapt Online Target-Specific Segmentation.元虚拟目标分割:学习适应在线特定目标分割
IEEE Trans Image Process. 2021;30:4760-4772. doi: 10.1109/TIP.2021.3075086. Epub 2021 May 5.
9
SpVOS: Efficient Video Object Segmentation With Triple Sparse Convolution.SpVOS:基于三重稀疏卷积的高效视频对象分割
IEEE Trans Image Process. 2023;32:5977-5991. doi: 10.1109/TIP.2023.3327588. Epub 2023 Nov 7.
10
Video Object Discovery and Co-Segmentation with Extremely Weak Supervision.基于极弱监督的视频目标发现与协同分割。
IEEE Trans Pattern Anal Mach Intell. 2017 Oct;39(10):2074-2088. doi: 10.1109/TPAMI.2016.2612187. Epub 2016 Oct 26.