• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于深度视频修复的循环时间聚合框架

Recurrent Temporal Aggregation Framework for Deep Video Inpainting.

作者信息

Kim Dahun, Woo Sanghyun, Lee Joon-Young, Kweon In So

出版信息

IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1038-1052. doi: 10.1109/TPAMI.2019.2958083. Epub 2019 Dec 11.

DOI:10.1109/TPAMI.2019.2958083
PMID:31831407
Abstract

Video inpainting aims to fill in spatio-temporal holes in videos with plausible content. Despite tremendous progress on deep learning-based inpainting of a single image, it is still challenging to extend these methods to video domain due to the additional time dimension. In this paper, we propose a recurrent temporal aggregation framework for fast deep video inpainting. In particular, we construct an encoder-decoder model, where the encoder takes multiple reference frames which can provide visible pixels revealed from the scene dynamics. These hints are aggregated and fed into the decoder. We apply a recurrent feedback in an auto-regressive manner to enforce temporal consistency in the video results. We propose two architectural designs based on this framework. Our first model is a blind video decaptioning network (BVDNet) that is designed to automatically remove and inpaint text overlays in videos without any mask information. Our BVDNet wins the first place in the ECCV Chalearn 2018 LAP Inpainting Competition Track 2: Video Decaptioning. Second, we propose a network for more general video inpainting (VINet) to deal with more arbitrary and larger holes. Video results demonstrate the advantage of our framework compared to state-of-the-art methods both qualitatively and quantitatively. The codes are available at https://github.com/mcahny/Deep-Video-Inpainting, and https://github.com/shwoo93/video_decaptioning.

摘要

视频修复旨在用合理的内容填充视频中的时空空洞。尽管基于深度学习的单图像修复取得了巨大进展,但由于额外的时间维度,将这些方法扩展到视频领域仍然具有挑战性。在本文中,我们提出了一种用于快速深度视频修复的循环时间聚合框架。具体来说,我们构建了一个编码器 - 解码器模型,其中编码器采用多个参考帧,这些参考帧可以提供从场景动态中揭示的可见像素。这些线索被聚合并输入到解码器中。我们以自回归的方式应用循环反馈来增强视频结果中的时间一致性。基于这个框架,我们提出了两种架构设计。我们的第一个模型是一个盲视频字幕去除网络(BVDNet),它被设计用于在没有任何掩码信息的情况下自动去除和修复视频中的文本覆盖。我们的BVDNet在ECCV 2018 Chalearn LAP修复竞赛赛道2:视频字幕去除中获得第一名。其次,我们提出了一个用于更通用视频修复的网络(VINet)来处理更任意和更大的空洞。视频结果在定性和定量方面都证明了我们的框架相对于现有方法的优势。代码可在https://github.com/mcahny/Deep-Video-Inpainting和https://github.com/shwoo93/video_decaptioning获取。

相似文献

1
Recurrent Temporal Aggregation Framework for Deep Video Inpainting.用于深度视频修复的循环时间聚合框架
IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1038-1052. doi: 10.1109/TPAMI.2019.2958083. Epub 2019 Dec 11.
2
Encoder-Driven Inpainting Strategy in Multiview Video Compression.基于编解码器驱动的多视角视频压缩中的插补策略。
IEEE Trans Image Process. 2016 Jan;25(1):134-49. doi: 10.1109/TIP.2015.2498400. Epub 2015 Nov 5.
3
Learning a spatial-temporal texture transformer network for video inpainting.学习用于视频修复的时空纹理Transformer网络。
Front Neurorobot. 2022 Oct 13;16:1002453. doi: 10.3389/fnbot.2022.1002453. eCollection 2022.
4
Image Inpainting With Local and Global Refinement.图像修复:局部与全局细化
IEEE Trans Image Process. 2022;31:2405-2420. doi: 10.1109/TIP.2022.3152624. Epub 2022 Mar 15.
5
MCD-Net: Toward RGB-D Video Inpainting in Real-World Scenes.MCD-Net:面向真实场景中的RGB-D视频修复
IEEE Trans Image Process. 2024;33:1095-1108. doi: 10.1109/TIP.2024.3358675. Epub 2024 Feb 5.
6
A Temporally-Aware Interpolation Network for Video Frame Inpainting.用于视频帧修复的时间感知插值网络
IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1053-1068. doi: 10.1109/TPAMI.2019.2951667. Epub 2019 Nov 6.
7
Video inpainting under constrained camera motion.受限相机运动下的视频修复
IEEE Trans Image Process. 2007 Feb;16(2):545-53. doi: 10.1109/tip.2006.888343.
8
Self-Supervised Video Hashing With Hierarchical Binary Auto-Encoder.基于层次化二值自动编码器的自监督视频哈希。
IEEE Trans Image Process. 2018 Jul;27(7):3210-3221. doi: 10.1109/TIP.2018.2814344.
9
Deep Face Video Inpainting via UV Mapping.
IEEE Trans Image Process. 2023;32:1145-1157. doi: 10.1109/TIP.2023.3240835. Epub 2023 Feb 10.
10
A Temporal Learning Approach to Inpainting Endoscopic Specularities and Its Effect on Image Correspondence.一种用于修复内镜镜面反射的时间学习方法及其对图像匹配的影响。
Med Image Anal. 2023 Dec;90:102994. doi: 10.1016/j.media.2023.102994. Epub 2023 Oct 4.

引用本文的文献

1
An effective video inpainting technique using morphological Haar wavelet transform with krill herd based criminisi algorithm.一种基于磷虾群的Criminisi算法,采用形态学哈尔小波变换的有效视频修复技术。
Sci Rep. 2024 Jul 5;14(1):15485. doi: 10.1038/s41598-024-66496-x.
2
Sequential vessel segmentation via deep channel attention network.基于深度通道注意力网络的血管序列分割。
Neural Netw. 2020 Aug;128:172-187. doi: 10.1016/j.neunet.2020.05.005. Epub 2020 May 13.