Deep Video Prior for Video Consistency and Propagation.

Authors

Lei Chenyang, Xing Yazhou, Ouyang Hao, Chen Qifeng

Publication information

IEEE Trans Pattern Anal Mach Intell. 2023 Jan;45(1):356-371. doi: 10.1109/TPAMI.2022.3142071. Epub 2022 Dec 5.

DOI: 10.1109/TPAMI.2022.3142071
PMID: 35015633
Abstract

Applying an image processing algorithm independently to each video frame often leads to temporal inconsistency in the resulting video. To address this issue, we present a novel and general approach for blind video temporal consistency. Our method is trained directly on a single pair of original and processed videos rather than on a large dataset. Unlike most previous methods, which enforce temporal consistency with optical flow, we show that temporal consistency can be achieved by training a convolutional network on a video with Deep Video Prior (DVP). Moreover, a carefully designed iteratively reweighted training strategy is proposed to address the challenging multimodal inconsistency problem. We demonstrate the effectiveness of our approach on 7 computer vision tasks on videos. Extensive quantitative and perceptual experiments show that our approach outperforms state-of-the-art methods on blind video temporal consistency. We further extend DVP to video propagation and demonstrate its effectiveness in propagating three different types of information (color, artistic style, and object segmentation). A progressive propagation strategy with pseudo labels is also proposed to enhance DVP's performance on video propagation. Our source code is publicly available at https://github.com/ChenyangLEI/deep-video-prior.
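
The core idea in the abstract, fitting one shared mapping to a single pair of original and processed videos, can be illustrated with a toy sketch. This is not the authors' implementation: an affine map stands in for the convolutional network, frames are short 1-D lists, and every name here (`make_toy_video`, `fit_shared_mapping`, `flicker_score`, `stabilize`) is hypothetical. Because an affine map cannot overfit per-frame noise, the sketch sidesteps the question of when to stop training, which a real network would have to handle. It also does not cover the iteratively reweighted strategy.

```python
import random


def make_toy_video(num_frames=8, num_pixels=16, seed=0):
    """Build a toy (original, processed) video pair as lists of 1-D frames."""
    rng = random.Random(seed)
    base = [rng.random() for _ in range(num_pixels)]
    originals, processed = [], []
    for t in range(num_frames):
        # Consecutive original frames differ only by a slow brightness drift.
        frame = [min(1.0, p + 0.01 * t) for p in base]
        # Assumed per-frame algorithm: the same tone curve on every frame,
        # plus a random brightness jump that shows up as flicker.
        flicker = rng.uniform(-0.2, 0.2)
        originals.append(frame)
        processed.append([0.5 * p + 0.1 + flicker for p in frame])
    return originals, processed


def fit_shared_mapping(originals, processed, steps=500, lr=0.3):
    """Fit one affine map y = a*x + b to all pixels of all frames at once."""
    a, b = 1.0, 0.0
    n = sum(len(frame) for frame in originals)
    for _ in range(steps):
        grad_a = grad_b = 0.0
        for frame, target in zip(originals, processed):
            for x, y in zip(frame, target):
                err = a * x + b - y  # residual of the shared map on this pixel
                grad_a += 2.0 * err * x / n
                grad_b += 2.0 * err / n
        a -= lr * grad_a
        b -= lr * grad_b
    return a, b


def flicker_score(frames):
    """Mean absolute change in average brightness between neighbouring frames."""
    means = [sum(f) / len(f) for f in frames]
    return sum(abs(m2 - m1) for m1, m2 in zip(means, means[1:])) / (len(means) - 1)


def stabilize(originals, processed):
    """Re-render every frame through the single mapping fitted to the pair."""
    a, b = fit_shared_mapping(originals, processed)
    return [[a * x + b for x in frame] for frame in originals]
```

Calling `stabilize(*make_toy_video())` returns frames whose `flicker_score` can be compared with that of the processed input. The shared map can only reproduce what is a consistent function of the original frame, so the per-frame brightness jumps are largely averaged out.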

Cited by

1. Low-Light Image and Video Enhancement for More Robust Computer Vision Tasks: A Review.
J Imaging. 2025 Apr 21;11(4):125. doi: 10.3390/jimaging11040125.