• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于高斯混合模型的自由视点视频和多视点视频压缩的虚拟视图合成。

Virtual View Synthesis for Free Viewpoint Video and Multiview Video Compression using Gaussian Mixture Modelling.

出版信息

IEEE Trans Image Process. 2018 Mar;27(3):1190-1201. doi: 10.1109/TIP.2017.2772858.

DOI:10.1109/TIP.2017.2772858
PMID:29220320
Abstract

High quality virtual views need to be synthesized from adjacent available views for free viewpoint video and multiview video coding (MVC) to provide users with a more realistic 3D viewing experience of a scene. View synthesis techniques suffer from poor rendering quality due to holes created by occlusion and rounding integer error through warping. To remove the holes in the virtual view, the existing techniques use spatial and temporal correlation in intra/inter-view images and depth maps. However, they still suffer quality degradation in the boundary region of foreground and background areas due to the low spatial correlation in texture images and low correspondence in inter-view depth maps. To overcome the above-mentioned limitations, we use a number of models in the Gaussian mixture modeling (GMM) to separate background and foreground pixels in our proposed technique. Here, the missing pixels introduced from the warping process are recovered by the adaptive weighted average of the pixel intensities from the corresponding GMM model(s) and warped image. The weights vary with time to accommodate the changes due to a dynamic background and the motions of the moving objects for view synthesis. We also introduce an adaptive strategy to reset the GMM modeling if the contributions of the pixel intensities drop significantly. Our experimental results indicate that the proposed approach provides 5.40-6.60-dB PSNR improvement compared with the relevant methods. To verify the effectiveness of the proposed view synthesis technique, we use it as an extra reference frame in the motion estimation for MVC. The experimental results confirm that the proposed view synthesis is able to improve PSNR by 3.15-5.13 dB compared with the conventional three reference frames.

摘要

为了提供场景的更真实的 3D 观看体验,自由视点视频和多视点视频编码(MVC)需要从相邻的可用视图中合成高质量的虚拟视图。由于遮挡和整数舍入误差导致的变形而产生的空洞,视图合成技术的渲染质量较差。为了消除虚拟视图中的空洞,现有的技术在帧内/帧间图像和深度图中使用空间和时间相关性。然而,由于纹理图像中的空间相关性较低以及帧间深度图中的对应性较低,它们在前景和背景区域的边界区域仍然存在质量下降的问题。为了克服上述限制,我们在提出的技术中使用了一些高斯混合建模(GMM)模型来分离背景和前景像素。在这里,通过从对应 GMM 模型(多个模型)和变形图像中自适应地对像素强度进行加权平均来恢复变形过程中引入的缺失像素。权重随时间变化,以适应由于动态背景和运动物体的运动而导致的变化,以进行视图合成。我们还引入了一种自适应策略,如果像素强度的贡献显著下降,则重置 GMM 建模。我们的实验结果表明,与相关方法相比,所提出的方法提供了 5.40-6.60dB 的 PSNR 改进。为了验证所提出的视图合成技术的有效性,我们将其用作 MVC 中运动估计的额外参考帧。实验结果证实,与传统的三个参考帧相比,所提出的视图合成能够将 PSNR 提高 3.15-5.13dB。

相似文献

1
Virtual View Synthesis for Free Viewpoint Video and Multiview Video Compression using Gaussian Mixture Modelling.基于高斯混合模型的自由视点视频和多视点视频压缩的虚拟视图合成。
IEEE Trans Image Process. 2018 Mar;27(3):1190-1201. doi: 10.1109/TIP.2017.2772858.
2
Encoder-Driven Inpainting Strategy in Multiview Video Compression.基于编解码器驱动的多视角视频压缩中的插补策略。
IEEE Trans Image Process. 2016 Jan;25(1):134-49. doi: 10.1109/TIP.2015.2498400. Epub 2015 Nov 5.
3
Efficient multiview depth coding optimization based on allowable depth distortion in view synthesis.基于视图合成中允许的深度失真的高效多视图深度编码优化。
IEEE Trans Image Process. 2014 Nov;23(11):4879-92. doi: 10.1109/TIP.2014.2355715. Epub 2014 Sep 8.
4
Adaptive image warping for hole prevention in 3D view synthesis.自适应图像变形防止 3D 视图合成中的空洞。
IEEE Trans Image Process. 2013 Sep;22(9):3420-32. doi: 10.1109/TIP.2013.2268940. Epub 2013 Jun 14.
5
Regional bit allocation and rate distortion optimization for multiview depth video coding with view synthesis distortion model.基于视图合成失真模型的多视点深度视频编码的区域比特分配和率失真优化。
IEEE Trans Image Process. 2013 Sep;22(9):3497-512. doi: 10.1109/TIP.2013.2265883. Epub 2013 Jun 3.
6
Arbitrarily shaped motion prediction for depth video compression using arithmetic edge coding.使用算术边缘编码对深度视频压缩进行任意形状运动预测。
IEEE Trans Image Process. 2014 Nov;23(11):4696-708. doi: 10.1109/TIP.2014.2353817. Epub 2014 Aug 29.
7
Multiview video plus depth transmission via virtual-view-assisted complementary down/upsampling.通过虚拟视图辅助互补下采样/上采样实现多视图视频加深度传输。
EURASIP J Image Video Process. 2016;2016:19. doi: 10.1186/s13640-016-0119-4. Epub 2016 Apr 29.
8
Bit allocation algorithm with novel view synthesis distortion model for multiview video plus depth coding.基于新颖视图合成失真模型的多视点视频加深度编码比特分配算法。
IEEE Trans Image Process. 2014 Aug;23(8):3254-67. doi: 10.1109/TIP.2014.2327801.
9
Cross-View Multi-Lateral Filter for Compressed Multi-View Depth Video.用于压缩多视角深度视频的跨视图多边滤波器。
IEEE Trans Image Process. 2019 Jan;28(1):302-315. doi: 10.1109/TIP.2018.2867740. Epub 2018 Aug 29.
10
Virtual-view PSNR prediction based on a depth distortion tolerance model and support vector machine.基于深度失真容忍模型和支持向量机的虚拟视图峰值信噪比预测
Appl Opt. 2017 Oct 20;56(30):8547-8554. doi: 10.1364/AO.56.008547.

引用本文的文献

1
Towards Quality Assessment for Arbitrary Translational 6DoF Video: Subjective Quality Database and Objective Assessment Metric.面向任意平移6自由度视频的质量评估:主观质量数据库与客观评估指标
Entropy (Basel). 2025 Jan 7;27(1):44. doi: 10.3390/e27010044.
2
Video Rain-Streaks Removal by Combining Data-Driven and Feature-Based Models.结合数据驱动和基于特征的模型去除视频雨痕
Sensors (Basel). 2021 Oct 15;21(20):6856. doi: 10.3390/s21206856.