Zhou Yu, Li Leida, Wang Shiqi, Wu Jinjian, Fang Yuming, Gao Xinbo
IEEE Trans Image Process. 2019 Apr 26. doi: 10.1109/TIP.2019.2912463.
View synthesis is a key technique in free-viewpoint video, which renders virtual views based on texture and depth images. The distortions in synthesized views come from two stages, i.e., the stage of the acquisition and processing of texture and depth images, and the rendering stage using depth-image-based-rendering (DIBR) algorithms. The existing view synthesis quality metrics are designed for the distortions caused by a single stage, which cannot accurately evaluate the quality of the entire view synthesis process. With the considerations that the distortions introduced by two stages both cause edge degradation and texture unnaturalness, and the Difference-of-Gaussian (DoG) representation is powerful in capturing image edge and texture characteristics by simulating the center-surrounding receptive fields of retinal ganglion cells of human eyes, this paper presents a no-reference quality index for Synthesized views using DoG-based Edge statistics and Texture naturalness (SET). To mimic the multi-scale property of the Human Visual System (HVS), DoG images are first calculated at multiple scales. Then the orientation selective statistics features and the texture naturalness features are calculated on the DoG images and the coarsest scale image, producing two groups of quality-aware features. Finally, the quality model is learnt from these features using the random forest regression model. Experimental results on two view synthesis image databases demonstrate that the proposed metric is advantageous over the relevant state-of-the-arts in dealing with the distortions in the whole view synthesis process.
视图合成是自由视点视频中的一项关键技术,它基于纹理和深度图像渲染虚拟视图。合成视图中的失真来自两个阶段,即纹理和深度图像的采集与处理阶段,以及使用基于深度图像渲染(DIBR)算法的渲染阶段。现有的视图合成质量度量是针对单个阶段引起的失真设计的,无法准确评估整个视图合成过程的质量。考虑到两个阶段引入的失真都会导致边缘退化和纹理不自然,且高斯差分(DoG)表示通过模拟人眼视网膜神经节细胞的中心 - 周边感受野,在捕捉图像边缘和纹理特征方面具有强大功能,本文提出了一种基于DoG边缘统计和纹理自然度(SET)的合成视图无参考质量指标。为了模拟人类视觉系统(HVS)的多尺度特性,首先在多个尺度上计算DoG图像。然后在DoG图像和最粗尺度图像上计算方向选择性统计特征和纹理自然度特征,产生两组质量感知特征。最后,使用随机森林回归模型从这些特征中学习质量模型。在两个视图合成图像数据库上的实验结果表明,所提出的度量在处理整个视图合成过程中的失真方面优于相关的现有技术。