Cheng Shan, Zeng Huanqiang, Chen Jing, Hou Junhui, Zhu Jianqing, Ma Kai-Kuang
IEEE Trans Image Process. 2020 Aug 26;PP. doi: 10.1109/TIP.2020.3018256.
In this paper, we make the first attempt to study the subjective and objective quality assessment for the screen content videos (SCVs). For that, we construct the first large-scale video quality assessment (VQA) database specifically for the SCVs, called the screen content video database (SCVD). This SCVD provides 16 reference SCVs, 800 distorted SCVs, and their corresponding subjective scores, and it is made publicly available for research usage. The distorted SCVs are generated from each reference SCV with 10 distortion types and 5 degradation levels for each distortion type. Each distorted SCV is rated by at least 32 subjects in the subjective test. Furthermore, we propose the first full-reference VQA model for the SCVs, called the spatiotemporal Gabor feature tensor-based model (SGFTM), to objectively evaluate the perceptual quality of the distorted SCVs. This is motivated by the observation that 3D-Gabor filter can well stimulate the visual functions of the human visual system (HVS) on perceiving videos, being more sensitive to the edge and motion information that are often-encountered in the SCVs. Specifically, the proposed SGFTM exploits 3D-Gabor filter to individually extract the spatiotemporal Gabor feature tensors from the reference and distorted SCVs, followed by measuring their similarities and later combining them together through the developed spatiotemporal feature tensor pooling strategy to obtain the final SGFTM score. Experimental results on SCVD have shown that the proposed SGFTM yields a high consistency on the subjective perception of SCV quality and consistently outperforms multiple classical and state-of-the-art image/video quality assessment models.
在本文中,我们首次尝试研究屏幕内容视频(SCV)的主观和客观质量评估。为此,我们构建了首个专门用于SCV的大规模视频质量评估(VQA)数据库,称为屏幕内容视频数据库(SCVD)。该SCVD提供了16个参考SCV、800个失真SCV及其相应的主观评分,并已公开发布供研究使用。失真SCV由每个参考SCV生成,有10种失真类型,每种失真类型有5个降级级别。在主观测试中,每个失真SCV至少由32名受试者进行评分。此外,我们提出了首个用于SCV的全参考VQA模型,称为基于时空Gabor特征张量的模型(SGFTM),以客观评估失真SCV的感知质量。这是基于以下观察结果:3D-Gabor滤波器能够很好地刺激人类视觉系统(HVS)在感知视频时的视觉功能,对SCV中经常出现的边缘和运动信息更为敏感。具体而言,所提出的SGFTM利用3D-Gabor滤波器分别从参考SCV和失真SCV中提取时空Gabor特征张量,然后测量它们的相似度,随后通过开发的时空特征张量池化策略将它们组合在一起,以获得最终的SGFTM分数。在SCVD上的实验结果表明,所提出的SGFTM在SCV质量的主观感知上具有高度一致性,并且始终优于多个经典和最新的图像/视频质量评估模型。