School of Engineering, The University of Waikato, Hamilton, New Zealand.
Department of Television and Sound Broadcasting, Moscow Technical University of Communications and Informatics, Moscow, Russian Federation.
PLoS One. 2024 May 30;19(5):e0303987. doi: 10.1371/journal.pone.0303987. eCollection 2024.
Digital video incurs many distortions during processing, compression, storage, and transmission, which can reduce perceived video quality. Developing adaptive video transmission methods that provide increased bandwidth and reduced storage space while preserving visual quality requires quality metrics that accurately describe how people perceive distortion. A severe problem for developing new video quality metrics is the limited data on how the early human visual system simultaneously processes spatial and temporal information. The problem is exacerbated by the fact that the few data collected in the middle of the last century do not consider current display equipment and are subject to medical intervention during collection, which does not guarantee a proper description of the conditions under which media content is currently consumed. In this paper, the 27840 thresholds of the visibility of spatio-temporal sinusoidal variations necessary to determine the artefacts that a human perceives were measured by a new method using different spatial sizes and temporal modulation rates. A multidimensional model of human contrast sensitivity in modern conditions of video content presentation is proposed based on new large-scale data obtained during the experiment. We demonstrate that the presented visibility model has a distinct advantage in predicting subjective video quality by testing with video quality metrics and including our and other visibility models against three publicly available video datasets.
数字视频在处理、压缩、存储和传输过程中会产生许多失真,这会降低视频的感知质量。开发自适应视频传输方法,在提供增加的带宽和减少存储空间的同时保持视觉质量,需要能够准确描述人们如何感知失真的质量指标。为开发新的视频质量指标而面临的一个严重问题是,关于早期人类视觉系统如何同时处理空间和时间信息的数据有限。上个世纪中期收集的数据很少,而且这些数据没有考虑到当前的显示设备,并且在收集过程中受到医学干预,这不能保证对当前媒体内容消费的条件进行适当描述,这加剧了这个问题。在本文中,通过使用不同的空间大小和时间调制率的新方法,测量了确定人眼感知到的时空正弦波变化的可见性的 27840 个阈值。基于在实验过程中获得的新的大规模数据,提出了一种现代视频内容呈现条件下人类对比敏感度的多维模型。我们通过使用视频质量指标进行测试,并将我们的和其他可见性模型与三个公开可用的视频数据集进行比较,证明了所提出的可见性模型在预测主观视频质量方面具有明显的优势。