A convolutional neural network approach for objective video quality assessment.

Le Callet Patrick, Viard-Gaudin Christian, Barba Dominique

Institut de Recherche en Communication et Cybernétique de Nantes, University of Nantes, Nantes 44306, France.

IEEE Trans Neural Netw. 2006 Sep;17(5):1316-27. doi: 10.1109/TNN.2006.879766.

This paper describes an application of neural networks to objective measurement methods designed to automatically assess the perceived quality of digital videos. Such methods aim to emulate human judgment and to replace very complex and time-consuming subjective quality assessment. Several metrics have been proposed in the literature to tackle this issue. They are based on a general framework that combines different stages, each of them addressing complex problems. The ambition of this paper is not to present a globally perfect quality metric but rather to focus on an original way to use neural networks in such a framework, in the context of a reduced-reference (RR) quality metric. In particular, we point out the value of such a tool for combining features and pooling them in order to compute quality scores. The proposed approach solves some problems inherent to objective metrics that must predict subjective quality scores obtained using the single stimulus continuous quality evaluation (SSCQE) method. The latter has been adopted by the Video Quality Experts Group (VQEG) in its recently finalized reduced-reference and no-reference (RRNR-TV) test plan. The originality of this approach, compared with previous attempts to use neural networks for quality assessment, lies in the use of a convolutional neural network (CNN) that allows continuous-time scoring of the video. Objective features are extracted on a frame-by-frame basis from both the reference and the distorted sequences; they are derived from a perceptual-based representation and integrated along the temporal axis using a time-delay neural network (TDNN). Experiments conducted on different MPEG-2 videos, with bit rates ranging from 2 to 6 Mb/s, show the effectiveness of the proposed approach in obtaining a plausible model of temporal pooling from the human visual system (HVS) point of view. More specifically, a linear correlation of up to 0.92 between objective and subjective scores was obtained on a set of typical TV videos.
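The temporal integration step described in the abstract can be illustrated with a minimal sketch. It assumes NumPy; the feature dimension, window length, layer sizes, and random weights are hypothetical placeholders, not the architecture or parameters reported in the paper. A time-delay layer is written here as a one-dimensional convolution along the frame axis, and the Pearson coefficient is used as the linear correlation between two score series.

```python
import numpy as np


def tdnn_layer(x, weights, bias):
    """Time-delay layer: 1-D convolution along time, no padding.

    x:       (T, F_in) array, one feature vector per frame
    weights: (K, F_in, F_out) array, one weight matrix per delay tap
    bias:    (F_out,) array
    returns: (T - K + 1, F_out) array of tanh activations
    """
    K, _, f_out = weights.shape
    n_out = x.shape[0] - K + 1
    out = np.zeros((n_out, f_out))
    for k in range(K):
        # Tap k contributes the frame at offset k inside each sliding window.
        out += x[k:k + n_out] @ weights[k]
    return np.tanh(out + bias)


def pearson(a, b):
    """Pearson linear correlation coefficient between two 1-D series."""
    a = a - a.mean()
    b = b - b.mean()
    return float((a @ b) / np.sqrt((a @ a) * (b @ b)))


# Hypothetical sizes: 50 frames, 4 features per frame (assumed for illustration).
rng = np.random.default_rng(0)
T, F = 50, 4
features = rng.normal(size=(T, F))

# Untrained random weights; a real model would fit them to subjective scores.
w1, b1 = rng.normal(scale=0.1, size=(5, F, 8)), np.zeros(8)
w2, b2 = rng.normal(scale=0.1, size=(5, 8, 1)), np.zeros(1)

hidden = tdnn_layer(features, w1, b1)       # shape (46, 8)
scores = tdnn_layer(hidden, w2, b2)[:, 0]   # shape (42,), one score per window
```

Because the same weights slide along the time axis, the network emits one score per window position, which is what makes continuous scoring of a sequence possible. With trained weights, `pearson(scores, subjective_scores)` would give the kind of linear correlation figure quoted above; `subjective_scores` is a hypothetical array of time-aligned SSCQE ratings.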


Similar articles

A convolutional neural network approach for objective video quality assessment.

IEEE Trans Neural Netw. 2006 Sep;17(5):1316-27. doi: 10.1109/TNN.2006.879766.

Blind prediction of natural video quality.

IEEE Trans Image Process. 2014 Mar;23(3):1352-65. doi: 10.1109/TIP.2014.2299154.

Online kernel slow feature analysis for temporal video segmentation and tracking.

IEEE Trans Image Process. 2015 Oct;24(10):2955-70. doi: 10.1109/TIP.2015.2428052. Epub 2015 Apr 29.

Saliency-aware video compression.

IEEE Trans Image Process. 2014 Jan;23(1):19-33. doi: 10.1109/TIP.2013.2282897. Epub 2013 Sep 20.

A neural network-based novelty detector for image sequence analysis.

IEEE Trans Pattern Anal Mach Intell. 2006 Oct;28(10):1664-77. doi: 10.1109/TPAMI.2006.196.

Adaptive online performance evaluation of video trackers.

IEEE Trans Image Process. 2012 May;21(5):2812-23. doi: 10.1109/TIP.2011.2182520. Epub 2012 Jan 2.

Effective gaussian mixture learning for video background subtraction.

IEEE Trans Pattern Anal Mach Intell. 2005 May;27(5):827-32. doi: 10.1109/TPAMI.2005.102.

Salient motion features for video quality assessment.

IEEE Trans Image Process. 2011 Apr;20(4):948-58. doi: 10.1109/TIP.2010.2080279. Epub 2010 Sep 27.

Blind image quality assessment using a general regression neural network.

IEEE Trans Neural Netw. 2011 May;22(5):793-9. doi: 10.1109/TNN.2011.2120620. Epub 2011 Apr 11.

Full-frame video stabilization with motion inpainting.

IEEE Trans Pattern Anal Mach Intell. 2006 Jul;28(7):1150-63. doi: 10.1109/TPAMI.2006.141.

Cited by

Application of Object Detection Algorithms in Non-Destructive Testing of Pressure Equipment: A Review.

Sensors (Basel). 2024 Sep 13;24(18):5944. doi: 10.3390/s24185944.

Keratoconus detection using deep learning of colour-coded maps with anterior segment optical coherence tomography: a diagnostic accuracy study.

BMJ Open. 2019 Sep 27;9(9):e031313. doi: 10.1136/bmjopen-2019-031313.

Testing the ability of unmanned aerial systems and machine learning to map weeds at subfield scales: a test with the weed Alopecurus myosuroides (Huds).

Pest Manag Sci. 2019 Aug;75(8):2283-2294. doi: 10.1002/ps.5444. Epub 2019 May 21.

Intelligent Deep Models Based on Scalograms of Electrocardiogram Signals for Biometrics.

Sensors (Basel). 2019 Feb 22;19(4):935. doi: 10.3390/s19040935.

Objective Video Quality Assessment Based on Machine Learning for Underwater Scientific Applications.

Sensors (Basel). 2017 Mar 23;17(4):664. doi: 10.3390/s17040664.

Deep Adaptive Log-Demons: Diffeomorphic Image Registration with Very Large Deformations.

Comput Math Methods Med. 2015;2015:836202. doi: 10.1155/2015/836202. Epub 2015 May 18.
