Univ. Grenoble Alpes, CNRS, Grenoble INP (Institute of Engineering Univ. Grenoble Alpes), TIMA, 38000 Grenoble, France.
Sensors (Basel). 2021 Dec 28;22(1):175. doi: 10.3390/s22010175.
This paper presents the construction of a new objective method for estimation of visual perceiving quality. The proposal provides an assessment of image quality without the need for a reference image or a specific distortion assumption. Two main processes have been used to build our models: The first one uses deep learning with a convolutional neural network process, without any preprocessing. The second objective visual quality is computed by pooling several image features extracted from different concepts: the natural scene statistic in the spatial domain, the gradient magnitude, the Laplacian of Gaussian, as well as the spectral and spatial entropies. The features extracted from the image file are used as the input of machine learning techniques to build the models that are used to estimate the visual quality level of any image. For the machine learning training phase, two main processes are proposed: The first proposed process consists of a direct learning using all the selected features in only one training phase, named direct learning blind visual quality assessment DLBQA. The second process is an indirect learning and consists of two training phases, named indirect learning blind visual quality assessment ILBQA. This second process includes an additional phase of construction of intermediary metrics used for the construction of the prediction model. The produced models are evaluated on many benchmarks image databases as TID2013, LIVE, and LIVE in the wild image quality challenge. The experimental results demonstrate that the proposed models produce the best visual perception quality prediction, compared to the state-of-the-art models. The proposed models have been implemented on an FPGA platform to demonstrate the feasibility of integrating the proposed solution on an image sensor.
本文提出了一种新的视觉感知质量客观估计方法的构建。该建议提供了一种无需参考图像或特定失真假设的图像质量评估。我们的模型构建主要使用了两个过程:第一个过程使用了带有卷积神经网络的深度学习,无需任何预处理。第二个客观视觉质量是通过从不同概念中提取的多个图像特征的池化来计算的:空间域中的自然场景统计、梯度幅度、高斯拉普拉斯、以及光谱和空间熵。从图像文件中提取的特征被用作机器学习技术的输入,以构建用于估计任何图像视觉质量水平的模型。对于机器学习训练阶段,提出了两个主要过程:第一个提出的过程包括仅在一个训练阶段中使用所有选定特征的直接学习,称为直接学习盲视觉质量评估 DLBQA。第二个过程是间接学习,包括两个训练阶段,称为间接学习盲视觉质量评估 ILBQA。第二个过程包括用于构建预测模型的中间度量的构建的附加阶段。所提出的模型在许多基准图像数据库(如 TID2013、LIVE 和 LIVE 野外图像质量挑战赛)上进行了评估。实验结果表明,与最先进的模型相比,所提出的模型可以产生最佳的视觉感知质量预测。所提出的模型已经在 FPGA 平台上实现,以证明在图像传感器上集成所提出的解决方案的可行性。