Talebi Hossein, Milanfar Peyman
IEEE Trans Image Process. 2018 Apr 30. doi: 10.1109/TIP.2018.2831899.
Automatically learned quality assessment for images has recently become a hot topic due to its usefulness in a wide variety of applications such as evaluating image capture pipelines, storage techniques and sharing media. Despite the subjective nature of this problem, most existing methods only predict the mean opinion score provided by datasets such as AVA [1] and TID2013 [2]. Our approach differs from others in that we predict the distribution of human opinion scores using a convolutional neural network. Our architecture also has the advantage of being significantly simpler than other methods with comparable performance. Our proposed approach relies on the success (and retraining) of proven, state-of-the-art deep object recognition networks. Our resulting network can be used to not only score images reliably and with high correlation to human perception, but also to assist with adaptation and optimization of photo editing/enhancement algorithms in a photographic pipeline. All this is done without need for a "golden" reference image, consequently allowing for single-image, semantic- and perceptually-aware, no-reference quality assessment.
由于自动学习的图像质量评估在诸如评估图像采集管道、存储技术和共享媒体等各种应用中非常有用,最近它已成为一个热门话题。尽管这个问题具有主观性,但大多数现有方法仅预测由AVA [1]和TID2013 [2]等数据集提供的平均意见得分。我们的方法与其他方法不同之处在于,我们使用卷积神经网络预测人类意见得分的分布。我们的架构还具有比其他具有可比性能的方法显著更简单的优势。我们提出的方法依赖于经过验证的、最先进的深度目标识别网络的成功(和重新训练)。我们得到的网络不仅可以可靠地对图像进行评分,并且与人类感知具有高度相关性,还可以协助在摄影管道中对照片编辑/增强算法进行调整和优化。所有这些都无需“黄金”参考图像即可完成,从而实现单图像、语义和感知感知的无参考质量评估。