Ding Keyan, Ma Kede, Wang Shiqi, Simoncelli Eero P
Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong.
Howard Hughes Medical Institute, Center for Neural Science, and Courant Institute of Mathematical Sciences, New York University, New York, USA.
Int J Comput Vis. 2021;129(4):1258-1281. doi: 10.1007/s11263-020-01419-7. Epub 2021 Jan 21.
The performance of objective image quality assessment (IQA) models has been evaluated primarily by comparing model predictions to human quality judgments. Perceptual datasets gathered for this purpose have provided useful benchmarks for improving IQA methods, but their heavy use creates a risk of overfitting. Here, we perform a large-scale comparison of IQA models in terms of their use as objectives for the optimization of image processing algorithms. Specifically, we use eleven full-reference IQA models to train deep neural networks for four low-level vision tasks: denoising, deblurring, super-resolution, and compression. Subjective testing on the optimized images allows us to rank the competing models in terms of their perceptual performance, elucidate their relative advantages and disadvantages in these tasks, and propose a set of desirable properties for incorporation into future IQA models.
客观图像质量评估(IQA)模型的性能主要通过将模型预测与人类质量判断进行比较来评估。为此收集的感知数据集为改进IQA方法提供了有用的基准,但过度使用这些数据集会带来过拟合的风险。在这里,我们就IQA模型作为图像处理算法优化目标的用途进行了大规模比较。具体而言,我们使用11种全参考IQA模型来训练用于四项低级视觉任务的深度神经网络:去噪、去模糊、超分辨率和压缩。对优化后的图像进行主观测试,使我们能够根据竞争模型的感知性能对其进行排名,阐明它们在这些任务中的相对优缺点,并提出一组理想的属性,以便纳入未来的IQA模型。