IEEE Trans Image Process. 2016 Nov;25(11):5063-5076. doi: 10.1109/TIP.2016.2598493.
This paper presents a study of bilevel image similarity, including new objective metrics intended to quantify similarity consistent with human perception, and a subjective experiment to obtain ground truth for judging the performance of the objective similarity metrics. The focus is on scenic bilevel images, which are complex, natural or hand-drawn images, such as landscapes or portraits. The ground truth was obtained from ratings by 77 subjects of 44 distorted versions of seven scenic images, using a modified version of the SDSCE testing methodology. Based on hypotheses about human perception of bilevel images, several new metrics are proposed that outperform existing ones in the sense of attaining significantly higher Pearson and Spearman-rank correlation coefficients with respect to the ground truth from the subjective experiment. The new metrics include adjusted percentage error, bilevel local direction, and connected components comparison. Combinations of these metrics are also proposed, which exploit their complementarity to attain even better performance. These metrics and the ground truth are then used to assess the relative severity of various kinds of distortion and the performance of several lossy bilevel compression methods.
本文研究了双层图像相似度,包括旨在量化与人类感知一致的相似性的新客观指标,以及一个主观实验,以获得判断客观相似性指标性能的基准。重点是风景双层图像,它们是复杂的、自然的或手绘的图像,如风景或肖像。基准是通过 77 名受试者对 7 个风景图像的 44 个变形版本的评分获得的,使用了改进后的 SDSCE 测试方法。基于人类对双层图像感知的假设,提出了几个新的指标,这些指标在与主观实验中的基准相比时,在达到更高的皮尔逊和斯皮尔曼等级相关系数方面表现优于现有的指标。新指标包括调整后的百分比误差、双层局部方向和连通分量比较。还提出了这些指标的组合,它们利用其互补性来获得更好的性能。然后使用这些指标和基准来评估各种失真的相对严重程度和几种有损双层压缩方法的性能。