IEEE Trans Image Process. 2010 Nov;19(11):2861-73. doi: 10.1109/TIP.2010.2050625. Epub 2010 May 18.
This paper presents a new approach to single-image super-resolution, based on sparse signal representation. Research on image statistics suggests that image patches can be well-represented as a sparse linear combination of elements from an appropriately chosen over-complete dictionary. Inspired by this observation, we seek a sparse representation for each patch of the low-resolution input, and then use the coefficients of this representation to generate the high-resolution output. Theoretical results from compressed sensing suggest that under mild conditions, the sparse representation can be correctly recovered from the downsampled signals. By jointly training two dictionaries for the low- and high-resolution image patches, we can enforce the similarity of sparse representations between the low resolution and high resolution image patch pair with respect to their own dictionaries. Therefore, the sparse representation of a low resolution image patch can be applied with the high resolution image patch dictionary to generate a high resolution image patch. The learned dictionary pair is a more compact representation of the patch pairs, compared to previous approaches, which simply sample a large amount of image patch pairs, reducing the computational cost substantially. The effectiveness of such a sparsity prior is demonstrated for both general image super-resolution and the special case of face hallucination. In both cases, our algorithm generates high-resolution images that are competitive or even superior in quality to images produced by other similar SR methods. In addition, the local sparse modeling of our approach is naturally robust to noise, and therefore the proposed algorithm can handle super-resolution with noisy inputs in a more unified framework.
本文提出了一种基于稀疏信号表示的单幅图像超分辨率新方法。 对图像统计的研究表明,图像块可以很好地表示为从适当选择的过完备字典中选择的元素的稀疏线性组合。 受此观察的启发,我们为低分辨率输入的每个图像块寻求稀疏表示,然后使用该表示的系数生成高分辨率输出。 压缩感知的理论结果表明,在温和的条件下,可以从下采样信号中正确恢复稀疏表示。 通过联合训练用于低分辨率和高分辨率图像块的两个字典,我们可以强制低分辨率和高分辨率图像块对之间的稀疏表示相对于其自身字典相似。 因此,可以将低分辨率图像块的稀疏表示应用于高分辨率图像块字典以生成高分辨率图像块。 与之前的方法相比,学习到的字典对是对图像块对的更紧凑的表示,以前的方法只是简单地对大量的图像块对进行采样,从而大大降低了计算成本。 这种稀疏性先验的有效性在一般图像超分辨率和人脸幻觉的特殊情况下都得到了证明。 在这两种情况下,我们的算法生成的高分辨率图像在质量上与其他类似的 SR 方法生成的图像具有竞争力,甚至更优。 此外,我们方法的局部稀疏建模自然对噪声鲁棒,因此所提出的算法可以在更统一的框架中处理具有噪声输入的超分辨率。