IEEE Trans Image Process. 2015 Aug;24(8):2466-77. doi: 10.1109/TIP.2015.2422578. Epub 2015 Apr 13.
Face sketch synthesis has wide applications in digital entertainment and law enforcement. Although there is much research on face sketch synthesis, most existing algorithms cannot handle some nonfacial factors, such as hair style, hairpins, and glasses if these factors are excluded in the training set. In addition, previous methods only work on well controlled conditions and fail on images with different backgrounds and sizes as the training set. To this end, this paper presents a novel method that combines both the similarity between different image patches and prior knowledge to synthesize face sketches. Given training photo-sketch pairs, the proposed method learns a photo patch feature dictionary from the training photo patches and replaces the photo patches with their sparse coefficients during the searching process. For a test photo patch, we first obtain its sparse coefficient via the learnt dictionary and then search its nearest neighbors (candidate patches) in the whole training photo patches with sparse coefficients. After purifying the nearest neighbors with prior knowledge, the final sketch corresponding to the test photo can be obtained by Bayesian inference. The contributions of this paper are as follows: 1) we relax the nearest neighbor search area from local region to the whole image without too much time consuming and 2) our method can produce nonfacial factors that are not contained in the training set and is robust against image backgrounds and can even ignore the alignment and image size aspects of test photos. Our experimental results show that the proposed method outperforms several state-of-the-arts in terms of perceptual and objective metrics.
人脸素描合成在数字娱乐和执法领域有广泛的应用。尽管有很多关于人脸素描合成的研究,但大多数现有的算法无法处理一些非人脸因素,例如头发样式、发夹和眼镜,如果这些因素不在训练集中。此外,以前的方法仅在控制良好的条件下有效,而在具有不同背景和大小的图像上则会失效,因为这些图像不在训练集中。为此,本文提出了一种新的方法,该方法结合了不同图像块之间的相似性和先验知识来合成人脸素描。给定训练照片-素描对,所提出的方法从训练照片中学习一个照片块特征字典,并在搜索过程中用稀疏系数替换照片块。对于测试照片块,我们首先通过学习到的字典获得其稀疏系数,然后在整个训练照片块的稀疏系数中搜索其最近邻(候选块)。在用先验知识对最近邻进行净化后,通过贝叶斯推理可以获得与测试照片相对应的最终素描。本文的贡献如下:1)我们放宽了最近邻搜索区域,从局部区域扩展到整个图像,而不会消耗太多时间;2)我们的方法可以生成训练集中未包含的非人脸因素,并且对图像背景具有鲁棒性,甚至可以忽略测试照片的对齐和图像大小方面。我们的实验结果表明,所提出的方法在感知和客观指标方面均优于几种现有技术。