IEEE Trans Image Process. 2018 Jan;27(1):220-235. doi: 10.1109/TIP.2017.2750419. Epub 2017 Sep 8.
Image and texture synthesis is a challenging task that has long been drawing attention in the fields of image processing, graphics, and machine learning. This problem consists of modeling the desired type of images, either through training examples or via a parametric modeling, and then generating images that belong to the same statistical origin. This paper addresses the image synthesis task, focusing on two specific families of images-handwritten digits and face images. This paper offers two main contributions. First, we suggest a simple and intuitive algorithm capable of generating such images in a unified way. The proposed approach taken is pyramidal, consisting of upscaling and refining the estimated image several times. For each upscaling stage, the algorithm randomly draws small patches from a patch database and merges these to form a coherent and novel image with high visual quality. The second contribution is a general framework for the evaluation of the generation performance, which combines three aspects: the likelihood, the originality, and the spread of the synthesized images. We assess the proposed synthesis scheme and show that the results are similar in nature, and yet different from the ones found in the training set, suggesting that true synthesis effect has been obtained.
图像和纹理合成是图像处理、图形和机器学习等领域中一个长期备受关注的具有挑战性的任务。这个问题包括通过训练示例或通过参数建模来对所需类型的图像进行建模,然后生成属于同一统计起源的图像。本文主要关注手写数字和人脸图像这两种特定类型的图像,针对图像合成任务展开讨论。本文有两个主要贡献。首先,我们提出了一种简单直观的算法,能够以统一的方式生成这些图像。所采用的方法是金字塔形的,包括多次对估计图像进行上采样和细化。对于每个上采样阶段,算法都会从一个补丁数据库中随机抽取小补丁,并将这些补丁合并成一个具有高质量视觉效果的连贯而新颖的图像。第二个贡献是一个用于评估生成性能的通用框架,它结合了三个方面:似然性、新颖性和合成图像的分布。我们评估了所提出的合成方案,并表明结果在本质上是相似的,但与训练集中的结果不同,这表明已经获得了真正的合成效果。