Zhao Xiuzhi, Chen Wenting, Xie Weicheng, Shen Linlin
College of Artificial Intelligence, Zhejiang Industry & Trade Vocational College, Wenzhou, Zhejiang, China.
Department of Electrical Engineering, City University of Hong Kong, Kowloon, Hong Kong SAR, China.
Front Neurosci. 2023 Mar 7;17:1136416. doi: 10.3389/fnins.2023.1136416. eCollection 2023.
Caricature is an exaggerated pictorial representation of a person, which is widely used in entertainment and political media. Recently, GAN-based methods achieved automatic caricature generation through transferring caricature style and performing shape exaggeration simultaneously. However, the caricature synthesized by these methods cannot perfectly reflect the characteristics of the subject, whose shape exaggeration are not reasonable and requires facial landmarks of caricature. In addition, the existing methods always produce the bad cases in caricature style due to the simpleness of their style transfer method.
In this paper, we propose a Style Attention based Global-local Aware GAN to apply the characteristics of a subject to generate personalized caricature. To integrate the facial characteristics of a subject, we introduce a landmark-based warp controller for personalized shape exaggeration, which employs the facial landmarks as control points to warp image according to its facial features, without requirement of the facial landmarks of caricature. To fuse the facial feature with caricature style appropriately, we introduce a style-attention module, which adopts an attention mechanism, instead of the simple Adaptive Instance Normalization (AdaIN) for style transfer. To reduce the bad cases and increase the quality of generated caricatures, we propose a multi-scale discriminator to both globally and locally discriminate the synthesized and real caricature, which improves the whole structure and realistic details of the synthesized caricature.
Experimental results on two publicly available datasets, the WebCaricature and the CaVINet datasets, validate the effectiveness of our proposed method and suggest that our proposed method achieves better performance than the existing methods.
The caricatures generated by the proposed method can not only preserve the identity of input photo but also the characteristic shape exaggeration for each person, which are highly close to the real caricatures drawn by real artists. It indicates that our method can be adopted in the real application.
漫画是对人物的一种夸张的图像表现形式,广泛应用于娱乐和政治媒体中。最近,基于生成对抗网络(GAN)的方法通过同时转移漫画风格和进行形状夸张实现了自动漫画生成。然而,这些方法合成的漫画不能完美地反映主体的特征,其形状夸张不合理且需要漫画的面部标志。此外,由于现有方法的风格转移方法简单,总是会产生漫画风格不佳的情况。
在本文中,我们提出了一种基于风格注意力的全局-局部感知GAN,以应用主体的特征来生成个性化漫画。为了整合主体的面部特征,我们引入了一种基于地标的变形控制器用于个性化形状夸张,该控制器将面部标志作为控制点,根据其面部特征对图像进行变形,而无需漫画的面部标志。为了将面部特征与漫画风格适当融合,我们引入了一个风格注意力模块,该模块采用注意力机制,而不是简单的自适应实例归一化(AdaIN)进行风格转移。为了减少不良情况并提高生成漫画的质量,我们提出了一种多尺度判别器,用于全局和局部判别合成漫画和真实漫画,这改善了合成漫画的整体结构和逼真细节。
在两个公开可用的数据集WebCaricature和CaVINet数据集上的实验结果验证了我们提出的方法的有效性,并表明我们提出的方法比现有方法具有更好的性能。
所提出的方法生成的漫画不仅可以保留输入照片的身份,还可以保留每个人独特的形状夸张特征,与真实艺术家绘制的真实漫画非常接近。这表明我们的方法可以应用于实际应用中。