Lu Xiaoman, Lu Ran, Zhao Wenhao, Ma Erbin
Department of Mathematics, College of Science, Northeastern University, Shenyang, Liaoning, China.
Front Neurorobot. 2023 Jan 12;16:1111621. doi: 10.3389/fnbot.2022.1111621. eCollection 2022.
Big data facial image is an important identity information for people. However, facial image inpainting using existing deep learning methods has some problems such as insufficient feature mining and incomplete semantic expression, leading to output image artifacts or fuzzy textures. Therefore, it is of practical significance to study how to effectively restore an incomplete facial image. In this study, we proposed a facial image inpainting method using a multistage generative adversarial network (GAN) and the global attention mechanism (GAM). For the overall network structure, we used the GAN as the main body, then we established skip connections to optimize the network structure, and used the encoder-decoder structure to better capture the semantic information of the missing part of a facial image. A local refinement network has been proposed to enhance the local restoration effect and to weaken the influence of unsatisfactory results. Moreover, GAM is added to the network to magnify the interactive features of the global dimension while reducing information dispersion, which is more suitable for restoring human facial information. Comparative experiments on CelebA and CelebA-HQ big datasets show that the proposed method generates realistic inpainting results in both regular and irregular masks and achieves peak signal-to-noise ratio (PSNR) and structural similarity (SSIM), as well as other evaluation indicators that illustrate the performance and efficiency of the proposed model.
大数据面部图像是人们重要的身份信息。然而,使用现有深度学习方法进行面部图像修复存在一些问题,如特征挖掘不足和语义表达不完整,导致输出图像出现伪影或纹理模糊。因此,研究如何有效恢复不完整的面部图像具有实际意义。在本研究中,我们提出了一种使用多阶段生成对抗网络(GAN)和全局注意力机制(GAM)的面部图像修复方法。对于整体网络结构,我们以GAN为主体,然后建立跳跃连接来优化网络结构,并使用编码器-解码器结构来更好地捕捉面部图像缺失部分的语义信息。提出了一种局部细化网络来增强局部修复效果并减弱不理想结果的影响。此外,在网络中添加GAM以放大全局维度的交互特征,同时减少信息分散,这更适合恢复人类面部信息。在CelebA和CelebA-HQ大数据集上的对比实验表明,该方法在规则和不规则掩码下均能生成逼真的修复结果,并实现了峰值信噪比(PSNR)和结构相似性(SSIM),以及其他说明所提模型性能和效率的评估指标。