Cheng Xi
School of Health Caring Industry, Sichuan University of Arts and Science, Dazhou, Sichuan, 635000, China.
Heliyon. 2024 Aug 22;10(16):e36665. doi: 10.1016/j.heliyon.2024.e36665. eCollection 2024 Aug 30.
In the evolving landscape of deep learning technologies, the emergence of Deepfakes and synthetic media is becoming increasingly prominent within digital media production. This research addresses the limitations inherent in existing face image generation algorithms based on Generative Adversarial Networks (GAN), particularly the challenges of domain irrelevancy and inadequate facial detail representation. The study introduces an enhanced face image generation algorithm, aiming to refine the CycleGAN framework. The enhancement involves a two-fold strategy: firstly, the generator's architecture is refined through the integration of an attention mechanism and adaptive residual blocks, enabling the extraction of more nuanced facial features. Secondly, the discriminator's accuracy in distinguishing real from synthetic images is improved by incorporating a relative loss concept into the loss function. Additionally, this study presents a novel model training approach that incorporates age constraints, thereby mitigating the effects of age variations on the synthesized images. The effectiveness of the proposed algorithm is empirically validated through comparative analysis with existing methodologies, utilizing the CelebA dataset. The results demonstrate that the proposed algorithm significantly enhances the realism of generated face images, outperforming current methods in terms of Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index Measure (SSIM), while also achieving notable improvements in subjective visual quality. The implementation of this advanced method is anticipated to substantially elevate the efficiency and quality of digital media production, contributing positively to the broader field of digital media creation.
在深度学习技术不断发展的背景下,深度伪造和合成媒体在数字媒体制作中日益突出。本研究针对基于生成对抗网络(GAN)的现有面部图像生成算法所固有的局限性,特别是领域不相关性和面部细节表示不足的挑战。该研究引入了一种增强的面部图像生成算法,旨在改进循环生成对抗网络(CycleGAN)框架。这种增强涉及双重策略:首先,通过集成注意力机制和自适应残差块来改进生成器的架构,从而能够提取更细微的面部特征。其次,通过将相对损失概念纳入损失函数来提高判别器区分真实图像和合成图像的准确性。此外,本研究提出了一种结合年龄约束的新型模型训练方法,从而减轻年龄变化对合成图像的影响。通过使用CelebA数据集与现有方法进行对比分析,对所提算法的有效性进行了实证验证。结果表明,所提算法显著提高了生成面部图像的真实感,在峰值信噪比(PSNR)和结构相似性指数测量(SSIM)方面优于当前方法,同时在主观视觉质量上也有显著提升。预计这种先进方法的实施将大幅提高数字媒体制作的效率和质量,为数字媒体创作的更广泛领域做出积极贡献。