• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

面部表情变形:在基于循环生成对抗网络(CycleGAN)的表情合成中增强视觉保真度并保留面部细节

Facial expression morphing: enhancing visual fidelity and preserving facial details in CycleGAN-based expression synthesis.

作者信息

Sub-R-Pa Chayanon, Chen Rung-Ching, Fan Ming-Zhong

机构信息

Department of Information Management, Chaoyang University of Technology, Taichung, Taiwan.

出版信息

PeerJ Comput Sci. 2024 Oct 25;10:e2438. doi: 10.7717/peerj-cs.2438. eCollection 2024.

DOI:10.7717/peerj-cs.2438
PMID:39650358
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11623074/
Abstract

Recent advancements in facial expression synthesis using deep learning, particularly with Cycle-Consistent Adversarial Networks (CycleGAN), have led to impressive results. However, a critical challenge persists: the generated expressions often lack the sharpness and fine details of the original face, such as freckles, moles, or birthmarks. To address this issue, we introduce the Facial Expression Morphing (FEM) algorithm, a novel post-processing method designed to enhance the visual fidelity of CycleGAN-based outputs. The FEM method blends the input image with the generated expression, prioritizing the preservation of crucial facial details. We experimented with our method on the Radboud Faces Database (RafD) and evaluated employing the Fréchet Inception Distance (FID) standard benchmark for image-to-image translation and introducing a new metric, FSD (Facial Similarity Distance), to specifically measure the similarity between translated and real images. Our comprehensive analysis of CycleGAN, UNet Vision Transformer cycle-consistent GAN versions 1 (UVCGANv1) and 2 (UVCGANv2) reveals a substantial enhancement in image clarity and preservation of intricate details. The average FID score of 31.92 achieved by our models represents a remarkable 50% reduction compared to the previous state-of-the-art model's score of 63.82, showcasing the significant advancements made in this domain. This substantial enhancement in image quality is further supported by our proposed FSD metric, which shows a closer resemblance between FEM-processed images and the original faces.

摘要

近年来,利用深度学习进行面部表情合成取得了显著进展,特别是使用循环一致对抗网络(CycleGAN),取得了令人瞩目的成果。然而,一个关键挑战仍然存在:生成的表情往往缺乏原始面部的清晰度和精细细节,如雀斑、痣或胎记。为了解决这个问题,我们引入了面部表情变形(FEM)算法,这是一种新颖的后处理方法,旨在提高基于CycleGAN的输出的视觉保真度。FEM方法将输入图像与生成的表情融合,优先保留关键面部细节。我们在拉德堡德面部数据库(RafD)上对我们的方法进行了实验,并采用弗雷歇因距离(FID)标准基准进行图像到图像翻译评估,并引入了一种新的度量标准,即面部相似性距离(FSD),以专门测量翻译图像与真实图像之间的相似性。我们对CycleGAN、UNet视觉Transformer循环一致GAN版本1(UVCGANv1)和2(UVCGANv2)的综合分析表明,在图像清晰度和复杂细节保留方面有了显著提高。我们的模型获得的平均FID分数为31.92,与之前的最先进模型的分数63.82相比,显著降低了50%,展示了该领域取得的重大进展。我们提出的FSD度量标准进一步支持了图像质量的显著提高,该标准表明经FEM处理的图像与原始面部更相似。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/f58edde06f6e/peerj-cs-10-2438-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/ac5fb144779c/peerj-cs-10-2438-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/5ad4f30e7ad6/peerj-cs-10-2438-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/90111b04d4d5/peerj-cs-10-2438-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/e970567258d4/peerj-cs-10-2438-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/1441d6558bed/peerj-cs-10-2438-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/124ab698187d/peerj-cs-10-2438-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/a8ff67fdde63/peerj-cs-10-2438-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/fa8028dd1a00/peerj-cs-10-2438-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/b63b870e675b/peerj-cs-10-2438-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/7e335d7075dc/peerj-cs-10-2438-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/eeb12be1e11b/peerj-cs-10-2438-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/46d0759dc1ac/peerj-cs-10-2438-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/f58edde06f6e/peerj-cs-10-2438-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/ac5fb144779c/peerj-cs-10-2438-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/5ad4f30e7ad6/peerj-cs-10-2438-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/90111b04d4d5/peerj-cs-10-2438-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/e970567258d4/peerj-cs-10-2438-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/1441d6558bed/peerj-cs-10-2438-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/124ab698187d/peerj-cs-10-2438-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/a8ff67fdde63/peerj-cs-10-2438-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/fa8028dd1a00/peerj-cs-10-2438-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/b63b870e675b/peerj-cs-10-2438-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/7e335d7075dc/peerj-cs-10-2438-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/eeb12be1e11b/peerj-cs-10-2438-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/46d0759dc1ac/peerj-cs-10-2438-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b174/11623074/f58edde06f6e/peerj-cs-10-2438-g013.jpg

相似文献

1
Facial expression morphing: enhancing visual fidelity and preserving facial details in CycleGAN-based expression synthesis.面部表情变形:在基于循环生成对抗网络(CycleGAN)的表情合成中增强视觉保真度并保留面部细节
PeerJ Comput Sci. 2024 Oct 25;10:e2438. doi: 10.7717/peerj-cs.2438. eCollection 2024.
2
A Method based on Evolutionary Algorithms and Channel Attention Mechanism to Enhance Cycle Generative Adversarial Network Performance for Image Translation.基于进化算法和通道注意力机制的方法来提高用于图像翻译的循环生成对抗网络性能。
Int J Neural Syst. 2023 May;33(5):2350026. doi: 10.1142/S0129065723500260. Epub 2023 Apr 5.
3
Image Translation by Ad CycleGAN for COVID-19 X-Ray Images: A New Approach for Controllable GAN.基于 AdCycleGAN 的 COVID-19 射线图像翻译:一种新的可控 GAN 方法。
Sensors (Basel). 2022 Dec 8;22(24):9628. doi: 10.3390/s22249628.
4
Enhancing Ultrasound Image Quality Across Disease Domains: Application of Cycle-Consistent Generative Adversarial Network and Perceptual Loss.跨疾病领域提升超声图像质量:循环一致生成对抗网络与感知损失的应用
JMIR Biomed Eng. 2024 Dec 17;9:e58911. doi: 10.2196/58911.
5
Transformer CycleGAN with uncertainty estimation for CBCT based synthetic CT in adaptive radiotherapy.基于自适应放疗的锥形束 CT 合成 CT 的 Transformer CycleGAN 与不确定性估计。
Phys Med Biol. 2024 Jan 30;69(3). doi: 10.1088/1361-6560/ad1cfc.
6
Improvement of 2D cine image quality using 3D priors and cycle generative adversarial network for low field MRI-guided radiation therapy.利用三维先验信息和循环生成对抗网络改善低场 MRI 引导放射治疗的二维电影图像质量。
Med Phys. 2024 May;51(5):3495-3509. doi: 10.1002/mp.16860. Epub 2023 Dec 3.
7
IE-CycleGAN: improved cycle consistent adversarial network for unpaired PET image enhancement.IE-CycleGAN:用于非配对 PET 图像增强的改进的循环一致对抗网络。
Eur J Nucl Med Mol Imaging. 2024 Nov;51(13):3874-3887. doi: 10.1007/s00259-024-06823-6. Epub 2024 Jul 23.
8
Improving realism in abdominal ultrasound simulation combining a segmentation-guided loss and polar coordinates training.结合分割引导损失和极坐标训练提高腹部超声模拟的逼真度。
Med Phys. 2025 Jun;52(6):4540-4556. doi: 10.1002/mp.17801. Epub 2025 Mar 30.
9
Generative Adversarial Network Based Contrast Enhancement: Synthetic Contrast Brain Magnetic Resonance Imaging.基于生成对抗网络的对比度增强:合成对比脑磁共振成像
Acad Radiol. 2025 Apr;32(4):2220-2232. doi: 10.1016/j.acra.2024.11.021. Epub 2024 Dec 18.
10
Compensation cycle consistent generative adversarial networks (Comp-GAN) for synthetic CT generation from MR scans with truncated anatomy.基于截断解剖的磁共振成像到 CT 合成的补偿循环一致生成对抗网络(Comp-GAN)。
Med Phys. 2023 Jul;50(7):4399-4414. doi: 10.1002/mp.16246. Epub 2023 Feb 4.

本文引用的文献

1
Real-Time Facial Affective Computing on Mobile Devices.移动设备上的实时面部情感计算。
Sensors (Basel). 2020 Feb 6;20(3):870. doi: 10.3390/s20030870.
2
AttGAN: Facial Attribute Editing by Only Changing What You Want.AttGAN:仅通过改变你想要改变的内容来进行面部属性编辑。
IEEE Trans Image Process. 2019 Nov;28(11):5464-5478. doi: 10.1109/TIP.2019.2916751. Epub 2019 May 20.
3
GANimation: Anatomically-aware Facial Animation from a Single Image.GANimation:基于单张图像的解剖学感知面部动画
Comput Vis ECCV. 2018 Sep;11214:835-851. doi: 10.1007/978-3-030-01249-6_50. Epub 2018 Oct 6.
4
Reliable Crowdsourcing and Deep Locality-Preserving Learning for Unconstrained Facial Expression Recognition.无约束面部表情识别的可靠众包和深度保局学习。
IEEE Trans Image Process. 2019 Jan;28(1):356-370. doi: 10.1109/TIP.2018.2868382. Epub 2018 Sep 3.