• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GANimation:基于单张图像的解剖学感知面部动画

GANimation: Anatomically-aware Facial Animation from a Single Image.

作者信息

Pumarola Albert, Agudo Antonio, Martinez Aleix M, Sanfeliu Alberto, Moreno-Noguer Francesc

机构信息

Institut de Robòtica i Informàtica Industrial, CSIC-UPC, 08028, Barcelona, Spain.

The Ohio State University, Columbus, OH 43210, USA.

出版信息

Comput Vis ECCV. 2018 Sep;11214:835-851. doi: 10.1007/978-3-030-01249-6_50. Epub 2018 Oct 6.

DOI:10.1007/978-3-030-01249-6_50
PMID:30465044
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6240441/
Abstract

Recent advances in Generative Adversarial Networks (GANs) have shown impressive results for task of facial expression synthesis. The most successful architecture is StarGAN [4], that conditions GANs' generation process with images of a specific domain, namely a set of images of persons sharing the same expression. While effective, this approach can only generate a discrete number of expressions, determined by the content of the dataset. To address this limitation, in this paper, we introduce a novel GAN conditioning scheme based on Action Units (AU) annotations, which describes in a continuous manifold the anatomical facial movements defining a human expression. Our approach allows controlling the magnitude of activation of each AU and combine several of them. Additionally, we propose a fully unsupervised strategy to train the model, that only requires images annotated with their activated AUs, and exploit attention mechanisms that make our network robust to changing backgrounds and lighting conditions. Extensive evaluation show that our approach goes beyond competing conditional generators both in the capability to synthesize a much wider range of expressions ruled by anatomically feasible muscle movements, as in the capacity of dealing with images in the wild.

摘要

生成对抗网络(GAN)的最新进展在面部表情合成任务中取得了令人瞩目的成果。最成功的架构是StarGAN [4],它通过特定领域的图像(即一组具有相同表情的人物图像)来调节GAN的生成过程。虽然这种方法有效,但它只能生成由数据集内容决定的离散数量的表情。为了解决这一局限性,在本文中,我们引入了一种基于动作单元(AU)注释的新型GAN调节方案,该方案在连续流形中描述了定义人类表情的解剖学面部运动。我们的方法允许控制每个AU的激活幅度并将其中几个结合起来。此外,我们提出了一种完全无监督的策略来训练模型,该策略仅需要用激活的AU进行注释的图像,并利用注意力机制使我们的网络对变化的背景和光照条件具有鲁棒性。广泛的评估表明,我们的方法在合成由解剖学上可行的肌肉运动所支配的更广泛表情的能力方面,以及在处理自然图像的能力方面,都超越了竞争的条件生成器。

相似文献

1
GANimation: Anatomically-aware Facial Animation from a Single Image.GANimation:基于单张图像的解剖学感知面部动画
Comput Vis ECCV. 2018 Sep;11214:835-851. doi: 10.1007/978-3-030-01249-6_50. Epub 2018 Oct 6.
2
Exploring Explicit Domain Supervision for Latent Space Disentanglement in Unpaired Image-to-Image Translation.探索用于无配对图像到图像翻译中潜在空间解缠的显式域监督
IEEE Trans Pattern Anal Mach Intell. 2021 Apr;43(4):1254-1266. doi: 10.1109/TPAMI.2019.2950198. Epub 2021 Mar 5.
3
F³A-GAN: Facial Flow for Face Animation With Generative Adversarial Networks.F³A-GAN:基于生成对抗网络的人脸动画的面部流
IEEE Trans Image Process. 2021;30:8658-8670. doi: 10.1109/TIP.2021.3112059. Epub 2021 Oct 21.
4
Dynamic Facial Expression Generation on Hilbert Hypersphere With Conditional Wasserstein Generative Adversarial Nets.基于条件 Wasserstein 生成对抗网络的 Hilbert 超球面上的动态面部表情生成。
IEEE Trans Pattern Anal Mach Intell. 2022 Feb;44(2):848-863. doi: 10.1109/TPAMI.2020.3002500. Epub 2022 Jan 7.
5
StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks.StackGAN++:基于堆叠生成对抗网络的逼真图像合成
IEEE Trans Pattern Anal Mach Intell. 2019 Aug;41(8):1947-1962. doi: 10.1109/TPAMI.2018.2856256. Epub 2018 Jul 16.
6
SuperstarGAN: Generative adversarial networks for image-to-image translation in large-scale domains.《SuperstarGAN:大规模域图像到图像转换的生成对抗网络》。
Neural Netw. 2023 May;162:330-339. doi: 10.1016/j.neunet.2023.02.042. Epub 2023 Mar 7.
7
Weakly Supervised Facial Action Unit Recognition With Domain Knowledge.基于领域知识的弱监督人脸动作单元识别
IEEE Trans Cybern. 2018 Nov;48(11):3265-3276. doi: 10.1109/TCYB.2018.2868194. Epub 2018 Sep 26.
8
GD-StarGAN: Multi-domain image-to-image translation in garment design.GD-StarGAN:服装设计中的多领域图像到图像转换。
PLoS One. 2020 Apr 21;15(4):e0231719. doi: 10.1371/journal.pone.0231719. eCollection 2020.
9
Extracting Semantic Knowledge From GANs With Unsupervised Learning.通过无监督学习从生成对抗网络中提取语义知识。
IEEE Trans Pattern Anal Mach Intell. 2023 Aug;45(8):9654-9668. doi: 10.1109/TPAMI.2023.3262140. Epub 2023 Jun 30.
10
CariGAN: Caricature generation through weakly paired adversarial learning.CariGAN:通过弱配对对抗学习进行漫画生成。
Neural Netw. 2020 Dec;132:66-74. doi: 10.1016/j.neunet.2020.08.011. Epub 2020 Aug 20.

引用本文的文献

1
Development of an Interactive Digital Human with Context-Sensitive Facial Expressions.具有情境敏感面部表情的交互式数字人的开发。
Sensors (Basel). 2025 Aug 18;25(16):5117. doi: 10.3390/s25165117.
2
A Lightweight Dual-Stream Network with an Adaptive Strategy for Efficient Micro-Expression Recognition.一种具有自适应策略的轻量级双流网络,用于高效微表情识别。
Sensors (Basel). 2025 May 1;25(9):2866. doi: 10.3390/s25092866.
3
Facial expression recognition through muscle synergies and estimation of facial keypoint displacements through a skin-musculoskeletal model using facial sEMG signals.通过肌肉协同作用进行面部表情识别,并使用面部表面肌电信号通过皮肤-肌肉骨骼模型估计面部关键点位移。
Front Bioeng Biotechnol. 2025 Feb 12;13:1490919. doi: 10.3389/fbioe.2025.1490919. eCollection 2025.
4
Facial expression morphing: enhancing visual fidelity and preserving facial details in CycleGAN-based expression synthesis.面部表情变形:在基于循环生成对抗网络(CycleGAN)的表情合成中增强视觉保真度并保留面部细节
PeerJ Comput Sci. 2024 Oct 25;10:e2438. doi: 10.7717/peerj-cs.2438. eCollection 2024.
5
Additive effects of emotional expression and stimulus size on the perception of genuine and artificial facial expressions: an ERP study.情绪表达和刺激大小对面部真实和人工表情感知的相加效应:一项 ERP 研究。
Sci Rep. 2024 Mar 6;14(1):5574. doi: 10.1038/s41598-024-55678-2.
6
A Chinese Face Dataset with Dynamic Expressions and Diverse Ages Synthesized by Deep Learning.基于深度学习的具有动态表情和多样年龄的中文人脸数据集。
Sci Data. 2023 Dec 7;10(1):878. doi: 10.1038/s41597-023-02701-2.
7
Object-stable unsupervised dual contrastive learning image-to-image translation with query-selected attention and convolutional block attention module.基于查询选择注意力和卷积块注意力模块的目标稳定无监督双对比对比学习图像到图像翻译。
PLoS One. 2023 Nov 6;18(11):e0293885. doi: 10.1371/journal.pone.0293885. eCollection 2023.
8
Advancing Naturalistic Affective Science with Deep Learning.利用深度学习推动自然主义情感科学发展。
Affect Sci. 2023 Aug 25;4(3):550-562. doi: 10.1007/s42761-023-00215-z. eCollection 2023 Sep.
9
Combining GAN with reverse correlation to construct personalized facial expressions.结合生成对抗网络和反向相关技术构建个性化面部表情。
PLoS One. 2023 Aug 25;18(8):e0290612. doi: 10.1371/journal.pone.0290612. eCollection 2023.
10
Domain-Scalable Unpaired Image Translation via Latent Space Anchoring.通过潜在空间锚定实现域可扩展的无配对图像翻译
IEEE Trans Pattern Anal Mach Intell. 2023 Oct;45(10):11707-11719. doi: 10.1109/TPAMI.2023.3287774. Epub 2023 Sep 5.

本文引用的文献

1
StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks.StackGAN++:基于堆叠生成对抗网络的逼真图像合成
IEEE Trans Pattern Anal Mach Intell. 2019 Aug;41(8):1947-1962. doi: 10.1109/TPAMI.2018.2856256. Epub 2018 Jul 16.
2
Compound facial expressions of emotion.复合情绪表情。
Proc Natl Acad Sci U S A. 2014 Apr 15;111(15):E1454-62. doi: 10.1073/pnas.1322355111. Epub 2014 Mar 31.