Suppr超能文献

基于条件 Wasserstein 生成对抗网络的 Hilbert 超球面上的动态面部表情生成。

Dynamic Facial Expression Generation on Hilbert Hypersphere With Conditional Wasserstein Generative Adversarial Nets.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2022 Feb;44(2):848-863. doi: 10.1109/TPAMI.2020.3002500. Epub 2022 Jan 7.

Abstract

In this work, we propose a novel approach for generating videos of the six basic facial expressions given a neutral face image. We propose to exploit the face geometry by modeling the facial landmarks motion as curves encoded as points on a hypersphere. By proposing a conditional version of manifold-valued Wasserstein generative adversarial network (GAN) for motion generation on the hypersphere, we learn the distribution of facial expression dynamics of different classes, from which we synthesize new facial expression motions. The resulting motions can be transformed to sequences of landmarks and then to images sequences by editing the texture information using another conditional Generative Adversarial Network. To the best of our knowledge, this is the first work that explores manifold-valued representations with GAN to address the problem of dynamic facial expression generation. We evaluate our proposed approach both quantitatively and qualitatively on two public datasets; Oulu-CASIA and MUG Facial Expression. Our experimental results demonstrate the effectiveness of our approach in generating realistic videos with continuous motion, realistic appearance and identity preservation. We also show the efficiency of our framework for dynamic facial expressions generation, dynamic facial expression transfer and data augmentation for training improved emotion recognition models.

摘要

在这项工作中,我们提出了一种新的方法,用于在给定中性面部图像的情况下生成六种基本面部表情的视频。我们建议通过将面部地标运动建模为编码在超球上的点的曲线来利用面部几何形状。通过提出用于超球上运动生成的条件形式的流形值 Wasserstein 生成对抗网络 (GAN),我们学习了不同类别的面部表情动力学的分布,从中我们合成了新的面部表情运动。通过使用另一个条件生成对抗网络编辑纹理信息,可以将生成的运动转换为地标序列,然后转换为图像序列。据我们所知,这是首次探索使用 GAN 进行流形值表示以解决动态面部表情生成问题的工作。我们在两个公共数据集 Oulu-CASIA 和 MUG Facial Expression 上对我们提出的方法进行了定量和定性评估。我们的实验结果证明了我们的方法在生成具有连续运动、真实外观和身份保留的逼真视频方面的有效性。我们还展示了我们的框架在动态面部表情生成、动态面部表情转移和数据增强方面的效率,以训练改进的情感识别模型。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验