• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CMOS-GAN:用于跨模态人脸图像合成的半监督生成对抗模型

CMOS-GAN: Semi-Supervised Generative Adversarial Model for Cross-Modality Face Image Synthesis.

作者信息

Yu Shikang, Han Hu, Shan Shiguang, Chen Xilin

出版信息

IEEE Trans Image Process. 2023;32:144-158. doi: 10.1109/TIP.2022.3226413. Epub 2022 Dec 19.

DOI:10.1109/TIP.2022.3226413
PMID:37015478
Abstract

Cross-modality face image synthesis such as sketch-to-photo, NIR-to-RGB, and RGB-to-depth has wide applications in face recognition, face animation, and digital entertainment. Conventional cross-modality synthesis methods usually require paired training data, i.e., each subject has images of both modalities. However, paired data can be difficult to acquire, while unpaired data commonly exist. In this paper, we propose a novel semi-supervised cross-modality synthesis method (namely CMOS-GAN), which can leverage both paired and unpaired face images to learn a robust cross-modality synthesis model. Specifically, CMOS-GAN uses a generator of encoder-decoder architecture for new modality synthesis. We leverage pixel-wise loss, adversarial loss, classification loss, and face feature loss to exploit the information from both paired multi-modality face images and unpaired face images for model learning. In addition, since we expect the synthetic new modality can also be helpful for improving face recognition accuracy, we further use a modified triplet loss to retain the discriminative features of the subject in the synthetic modality. Experiments on three cross-modality face synthesis tasks (NIR-to-VIS, RGB-to-depth, and sketch-to-photo) show the effectiveness of the proposed approach compared with the state-of-the-art. In addition, we also collect a large-scale RGB-D dataset (VIPL-MumoFace-3K) for the RGB-to-depth synthesis task. We plan to open-source our code and VIPL-MumoFace-3K dataset to the community (https://github.com/skgyu/CMOS-GAN).

摘要

跨模态人脸图像合成,如草图到照片、近红外到RGB以及RGB到深度,在人脸识别、人脸动画和数字娱乐等领域有着广泛应用。传统的跨模态合成方法通常需要配对的训练数据,即每个受试者都有两种模态的图像。然而,配对数据可能难以获取,而未配对的数据却普遍存在。在本文中,我们提出了一种新颖的半监督跨模态合成方法(即CMOS-GAN),它可以利用配对和未配对的人脸图像来学习一个强大的跨模态合成模型。具体而言,CMOS-GAN使用编码器-解码器架构的生成器进行新模态合成。我们利用像素级损失、对抗损失、分类损失和人脸特征损失,从配对的多模态人脸图像和未配对的人脸图像中提取信息用于模型学习。此外,由于我们期望合成的新模态也有助于提高人脸识别准确率,我们进一步使用改进的三元组损失来保留合成模态中受试者的判别特征。在三个跨模态人脸合成任务(近红外到可见光、RGB到深度以及草图到照片)上的实验表明,与现有技术相比,所提出的方法是有效的。此外,我们还为RGB到深度合成任务收集了一个大规模的RGB-D数据集(VIPL-MumoFace-3K)。我们计划将我们的代码和VIPL-MumoFace-3K数据集开源给社区(https://github.com/skgyu/CMOS-GAN)。

相似文献

1
CMOS-GAN: Semi-Supervised Generative Adversarial Model for Cross-Modality Face Image Synthesis.CMOS-GAN:用于跨模态人脸图像合成的半监督生成对抗模型
IEEE Trans Image Process. 2023;32:144-158. doi: 10.1109/TIP.2022.3226413. Epub 2022 Dec 19.
2
Bi-Modality Medical Image Synthesis Using Semi-Supervised Sequential Generative Adversarial Networks.基于半监督序贯生成对抗网络的双模态医学图像合成。
IEEE J Biomed Health Inform. 2020 Mar;24(3):855-865. doi: 10.1109/JBHI.2019.2922986. Epub 2019 Jun 14.
3
A modality-collaborative convolution and transformer hybrid network for unpaired multi-modal medical image segmentation with limited annotations.一种用于具有有限标注的未配对多模态医学图像分割的模态协作卷积与Transformer混合网络。
Med Phys. 2023 Sep;50(9):5460-5478. doi: 10.1002/mp.16338. Epub 2023 Mar 15.
4
Semi-supervised segmentation of lesion from breast ultrasound images with attentional generative adversarial network.基于注意力生成对抗网络的乳腺超声图像病灶半监督分割。
Comput Methods Programs Biomed. 2020 Jun;189:105275. doi: 10.1016/j.cmpb.2019.105275. Epub 2019 Dec 12.
5
Common feature learning for brain tumor MRI synthesis by context-aware generative adversarial network.基于上下文感知生成对抗网络的脑肿瘤 MRI 合成的通用特征学习。
Med Image Anal. 2022 Jul;79:102472. doi: 10.1016/j.media.2022.102472. Epub 2022 May 4.
6
A medical image classification method based on self-regularized adversarial learning.基于自正则化对抗学习的医学图像分类方法。
Med Phys. 2024 Nov;51(11):8232-8246. doi: 10.1002/mp.17320. Epub 2024 Jul 30.
7
2D facial landmark localization method for multi-view face synthesis image using a two-pathway generative adversarial network approach.基于双通路生成对抗网络方法的多视角人脸合成图像的二维面部地标定位方法
PeerJ Comput Sci. 2022 Feb 16;8:e897. doi: 10.7717/peerj-cs.897. eCollection 2022.
8
Unpaired Deep Cross-Modality Synthesis with Fast Training.基于快速训练的非配对深度跨模态合成
Deep Learn Med Image Anal Multimodal Learn Clin Decis Support (2018). 2018 Sep;11045:155-164. doi: 10.1007/978-3-030-00889-5_18. Epub 2018 Sep 20.
9
Ea-GANs: Edge-Aware Generative Adversarial Networks for Cross-Modality MR Image Synthesis.Ea-GANs:用于跨模态磁共振图像合成的边缘感知生成对抗网络。
IEEE Trans Med Imaging. 2019 Jul;38(7):1750-1762. doi: 10.1109/TMI.2019.2895894. Epub 2019 Jan 29.
10
Toward Realistic Face Photo-Sketch Synthesis via Composition-Aided GANs.通过构图辅助的 GAN 实现逼真的人脸照片素描合成。
IEEE Trans Cybern. 2021 Sep;51(9):4350-4362. doi: 10.1109/TCYB.2020.2972944. Epub 2021 Sep 15.

引用本文的文献

1
Toward identity preserving in face sketch-photo synthesis using a hybrid CNN-Mamba framework.使用混合CNN-Mamba框架在面部素描-照片合成中实现身份保留。
Sci Rep. 2024 Sep 28;14(1):22495. doi: 10.1038/s41598-024-72066-y.