• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于拼图求解和模态解耦的漫画视觉人脸识别

Caricature-visual face recognition based on jigsaw solving and modal decoupling.

作者信息

Yao Yajun, Wang Chongwen

机构信息

Zhengzhou Power Supply Company, State Grid Henan Electric Power Company, Zhengzhou, 450000, China.

School of Computer Science, Beijing Institute of Technology, Beijing, 100000, China.

出版信息

Sci Rep. 2024 Nov 18;14(1):28419. doi: 10.1038/s41598-024-80032-x.

DOI:10.1038/s41598-024-80032-x
PMID:39558061
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11574276/
Abstract

In recent years, face recognition technology has made significant progress in the field of real visual images, yet face recognition involving caricature-visual images remains a challenge due to the exaggerated and unrealistic features of caricature faces. To tackle this issue, this paper introduces the Caricature-visual Face Recognition Model Based on Jigsaw Solving and Modal Decoupling (CVF-JSM). The CVF-JSM consists of two modules: feature extraction and decoupling. The feature extraction module incorporates a graph attention network at the intermediate stage of the backbone network, which constructs and solves jigsaw puzzles to enable the network to extract shape features. The feature decoupling module features a three-branch structure that divides the features into modal and identity features. The real and caricature face recognition branches separate identity features for recognition through parameter sharing and orthogonality constraints. The feature common subspace alignment branch maps the anchor image, as well as the positive and negative sample images, into a common subspace to isolate identity features. Subsequently, by aligning the features, it further refines the effective identity features. The experimental results conducted on multiple datasets demonstrate that the CVF-JSM model outperforms existing technologies in the realm of caricature-visual face recognition.

摘要

近年来,人脸识别技术在真实视觉图像领域取得了重大进展,然而,由于漫画人脸具有夸张和不现实的特征,涉及漫画视觉图像的人脸识别仍然是一个挑战。为了解决这个问题,本文介绍了基于拼图求解和模态解耦的漫画视觉人脸识别模型(CVF-JSM)。CVF-JSM由两个模块组成:特征提取和解耦。特征提取模块在主干网络的中间阶段引入了一个图注意力网络,该网络构建并解决拼图问题,以使网络能够提取形状特征。特征解耦模块具有一个三分支结构,将特征分为模态特征和身份特征。真实和漫画人脸识别分支通过参数共享和正交约束分离身份特征以进行识别。特征公共子空间对齐分支将锚定图像以及正、负样本图像映射到一个公共子空间中,以分离身份特征。随后,通过对齐特征,进一步细化有效的身份特征。在多个数据集上进行的实验结果表明,CVF-JSM模型在漫画视觉人脸识别领域优于现有技术。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b0/11574276/b5f8e3368e5a/41598_2024_80032_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b0/11574276/23425fd3efc9/41598_2024_80032_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b0/11574276/e866898744f4/41598_2024_80032_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b0/11574276/fabc4bc62bd3/41598_2024_80032_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b0/11574276/4109dbd1c6f3/41598_2024_80032_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b0/11574276/b5f8e3368e5a/41598_2024_80032_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b0/11574276/23425fd3efc9/41598_2024_80032_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b0/11574276/e866898744f4/41598_2024_80032_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b0/11574276/fabc4bc62bd3/41598_2024_80032_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b0/11574276/4109dbd1c6f3/41598_2024_80032_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/14b0/11574276/b5f8e3368e5a/41598_2024_80032_Fig5_HTML.jpg

相似文献

1
Caricature-visual face recognition based on jigsaw solving and modal decoupling.基于拼图求解和模态解耦的漫画视觉人脸识别
Sci Rep. 2024 Nov 18;14(1):28419. doi: 10.1038/s41598-024-80032-x.
2
Graph Jigsaw Learning for Cartoon Face Recognition.用于卡通人脸识别的图形拼图学习
IEEE Trans Image Process. 2022;31:3961-3972. doi: 10.1109/TIP.2022.3177952. Epub 2022 Jun 9.
3
Caricature and face recognition.漫画与面部识别。
Mem Cognit. 1992 Jul;20(4):433-40. doi: 10.3758/bf03210927.
4
Style attention based global-local aware GAN for personalized facial caricature generation.基于风格注意力的全局-局部感知生成对抗网络用于个性化面部漫画生成
Front Neurosci. 2023 Mar 7;17:1136416. doi: 10.3389/fnins.2023.1136416. eCollection 2023.
5
JigsawGAN: Auxiliary Learning for Solving Jigsaw Puzzles With Generative Adversarial Networks.拼图生成对抗网络(JigsawGAN):利用生成对抗网络进行拼图求解的辅助学习
IEEE Trans Image Process. 2022;31:513-524. doi: 10.1109/TIP.2021.3120052. Epub 2021 Dec 16.
6
3D-CariGAN: An End-to-End Solution to 3D Caricature Generation From Normal Face Photos.3D-CariGAN:一种从正常人脸照片生成 3D 卡通人像的端到端解决方案。
IEEE Trans Vis Comput Graph. 2023 Apr;29(4):2203-2210. doi: 10.1109/TVCG.2021.3126659. Epub 2023 Feb 28.
7
CAST: Learning Both Geometric and Texture Style Transfers for Effective Caricature Generation.CAST:学习几何和纹理风格迁移以实现有效的漫画生成
IEEE Trans Image Process. 2022;31:3347-3358. doi: 10.1109/TIP.2022.3154238. Epub 2022 May 9.
8
Caricature generalization benefits for faces learned with enhanced idiosyncratic shape or texture.通过增强独特形状或纹理来学习面部的漫画式概括优势。
Cogn Affect Behav Neurosci. 2017 Feb;17(1):185-197. doi: 10.3758/s13415-016-0471-y.
9
Adaptation may cause some of the face caricature effect.适应可能会导致一些面部漫画效应。
Perception. 2011;40(3):317-22. doi: 10.1068/p6865.
10
Hierarchical Attention-Based Multimodal Fusion Network for Video Emotion Recognition.基于分层注意力的多模态融合网络的视频情绪识别。
Comput Intell Neurosci. 2021 Sep 25;2021:5585041. doi: 10.1155/2021/5585041. eCollection 2021.

本文引用的文献

1
Graph Jigsaw Learning for Cartoon Face Recognition.用于卡通人脸识别的图形拼图学习
IEEE Trans Image Process. 2022;31:3961-3972. doi: 10.1109/TIP.2022.3177952. Epub 2022 Jun 9.
2
Heterogeneous Face Interpretable Disentangled Representation for Joint Face Recognition and Synthesis.用于联合人脸识别和合成的异构人脸可解释解缠表示。
IEEE Trans Neural Netw Learn Syst. 2022 Oct;33(10):5611-5625. doi: 10.1109/TNNLS.2021.3071119. Epub 2022 Oct 5.
3
DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition.DVG-Face:用于异构人脸识别的双变分生成
IEEE Trans Pattern Anal Mach Intell. 2022 Jun;44(6):2938-2952. doi: 10.1109/TPAMI.2021.3052549. Epub 2022 May 5.
4
Wasserstein CNN: Learning Invariant Features for NIR-VIS Face Recognition. Wasserstein CNN:用于近红外-可见光人脸识别的不变特征学习。
IEEE Trans Pattern Anal Mach Intell. 2019 Jul;41(7):1761-1773. doi: 10.1109/TPAMI.2018.2842770. Epub 2018 Jun 1.