Cross Task Modality Alignment Network for Sketch Face Recognition.

Authors

Guo Yanan, Cao Lin, Du Kangning

Affiliations

Key Laboratory of Information and Communication Systems, Ministry of Information Industry, Beijing Information Science and Technology University, Beijing, China.

Key Laboratory of the Ministry of Education for Optoelectronic Measurement Technology and Instrument, Beijing Information Science and Technology University, Beijing, China.

Publication information

Front Neurorobot. 2022 Jun 10;16:823484. doi: 10.3389/fnbot.2022.823484. eCollection 2022.

DOI: 10.3389/fnbot.2022.823484
PMID: 35756158
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9231523/
Abstract

Sketch face recognition is the task of matching facial images across modalities, from sketch to photo, and is widely applied in criminal investigation. Existing works aim to bridge the cross-modality gap with inter-modality feature alignment approaches, but the small-sample problem has received much less attention, which limits performance. This paper proposes a Cross Task Modality Alignment Network (CTMAN) for sketch face recognition. To address the small-sample problem, a meta-learning episode training strategy is first introduced to mimic few-shot tasks. Based on this episode strategy, a two-stream network, termed modality alignment embedding learning, is used to capture more modality-specific and modality-sharable features, while two cross-task memory mechanisms collect sufficient negative features to further improve feature learning. Finally, a cross-task modality alignment loss is proposed to capture modality-related information from cross-task features for more effective training. Extensive experiments show that CTMAN significantly outperforms state-of-the-art methods on the UoM-SGFSv2 set A, UoM-SGFSv2 set B, CUFSF, and PRIP-VSGC datasets.
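
The abstract names an episode training strategy and cross-task memory of negative features but gives no implementation details. The following is only a minimal sketch of how those two ideas are commonly set up, not the authors' code. It assumes each identity has one photo and one sketch, that photos act as the gallery and sketches as the probes, and that the memory is a fixed-size FIFO queue. The names `sample_episode`, `NegativeMemory`, `n_way`, and `capacity` are hypothetical.

```python
import random
from collections import deque


def sample_episode(identities, n_way, rng):
    """Build one few-shot task (episode) from a small training set.

    identities: dict mapping identity id -> {"photo": ..., "sketch": ...}
    Returns (gallery, probes): photos as the gallery, sketches as probes.
    This split is an assumption, not taken from the paper.
    """
    chosen = rng.sample(sorted(identities), n_way)
    gallery = [(pid, identities[pid]["photo"]) for pid in chosen]
    probes = [(pid, identities[pid]["sketch"]) for pid in chosen]
    rng.shuffle(probes)  # probe order should not reveal the gallery order
    return gallery, probes


class NegativeMemory:
    """Fixed-size FIFO store of (identity, feature) pairs from past episodes.

    Hypothetical stand-in for a cross-task memory: it lets the current
    episode see negatives that were produced in earlier episodes.
    """

    def __init__(self, capacity):
        self.items = deque(maxlen=capacity)  # oldest entries drop out first

    def add(self, episode_items):
        self.items.extend(episode_items)

    def negatives_for(self, pid):
        # Every stored feature with a different identity is a negative.
        return [feat for other, feat in self.items if other != pid]


if __name__ == "__main__":
    rng = random.Random(0)
    data = {f"id{i}": {"photo": f"photo{i}", "sketch": f"sketch{i}"}
            for i in range(10)}
    memory = NegativeMemory(capacity=16)
    for _ in range(3):
        gallery, probes = sample_episode(data, n_way=4, rng=rng)
        memory.add(gallery)
    print(len(memory.items), "features stored across episodes")
```

In a real pipeline the stored items would be embedding vectors from the two-stream network rather than strings, and the loss would compare each probe against both the episode gallery and the stored negatives.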
