• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用混合CNN-Mamba框架在面部素描-照片合成中实现身份保留。

Toward identity preserving in face sketch-photo synthesis using a hybrid CNN-Mamba framework.

作者信息

Tang Duoxun, Jiang Xinhang, Wang Kunpeng, Guo Weichen, Zhang Jingyuan, Lin Ye, Pu Haibo

机构信息

College of Science, Sichuan Agricultural University, Ya'an, 625000, China.

College of Information Engineering, Sichuan Agricultural University, Ya'an, 625000, China.

出版信息

Sci Rep. 2024 Sep 28;14(1):22495. doi: 10.1038/s41598-024-72066-y.

DOI:10.1038/s41598-024-72066-y
PMID:39341858
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11438986/
Abstract

The synthesis of facial sketch-photo has important applications in practical life, such as crime investigation. Many convolutional neural networks (CNNs) based methods have been proposed to address this issue. However, due to the substantial modal differences between sketch and photo, the CNN's insensitivity to global information, and insufficient utilization of hierarchical features, synthesized photos struggle to balance both identity preservation and image quality. Recently, State Space Sequence Models (SSMs) have achieved exciting results in computer vision (CV) tasks. Inspired by SSMs, we design a hybrid CNN-SSM model called FaceMamba for the Face Sketch-Photo Synthesis (FSPS) task. It includes an original Face Vision Mamba Attention for modeling in latent space using SSM. Additionally, it incorporates a general auxiliary method called Attention Feature Injection that combines encoding features, decoding features, and external auxiliary features using attention mechanisms. FaceMamba combines Mamba's modeling ability for long-range dependencies with CNN's powerful local feature extraction ability, and utilizes hierarchical features at the appropriate position. Adequate experimental and evaluation results reveal that FaceMamba has strong competitiveness in FSPS task, achieving the best balance between identity preservation and image quality.

摘要

面部素描-照片合成在实际生活中有重要应用,比如犯罪调查。已经提出了许多基于卷积神经网络(CNN)的方法来解决这个问题。然而,由于素描和照片之间存在显著的模态差异、CNN对全局信息不敏感以及对分层特征利用不足,合成照片难以在身份保留和图像质量之间取得平衡。最近,状态空间序列模型(SSM)在计算机视觉(CV)任务中取得了令人兴奋的成果。受SSM启发,我们为面部素描-照片合成(FSPS)任务设计了一种名为FaceMamba的混合CNN-SSM模型。它包括一个原始的面部视觉曼巴注意力机制,用于在潜在空间中使用SSM进行建模。此外,它还结合了一种名为注意力特征注入的通用辅助方法,该方法使用注意力机制将编码特征、解码特征和外部辅助特征相结合。FaceMamba将曼巴对长程依赖的建模能力与CNN强大的局部特征提取能力相结合,并在适当位置利用分层特征。充分的实验和评估结果表明,FaceMamba在FSPS任务中具有很强的竞争力,在身份保留和图像质量之间实现了最佳平衡。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97d/11438986/ca796ddb111b/41598_2024_72066_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97d/11438986/0f2df2f5f6f8/41598_2024_72066_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97d/11438986/23987b765c3d/41598_2024_72066_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97d/11438986/3d67125ef0ad/41598_2024_72066_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97d/11438986/907d9e5f307c/41598_2024_72066_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97d/11438986/3ff899f1396d/41598_2024_72066_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97d/11438986/dc109c89b285/41598_2024_72066_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97d/11438986/deb6f7e815dd/41598_2024_72066_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97d/11438986/ca796ddb111b/41598_2024_72066_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97d/11438986/0f2df2f5f6f8/41598_2024_72066_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97d/11438986/23987b765c3d/41598_2024_72066_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97d/11438986/3d67125ef0ad/41598_2024_72066_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97d/11438986/907d9e5f307c/41598_2024_72066_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97d/11438986/3ff899f1396d/41598_2024_72066_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97d/11438986/dc109c89b285/41598_2024_72066_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97d/11438986/deb6f7e815dd/41598_2024_72066_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b97d/11438986/ca796ddb111b/41598_2024_72066_Fig8_HTML.jpg

相似文献

1
Toward identity preserving in face sketch-photo synthesis using a hybrid CNN-Mamba framework.使用混合CNN-Mamba框架在面部素描-照片合成中实现身份保留。
Sci Rep. 2024 Sep 28;14(1):22495. doi: 10.1038/s41598-024-72066-y.
2
An Efficient Transformer Based on Global and Local Self-Attention for Face Photo-Sketch Synthesis.一种基于全局和局部自注意力机制的高效Transformer用于面部照片-草图合成。
IEEE Trans Image Process. 2023;32:483-495. doi: 10.1109/TIP.2022.3229614. Epub 2022 Dec 30.
3
Toward Realistic Face Photo-Sketch Synthesis via Composition-Aided GANs.通过构图辅助的 GAN 实现逼真的人脸照片素描合成。
IEEE Trans Cybern. 2021 Sep;51(9):4350-4362. doi: 10.1109/TCYB.2020.2972944. Epub 2021 Sep 15.
4
Neural Probabilistic Graphical Model for Face Sketch Synthesis.用于人脸素描合成的神经概率图模型。
IEEE Trans Neural Netw Learn Syst. 2020 Jul;31(7):2623-2637. doi: 10.1109/TNNLS.2019.2933590. Epub 2019 Sep 4.
5
Graph-Regularized Locality-Constrained Joint Dictionary and Residual Learning for Face Sketch Synthesis.基于图正则化的局部约束联合字典和残差学习的人脸素描合成。
IEEE Trans Image Process. 2019 Feb;28(2):628-641. doi: 10.1109/TIP.2018.2870936. Epub 2018 Sep 18.
6
Multiple Representations-Based Face Sketch-Photo Synthesis.基于多种表示的人脸素描-照片合成。
IEEE Trans Neural Netw Learn Syst. 2016 Nov;27(11):2201-2215. doi: 10.1109/TNNLS.2015.2464681. Epub 2015 Sep 7.
7
Robust Face Sketch Style Synthesis.鲁棒人脸素描风格合成。
IEEE Trans Image Process. 2016 Jan;25(1):220-32. doi: 10.1109/TIP.2015.2501755. Epub 2015 Nov 18.
8
Multi-Level Cycle-Consistent Adversarial Networks with Attention Mechanism for Face Sketch-Photo Synthesis.基于注意力机制的多层次循环一致性对抗网络的人脸素描-照片合成。
Sensors (Basel). 2022 Sep 6;22(18):6725. doi: 10.3390/s22186725.
9
A Decision Support System for Face Sketch Synthesis Using Deep Learning and Artificial Intelligence.一种使用深度学习和人工智能的面部草图合成决策支持系统。
Sensors (Basel). 2021 Dec 8;21(24):8178. doi: 10.3390/s21248178.
10
Knowledge Distillation for Face Photo-Sketch Synthesis.知识蒸馏在人脸照片素描合成中的应用。
IEEE Trans Neural Netw Learn Syst. 2022 Feb;33(2):893-906. doi: 10.1109/TNNLS.2020.3030536. Epub 2022 Feb 3.

本文引用的文献

1
Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network With Graph Representation Learning.基于语义驱动生成对抗网络与图表示学习的双相面部照片-素描合成
IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):2182-2195. doi: 10.1109/TNNLS.2023.3341246. Epub 2025 Feb 6.
2
HiFiSketch: High Fidelity Face Photo-Sketch Synthesis and Manipulation.HiFiSketch:高保真面部照片-素描合成与操控
IEEE Trans Image Process. 2023;32:5865-5876. doi: 10.1109/TIP.2023.3326680. Epub 2023 Nov 3.
3
Video Captioning Using Global-Local Representation.
使用全局-局部表示的视频字幕
IEEE Trans Circuits Syst Video Technol. 2022 Oct;32(10):6642-6656. doi: 10.1109/tcsvt.2022.3177320. Epub 2022 May 23.
4
CMOS-GAN: Semi-Supervised Generative Adversarial Model for Cross-Modality Face Image Synthesis.CMOS-GAN:用于跨模态人脸图像合成的半监督生成对抗模型
IEEE Trans Image Process. 2023;32:144-158. doi: 10.1109/TIP.2022.3226413. Epub 2022 Dec 19.
5
An Efficient Transformer Based on Global and Local Self-Attention for Face Photo-Sketch Synthesis.一种基于全局和局部自注意力机制的高效Transformer用于面部照片-草图合成。
IEEE Trans Image Process. 2023;32:483-495. doi: 10.1109/TIP.2022.3229614. Epub 2022 Dec 30.
6
Controllable Sketch-to-Image Translation for Robust Face Synthesis.用于稳健面部合成的可控草图到图像翻译
IEEE Trans Image Process. 2021;30:8797-8810. doi: 10.1109/TIP.2021.3120669. Epub 2021 Oct 27.
7
Complementary, Heterogeneous and Adversarial Networks for Image-to-Image Translation.用于图像到图像翻译的互补、异构和对抗网络。
IEEE Trans Image Process. 2021;30:3487-3498. doi: 10.1109/TIP.2021.3061286. Epub 2021 Mar 11.
8
Toward Realistic Face Photo-Sketch Synthesis via Composition-Aided GANs.通过构图辅助的 GAN 实现逼真的人脸照片素描合成。
IEEE Trans Cybern. 2021 Sep;51(9):4350-4362. doi: 10.1109/TCYB.2020.2972944. Epub 2021 Sep 15.
9
Face Sketch Synthesis by Multidomain Adversarial Learning.基于多域对抗学习的面部草图合成
IEEE Trans Neural Netw Learn Syst. 2019 May;30(5):1419-1428. doi: 10.1109/TNNLS.2018.2869574. Epub 2018 Oct 1.
10
Dual-Transfer Face Sketch-Photo Synthesis.双重迁移人脸素描-照片合成。
IEEE Trans Image Process. 2019 Feb;28(2):642-657. doi: 10.1109/TIP.2018.2869688. Epub 2018 Sep 12.