• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于语义解析转换的非配对人物图像生成。

Unpaired Person Image Generation With Semantic Parsing Transformation.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2021 Nov;43(11):4161-4176. doi: 10.1109/TPAMI.2020.2992105. Epub 2021 Oct 1.

DOI:10.1109/TPAMI.2020.2992105
PMID:32365019
Abstract

In this paper, we tackle the problem of pose-guided person image generation with unpaired data, which is a challenging problem due to non-rigid spatial deformation. Instead of learning a fixed mapping directly between human bodies as previous methods, we propose a new pathway to decompose a single fixed mapping into two subtasks, namely, semantic parsing transformation and appearance generation. First, to simplify the learning for non-rigid deformation, a semantic generative network is developed to transform semantic parsing maps between different poses. Second, guided by semantic parsing maps, we render the foreground and background image, respectively. A foreground generative network learns to synthesize semantic-aware textures, and another background generative network learns to predict missing background regions caused by pose changes. Third, we enable pseudo-label training with unpaired data, and demonstrate that end-to-end training of the overall network further refines the semantic map prediction and final results accordingly. Moreover, our method is generalizable to other person image generation tasks defined on semantic maps, e.g., clothing texture transfer, controlled image manipulation, and virtual try-on. Experimental results on DeepFashion and Market-1501 datasets demonstrate the superiority of our method, especially in keeping better body shapes and clothing attributes, as well as rendering structure-coherent backgrounds.

摘要

在本文中,我们解决了使用非配对数据进行姿势引导的人像生成问题,这是一个具有挑战性的问题,因为存在非刚体的空间变形。与之前的方法直接学习人体之间的固定映射不同,我们提出了一种新的途径,可以将单一的固定映射分解为两个子任务,即语义解析转换和外观生成。首先,为了简化非刚体变形的学习,我们开发了一个语义生成网络,以在不同姿势之间转换语义解析图。其次,通过语义解析图指导,我们分别渲染前景和背景图像。一个前景生成网络学习合成语义感知纹理,另一个背景生成网络学习预测由于姿势变化而导致的缺失背景区域。第三,我们可以使用非配对数据进行伪标签训练,并证明整个网络的端到端训练可以进一步细化语义图预测,并相应地改进最终结果。此外,我们的方法可推广到其他基于语义图的人像生成任务,例如,服装纹理转换、控制图像操作和虚拟试穿。在 DeepFashion 和 Market-1501 数据集上的实验结果表明,我们的方法具有优越性,尤其是在保持更好的身体形状和服装属性以及渲染结构一致的背景方面。

相似文献

1
Unpaired Person Image Generation With Semantic Parsing Transformation.基于语义解析转换的非配对人物图像生成。
IEEE Trans Pattern Anal Mach Intell. 2021 Nov;43(11):4161-4176. doi: 10.1109/TPAMI.2020.2992105. Epub 2021 Oct 1.
2
Person image generation through graph-based and appearance-decomposed generative adversarial network.通过基于图和外观分解的生成对抗网络生成人物图像。
PeerJ Comput Sci. 2021 Dec 24;7:e761. doi: 10.7717/peerj-cs.761. eCollection 2021.
3
Human Co-Parsing Guided Alignment for Occluded Person Re-Identification.用于遮挡行人重识别的人类协同解析引导对齐
IEEE Trans Image Process. 2023;32:458-470. doi: 10.1109/TIP.2022.3229639. Epub 2022 Dec 28.
4
On Symbiosis of Attribute Prediction and Semantic Segmentation.属性预测与语义分割的共生。
IEEE Trans Pattern Anal Mach Intell. 2021 May;43(5):1620-1635. doi: 10.1109/TPAMI.2019.2956039. Epub 2021 Apr 1.
5
Pose Flow Learning From Person Images for Pose Guided Synthesis.从人体图像中进行姿态流学习以实现姿态引导合成。
IEEE Trans Image Process. 2021;30:1898-1909. doi: 10.1109/TIP.2020.3031108. Epub 2021 Jan 20.
6
Semantic and Geometric-Aware Day-to-Night Image Translation Network.语义与几何感知的昼夜图像翻译网络
Sensors (Basel). 2024 Feb 19;24(4):1339. doi: 10.3390/s24041339.
7
Graphonomy: Universal Image Parsing via Graph Reasoning and Transfer.图谱学:通过图推理和迁移进行通用图像解析。
IEEE Trans Pattern Anal Mach Intell. 2022 May;44(5):2504-2518. doi: 10.1109/TPAMI.2020.3043268. Epub 2022 Apr 1.
8
Pose-Guided Hierarchical Semantic Decomposition and Composition for Human Parsing.用于人体解析的姿态引导分层语义分解与合成
IEEE Trans Cybern. 2023 Mar;53(3):1641-1652. doi: 10.1109/TCYB.2021.3107544. Epub 2023 Feb 15.
9
Local and Global GANs With Semantic-Aware Upsampling for Image Generation.用于图像生成的具有语义感知上采样的局部和全局生成对抗网络
IEEE Trans Pattern Anal Mach Intell. 2023 Jan;45(1):768-784. doi: 10.1109/TPAMI.2022.3155989. Epub 2022 Dec 5.
10
Spatial-Aware Texture Transformer for High-Fidelity Garment Transfer.面向高逼真度服装传递的空间感知纹理变换。
IEEE Trans Image Process. 2021;30:7499-7510. doi: 10.1109/TIP.2021.3107235. Epub 2021 Sep 3.

引用本文的文献

1
Automated rock mass condition assessment during TBM tunnel excavation using deep learning.基于深度学习的 TBM 隧道掘进过程中岩体质量自动化评估
Sci Rep. 2022 Feb 2;12(1):1722. doi: 10.1038/s41598-022-05727-5.