• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于地标定位的结构感知全卷积网络的对抗学习

Adversarial Learning of Structure-Aware Fully Convolutional Networks for Landmark Localization.

作者信息

Chen Yu, Shen Chunhua, Chen Hao, Wei Xiu-Shen, Liu Lingqiao, Yang Jian

出版信息

IEEE Trans Pattern Anal Mach Intell. 2020 Jul;42(7):1654-1669. doi: 10.1109/TPAMI.2019.2901875. Epub 2019 Feb 26.

DOI:10.1109/TPAMI.2019.2901875
PMID:30835211
Abstract

Landmark/pose estimation in single monocular images has received much effort in computer vision due to its important applications. It remains a challenging task when input images come with severe occlusions caused by, e.g., adverse camera views. Under such circumstances, biologically implausible pose predictions may be produced. In contrast, human vision is able to predict poses by exploiting geometric constraints of landmark point inter-connectivity. To address the problem, by incorporating priors about the structure of pose components, we propose a novel structure-aware fully convolutional network to implicitly take such priors into account during training of the deep network. Explicit learning of such constraints is typically challenging. Instead, inspired by how human identifies implausible poses, we design discriminators to distinguish the real poses from the fake ones (such as biologically implausible ones). If the pose generator G generates results that the discriminator fails to distinguish from real ones, the network successfully learns the priors. Training of the network follows the strategy of conditional Generative Adversarial Networks (GANs). The effectiveness of the proposed network is evaluated on three pose-related tasks: 2D human pose estimation, 2D facial landmark estimation and 3D human pose estimation. The proposed approach significantly outperforms several state-of-the-art methods and almost always generates plausible pose predictions, demonstrating the usefulness of implicit learning of structures using GANs.

摘要

由于其重要应用,单目图像中的地标/姿态估计在计算机视觉领域受到了广泛关注。当输入图像存在严重遮挡(例如,不利的相机视角导致)时,这仍然是一项具有挑战性的任务。在这种情况下,可能会产生不符合生物学常理的姿态预测。相比之下,人类视觉能够通过利用地标点相互连接的几何约束来预测姿态。为了解决这个问题,通过纳入关于姿态组件结构的先验知识,我们提出了一种新颖的结构感知全卷积网络,以便在深度网络训练期间隐式地考虑这些先验知识。显式学习此类约束通常具有挑战性。相反,受人类如何识别不合理姿态的启发,我们设计了鉴别器来区分真实姿态和虚假姿态(例如不符合生物学常理的姿态)。如果姿态生成器G生成的结果鉴别器无法与真实结果区分开来,那么网络就成功地学习了先验知识。网络的训练遵循条件生成对抗网络(GAN)的策略。在三个与姿态相关的任务上评估了所提出网络的有效性:二维人体姿态估计、二维面部地标估计和三维人体姿态估计。所提出的方法显著优于几种最新方法,并且几乎总是生成合理的姿态预测,证明了使用GAN隐式学习结构的有用性。

相似文献

1
Adversarial Learning of Structure-Aware Fully Convolutional Networks for Landmark Localization.用于地标定位的结构感知全卷积网络的对抗学习
IEEE Trans Pattern Anal Mach Intell. 2020 Jul;42(7):1654-1669. doi: 10.1109/TPAMI.2019.2901875. Epub 2019 Feb 26.
2
3D conditional generative adversarial networks for high-quality PET image estimation at low dose.基于三维条件生成对抗网络的低剂量 PET 图像高质量估计。
Neuroimage. 2018 Jul 1;174:550-562. doi: 10.1016/j.neuroimage.2018.03.045. Epub 2018 Mar 20.
3
3D Human Pose Machines with Self-Supervised Learning.基于自监督学习的 3D 人体姿态估计
IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1069-1082. doi: 10.1109/TPAMI.2019.2892452. Epub 2019 Jan 14.
4
Fusing information from multiple 2D depth cameras for 3D human pose estimation in the operating room.将来自多个 2D 深度相机的信息融合用于手术室中的 3D 人体姿态估计。
Int J Comput Assist Radiol Surg. 2019 Nov;14(11):1871-1879. doi: 10.1007/s11548-019-02044-7. Epub 2019 Aug 6.
5
2D facial landmark localization method for multi-view face synthesis image using a two-pathway generative adversarial network approach.基于双通路生成对抗网络方法的多视角人脸合成图像的二维面部地标定位方法
PeerJ Comput Sci. 2022 Feb 16;8:e897. doi: 10.7717/peerj-cs.897. eCollection 2022.
6
Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields.利用深度卷积神经场从单目图像中学习深度。
IEEE Trans Pattern Anal Mach Intell. 2016 Oct;38(10):2024-39. doi: 10.1109/TPAMI.2015.2505283. Epub 2015 Dec 3.
7
Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose Estimation.使用 GAN 和风格迁移合成深度手图像进行手姿势估计。
Sensors (Basel). 2019 Jul 1;19(13):2919. doi: 10.3390/s19132919.
8
Weakly Supervised Adversarial Learning for 3D Human Pose Estimation from Point Clouds.基于点云的弱监督对抗学习三维人体姿态估计
IEEE Trans Vis Comput Graph. 2020 May;26(5):1851-1859. doi: 10.1109/TVCG.2020.2973076. Epub 2020 Feb 13.
9
Real-Time 3D Hand Pose Estimation with 3D Convolutional Neural Networks.基于3D卷积神经网络的实时3D手部姿态估计
IEEE Trans Pattern Anal Mach Intell. 2019 Apr;41(4):956-970. doi: 10.1109/TPAMI.2018.2827052. Epub 2018 Apr 16.
10
Adversarial Learning Semantic Volume for 2D/3D Face Shape Regression in the Wild.用于野外二维/三维人脸形状回归的对抗学习语义体积
IEEE Trans Image Process. 2019 Apr 19. doi: 10.1109/TIP.2019.2911114.

引用本文的文献

1
Fully automated image-based estimation of postural point-features in children with cerebral palsy using deep learning.使用深度学习对脑瘫儿童的姿势点特征进行基于图像的全自动估计。
R Soc Open Sci. 2019 Nov 6;6(11):191011. doi: 10.1098/rsos.191011. eCollection 2019 Nov.