• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于概率潜在一致性的目标遮挡人体形状与姿态估计

Object-Occluded Human Shape and Pose Estimation With Probabilistic Latent Consistency.

作者信息

Huang Buzhen, Zhang Tianshu, Wang Yangang

出版信息

IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):5010-5026. doi: 10.1109/TPAMI.2022.3199449. Epub 2023 Mar 7.

DOI:10.1109/TPAMI.2022.3199449
PMID:35976842
Abstract

Occlusions between human and objects, especially for the activities of human-object interactions, are very common in practical applications. However, most of the existing approaches for 3D human shape and pose estimation require that human bodies are well captured without occlusions or with minor self-occlusions. In this paper, we focus on the problem of directly estimating the object-occluded human shape and pose from single color images. Our key idea is to utilize a partial UV map to represent an object-occluded human body, and the full 3D human shape estimation is ultimately converted as an image inpainting problem. We propose a novel two-branch network architecture to train an end-to-end regressor via a latent distribution consistency, which also includes a novel visible feature sub-net to extract the human information from object-occluded color images. To supervise the network training, we further build a novel dataset named as 3DOH50K. Several experiments are conducted to reveal the effectiveness of the proposed method. Experimental results demonstrate that the proposed method achieves state-of-the-art compared with previous methods. The dataset and codes are publicly available at https://www.yangangwang.com/papers/ZHANG-OOH-2020-03.html.

摘要

人与物体之间的遮挡,特别是在人机交互活动中,在实际应用中非常常见。然而,现有的大多数三维人体形状和姿态估计方法都要求人体在无遮挡或仅有轻微自遮挡的情况下被良好捕捉。在本文中,我们专注于从单张彩色图像直接估计被物体遮挡的人体形状和姿态的问题。我们的关键思想是利用部分UV映射来表示被物体遮挡的人体,而完整的三维人体形状估计最终被转化为一个图像修复问题。我们提出了一种新颖的双分支网络架构,通过潜在分布一致性来训练一个端到端回归器,其中还包括一个新颖的可见特征子网,用于从被物体遮挡的彩色图像中提取人体信息。为了监督网络训练,我们进一步构建了一个名为3DOH50K的新颖数据集。进行了多项实验以揭示所提方法的有效性。实验结果表明,与先前方法相比,所提方法达到了当前最优水平。数据集和代码可在https://www.yangangwang.com/papers/ZHANG-OOH-2020-03.html上公开获取。

相似文献

1
Object-Occluded Human Shape and Pose Estimation With Probabilistic Latent Consistency.基于概率潜在一致性的目标遮挡人体形状与姿态估计
IEEE Trans Pattern Anal Mach Intell. 2023 Apr;45(4):5010-5026. doi: 10.1109/TPAMI.2022.3199449. Epub 2023 Mar 7.
2
LASOR: Learning Accurate 3D Human Pose and Shape via Synthetic Occlusion-Aware Data and Neural Mesh Rendering.LASOR:通过合成遮挡感知数据和神经网格渲染学习精确的 3D 人体姿态和形状。
IEEE Trans Image Process. 2022;31:1938-1948. doi: 10.1109/TIP.2022.3149229. Epub 2022 Feb 16.
3
MSSPA-GC: Multi-Scale Shape Prior Adaptation with 3D Graph Convolutions for Category-Level Object Pose Estimation.MSSPA-GC:基于 3D 图卷积的多尺度形状先验自适应的类别级物体位姿估计。
Neural Netw. 2023 Sep;166:609-621. doi: 10.1016/j.neunet.2023.07.037. Epub 2023 Jul 31.
4
Pose2UV: Single-Shot Multiperson Mesh Recovery With Deep UV Prior.Pose2UV:基于深度UV先验的单帧多人网格恢复
IEEE Trans Image Process. 2022;31:4679-4692. doi: 10.1109/TIP.2022.3187294. Epub 2022 Jul 12.
5
RotationNet for Joint Object Categorization and Unsupervised Pose Estimation from Multi-View Images.用于多视图图像联合目标分类和无监督姿态估计的旋转网络
IEEE Trans Pattern Anal Mach Intell. 2021 Jan;43(1):269-283. doi: 10.1109/TPAMI.2019.2922640. Epub 2020 Dec 4.
6
Category-Level Object Pose Estimation with Statistic Attention.基于统计注意力的类别级目标姿态估计
Sensors (Basel). 2024 Aug 19;24(16):5347. doi: 10.3390/s24165347.
7
RNNPose: 6-DoF Object Pose Estimation via Recurrent Correspondence Field Estimation and Pose Optimization.RNNPose:通过循环对应场估计和姿态优化实现的6自由度物体姿态估计
IEEE Trans Pattern Anal Mach Intell. 2024 Jul;46(7):4669-4683. doi: 10.1109/TPAMI.2024.3360181. Epub 2024 Jun 5.
8
Real-Time Simultaneous Pose and Shape Estimation for Articulated Objects Using a Single Depth Camera.使用单目深度相机实时估计关节物体的姿态和形状。
IEEE Trans Pattern Anal Mach Intell. 2016 Aug;38(8):1517-32. doi: 10.1109/TPAMI.2016.2557783. Epub 2016 Apr 21.
9
Detection, segmentation, and 3D pose estimation of surgical tools using convolutional neural networks and algebraic geometry.使用卷积神经网络和代数几何进行手术工具的检测、分割和三维姿态估计。
Med Image Anal. 2021 May;70:101994. doi: 10.1016/j.media.2021.101994. Epub 2021 Feb 7.
10
LCR-Net++: Multi-Person 2D and 3D Pose Detection in Natural Images.LCR-Net++:自然图像中的多人 2D 和 3D 姿态检测。
IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1146-1161. doi: 10.1109/TPAMI.2019.2892985. Epub 2019 Jan 14.

引用本文的文献

1
OccFusion: Rendering Occluded Humans with Generative Diffusion Priors.OccFusion:利用生成扩散先验渲染被遮挡的人体。
Adv Neural Inf Process Syst. 2024;37:92184-92209.