Suppr超能文献

LCR-Net++:自然图像中的多人 2D 和 3D 姿态检测。

LCR-Net++: Multi-Person 2D and 3D Pose Detection in Natural Images.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1146-1161. doi: 10.1109/TPAMI.2019.2892985. Epub 2019 Jan 14.

Abstract

We propose an end-to-end architecture for joint 2D and 3D human pose estimation in natural images. Key to our approach is the generation and scoring of a number of pose proposals per image, which allows us to predict 2D and 3D poses of multiple people simultaneously. Hence, our approach does not require an approximate localization of the humans for initialization. Our Localization-Classification-Regression architecture, named LCR-Net, contains 3 main components: 1) the pose proposal generator that suggests candidate poses at different locations in the image; 2) a classifier that scores the different pose proposals; and 3) a regressor that refines pose proposals both in 2D and 3D. All three stages share the convolutional feature layers and are trained jointly. The final pose estimation is obtained by integrating over neighboring pose hypotheses, which is shown to improve over a standard non maximum suppression algorithm. Our method recovers full-body 2D and 3D poses, hallucinating plausible body parts when the persons are partially occluded or truncated by the image boundary. Our approach significantly outperforms the state of the art in 3D pose estimation on Human3.6M, a controlled environment. Moreover, it shows promising results on real images for both single and multi-person subsets of the MPII 2D pose benchmark and demonstrates satisfying 3D pose results even for multi-person images.

摘要

我们提出了一种用于自然图像中二维和三维人体姿态估计的端到端架构。我们方法的关键是为每张图像生成和评分多个姿态提案,这使我们能够同时预测多个人的二维和三维姿态。因此,我们的方法不需要对人类进行近似定位作为初始化。我们的定位-分类-回归架构,命名为 LCR-Net,包含 3 个主要组件:1)姿态提案生成器,在图像的不同位置建议候选姿态;2)分类器,对不同的姿态提案进行评分;3)回归器,在二维和三维上细化姿态提案。所有三个阶段都共享卷积特征层,并进行联合训练。最终的姿态估计是通过对相邻姿态假设进行积分得到的,这比标准的非极大值抑制算法有所改进。我们的方法能够恢复全身的二维和三维姿态,当人体被图像边界部分遮挡或截断时,能够生成合理的身体部位。我们的方法在受控环境的 Human3.6M 上的三维姿态估计方面明显优于现有技术,并且在 MPII 2D 姿态基准的单人子集和多人子集的真实图像上都表现出了有希望的结果,并且即使对于多人图像也能得到令人满意的三维姿态结果。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验