从关节运动中构建结构：无需训练数据的准确且稳定的单目 3D 重建。

Structure from Articulated Motion: Accurate and Stable Monocular 3D Reconstruction without Training Data.

机构信息

Department Augmented Vision, German Research Center for Artificial Intelligence (DFKI), 67663 Kaiserslautern, Germany.

Department of Computer Graphics, Max Planck Institute for Informatics, 66123 Saarbrücken, Germany.

出版信息

Sensors (Basel). 2019 Oct 22;19(20):4603. doi: 10.3390/s19204603.

DOI:10.3390/s19204603

PMID:31652665

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6833108/

Abstract

Recovery of articulated 3D structure from 2D observations is a challenging computer vision problem with many applications. Current learning-based approaches achieve state-of-the-art accuracy on public benchmarks but are restricted to specific types of objects and motions covered by the training datasets. Model-based approaches do not rely on training data but show lower accuracy on these datasets. In this paper, we introduce a model-based method called (SfAM), which can recover multiple object and motion types without training on extensive data collections. At the same time, it performs on par with learning-based state-of-the-art approaches on public benchmarks and outperforms previous non-rigid structure from motion (NRSfM) methods. SfAM is built upon a general-purpose NRSfM technique while integrating a soft spatio-temporal constraint on the bone lengths. We use alternating optimization strategy to recover optimal geometry (i.e., bone proportions) together with 3D joint positions by enforcing the bone lengths consistency over a series of frames. SfAM is highly robust to noisy 2D annotations, generalizes to arbitrary objects and does not rely on training data, which is shown in extensive experiments on public benchmarks and real video sequences. We believe that it brings a new perspective on the domain of monocular 3D recovery of articulated structures, including human motion capture.

摘要

从二维观测中恢复关节 3D 结构是一个具有广泛应用的计算机视觉难题。当前基于学习的方法在公共基准测试中达到了最先进的准确性，但仅限于训练数据集涵盖的特定类型的对象和运动。基于模型的方法不依赖于训练数据，但在这些数据集上的准确性较低。在本文中，我们介绍了一种名为（SfAM）的基于模型的方法，它可以在不依赖于广泛数据集合的情况下恢复多种对象和运动类型。同时，它在公共基准测试上与基于学习的最先进方法表现相当，并优于以前的非刚性运动结构（NRSfM）方法。SfAM 建立在通用的 NRSfM 技术之上，同时在骨骼长度上集成了软时空约束。我们使用交替优化策略通过在一系列帧上强制骨骼长度的一致性来恢复最佳几何形状（即骨骼比例）和 3D 关节位置。SfAM 对噪声二维注释具有高度的鲁棒性，可泛化到任意对象，并且不依赖于训练数据，这在公共基准测试和真实视频序列上的广泛实验中得到了证明。我们相信，它为包括人类运动捕捉在内的单目关节结构 3D 恢复领域带来了新的视角。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6fa9/6833108/a5aa1790f4e1/sensors-19-04603-g0A1.jpg

相似文献

Structure from Articulated Motion: Accurate and Stable Monocular 3D Reconstruction without Training Data.

Sensors (Basel). 2019 Oct 22;19(20):4603. doi: 10.3390/s19204603.

Capturing Complex 3D Human Motions with Kernelized Low-Rank Representation from Monocular RGB Camera.

Sensors (Basel). 2017 Sep 3;17(9):2019. doi: 10.3390/s17092019.

MonoCap: Monocular Human Motion Capture using a CNN Coupled with a Geometric Prior.

IEEE Trans Pattern Anal Mach Intell. 2019 Apr;41(4):901-914. doi: 10.1109/TPAMI.2018.2816031. Epub 2018 Mar 15.

Colonoscopic 3D reconstruction by tubular non-rigid structure-from-motion.

Int J Comput Assist Radiol Surg. 2021 Jul;16(7):1237-1241. doi: 10.1007/s11548-021-02409-x. Epub 2021 May 24.

Cycle-SfM: Joint self-supervised learning of depth and camera motion from monocular image sequences.

Chaos. 2019 Dec;29(12):123102. doi: 10.1063/1.5120605.

RAUM-VO: Rotational Adjusted Unsupervised Monocular Visual Odometry.

Sensors (Basel). 2022 Mar 30;22(7):2651. doi: 10.3390/s22072651.

Deep Non-Rigid Structure From Motion With Missing Data.

IEEE Trans Pattern Anal Mach Intell. 2021 Dec;43(12):4365-4377. doi: 10.1109/TPAMI.2020.2997026. Epub 2021 Nov 3.

A factorization-based approach for articulated nonrigid shape, motion and kinematic chain recovery from video.

IEEE Trans Pattern Anal Mach Intell. 2008 May;30(5):865-77. doi: 10.1109/TPAMI.2007.70739.

Robust Spatio-Temporal Clustering and Reconstruction of Multiple Deformable Bodies.

IEEE Trans Pattern Anal Mach Intell. 2019 Apr;41(4):971-984. doi: 10.1109/TPAMI.2018.2823717. Epub 2018 Apr 6.

Model-Based Real-Time Non-Rigid Tracking.

Sensors (Basel). 2017 Oct 14;17(10):2342. doi: 10.3390/s17102342.

引用本文的文献

Single-Shot Structured Light Sensor for 3D Dense and Dynamic Reconstruction.

Sensors (Basel). 2020 Feb 17;20(4):1094. doi: 10.3390/s20041094.

本文引用的文献

WHSP-Net: A Weakly-Supervised Approach for 3D Hand Shape and Pose Recovery from a Single Depth Image.

Sensors (Basel). 2019 Aug 31;19(17):3784. doi: 10.3390/s19173784.

LCR-Net++: Multi-Person 2D and 3D Pose Detection in Natural Images.

IEEE Trans Pattern Anal Mach Intell. 2020 May;42(5):1146-1161. doi: 10.1109/TPAMI.2019.2892985. Epub 2019 Jan 14.

MonoCap: Monocular Human Motion Capture using a CNN Coupled with a Geometric Prior.

IEEE Trans Pattern Anal Mach Intell. 2019 Apr;41(4):901-914. doi: 10.1109/TPAMI.2018.2816031. Epub 2018 Mar 15.

3D Reconstruction of Human Motion from Monocular Image Sequences.

IEEE Trans Pattern Anal Mach Intell. 2016 Aug;38(8):1505-16. doi: 10.1109/TPAMI.2016.2553028. Epub 2016 Apr 12.

Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments.

IEEE Trans Pattern Anal Mach Intell. 2014 Jul;36(7):1325-39. doi: 10.1109/TPAMI.2013.248.

Kernel Non-Rigid Structure from Motion.

Proc IEEE Int Conf Comput Vis. 2011:802-809. doi: 10.1109/ICCV.2011.6126319.

Trajectory Space: A Dual Representation for Nonrigid Structure from Motion.

IEEE Trans Pattern Anal Mach Intell. 2011 Jul;33(7):1442-56. doi: 10.1109/TPAMI.2010.201. Epub 2010 Nov 18.

A factorization-based approach for articulated nonrigid shape, motion and kinematic chain recovery from video.

IEEE Trans Pattern Anal Mach Intell. 2008 May;30(5):865-77. doi: 10.1109/TPAMI.2007.70739.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

从关节运动中构建结构：无需训练数据的准确且稳定的单目 3D 重建。

Structure from Articulated Motion: Accurate and Stable Monocular 3D Reconstruction without Training Data.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献