H4MER：基于 Transformer 学习神经组合表示的人类 4D 建模。

H4MER: Human 4D Modeling by Learning Neural Compositional Representation With Transformer.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):14639-14652. doi: 10.1109/TPAMI.2023.3313311. Epub 2023 Nov 3.

DOI:10.1109/TPAMI.2023.3313311

Abstract

Despite the impressive results achieved by deep learning based 3D reconstruction, the techniques of directly learning to model 4D human captures with detailed geometry have been less studied. This work presents a novel neural compositional representation for Human 4D Modeling with transformER (H4MER). Specifically, our H4MER is a compact and compositional representation for dynamic human by exploiting the human body prior from the widely used SMPL parametric model. Thus, H4MER can represent a dynamic 3D human over a temporal span with the codes of shape, initial pose, motion and auxiliaries. A simple yet effective linear motion model is proposed to provide a rough and regularized motion estimation, followed by per-frame compensation for pose and geometry details with the residual encoded in the auxiliary codes. We present a novel Transformer-based feature extractor and conditional GRU decoder to facilitate learning and improve the representation capability. Extensive experiments demonstrate our method is not only effective in recovering dynamic human with accurate motion and detailed geometry, but also amenable to various 4D human related tasks, including monocular video fitting, motion retargeting, 4D completion, and future prediction.

摘要

尽管基于深度学习的 3D 重建技术取得了令人印象深刻的成果，但直接学习用详细几何模型来建模 4D 人体捕捉的技术研究较少。本工作提出了一种新颖的神经组合表示方法，用于人类 4D 建模，称为 transformER（H4MER）。具体来说，我们的 H4MER 是一种紧凑而组合的动态人体表示方法，利用了广泛使用的 SMPL 参数模型中的人体先验。因此，H4MER 可以用形状、初始姿势、运动和辅助代码的代码来表示跨越时间的动态 3D 人体。我们提出了一种简单而有效的线性运动模型，用于提供粗略的正则化运动估计，然后通过残差在辅助代码中对每一帧的姿势和几何细节进行补偿。我们提出了一种基于 Transformer 的特征提取器和条件 GRU 解码器，以方便学习和提高表示能力。广泛的实验表明，我们的方法不仅可以有效地恢复具有准确运动和详细几何的动态人体，而且还适用于各种与 4D 人体相关的任务，包括单目视频拟合、运动重定向、4D 完成和未来预测。

相似文献

H4MER: Human 4D Modeling by Learning Neural Compositional Representation With Transformer.H4MER：基于 Transformer 学习神经组合表示的人类 4D 建模。

IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):14639-14652. doi: 10.1109/TPAMI.2023.3313311. Epub 2023 Nov 3.

Impact of time-of-flight on indirect 3D and direct 4D parametric image reconstruction in the presence of inconsistent dynamic PET data.飞行时间对存在不一致动态PET数据时的间接3D和直接4D参数图像重建的影响。

Phys Med Biol. 2016 May 7;61(9):3443-71. doi: 10.1088/0031-9155/61/9/3443. Epub 2016 Apr 6.

Deep learning-based motion compensation for four-dimensional cone-beam computed tomography (4D-CBCT) reconstruction.基于深度学习的四维锥形束 CT（4D-CBCT）重建的运动补偿。

Med Phys. 2023 Feb;50(2):808-820. doi: 10.1002/mp.16103. Epub 2022 Dec 3.

Motion compensation for fully 4D PET reconstruction using PET superset data.利用 PET 超集数据进行全 4D PET 重建的运动补偿。

Phys Med Biol. 2010 Jul 21;55(14):4063-82. doi: 10.1088/0031-9155/55/14/008. Epub 2010 Jul 5.

Dynamic cone-beam CT reconstruction using spatial and temporal implicit neural representation learning (STINR).基于时空隐式神经表示学习（STINR）的动态锥形束 CT 重建。

Phys Med Biol. 2023 Feb 6;68(4):045005. doi: 10.1088/1361-6560/acb30d.

U-net-based deformation vector field estimation for motion-compensated 4D-CBCT reconstruction.基于U-net的形变矢量场估计用于运动补偿4D-CBCT重建。

Med Phys. 2020 Jul;47(7):3000-3012. doi: 10.1002/mp.14150. Epub 2020 Apr 27.

A microstructure estimation Transformer inspired by sparse representation for diffusion MRI.一种受扩散磁共振成像稀疏表示启发的微观结构估计Transformer。

Med Image Anal. 2023 May;86:102788. doi: 10.1016/j.media.2023.102788. Epub 2023 Mar 1.

Self-contained deep learning-based boosting of 4D cone-beam CT reconstruction.基于深度学习的独立式4D锥形束CT重建增强技术

Med Phys. 2020 Nov;47(11):5619-5631. doi: 10.1002/mp.14441. Epub 2020 Oct 15.

Spatio-temporal deep learning methods for motion estimation using 4D OCT image data.基于 4D-OCT 图像数据的运动估计的时空深度学习方法。

Int J Comput Assist Radiol Surg. 2020 Jun;15(6):943-952. doi: 10.1007/s11548-020-02178-z. Epub 2020 May 22.

Quantitative PET image reconstruction employing nested expectation-maximization deconvolution for motion compensation.采用嵌套期望最大化反卷积进行运动补偿的定量 PET 图像重建。

Comput Med Imaging Graph. 2017 Sep;60:11-21. doi: 10.1016/j.compmedimag.2016.11.006. Epub 2016 Nov 16.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

H4MER：基于 Transformer 学习神经组合表示的人类 4D 建模。

H4MER: Human 4D Modeling by Learning Neural Compositional Representation With Transformer.

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献