Lightweight Explicit 3D Human Digitization via Normal Integration

Authors

Liu Jiaxuan, Wu Jingyi, Jing Ruiyang, Yu Han, Liu Jing, Song Liang

Affiliations

Academy for Engineering and Technology, Fudan University, Shanghai 200433, China.

Innovation Platform for Academicians of Hainan Province, Haikou 570228, China.

Publication information

Sensors (Basel). 2025 Feb 28;25(5):1513. doi: 10.3390/s25051513.

DOI: 10.3390/s25051513

PMID: 40096397

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11902492/

Abstract

In recent years, generating 3D human models from images has gained significant attention in 3D human reconstruction. However, deploying large neural network models in practical applications remains challenging, particularly on resource-constrained edge devices, because such models require substantially more computation, which raises both hardware requirements and inference time. One way to address this is to optimize the network architecture so that it uses fewer parameters, reducing the reliance on hardware resources. We propose a lightweight and efficient 3D human reconstruction model that balances reconstruction accuracy and computational cost. Specifically, our model integrates dilated convolutions and the cross-covariance attention mechanism into its architecture to construct a lightweight generative network. This design captures multi-scale information while significantly reducing model complexity. Additionally, we introduce a loss function tailored to the geometric properties of normal maps, which provides a more accurate measure of surface reconstruction quality and improves overall reconstruction performance. Experimental results show that, compared with existing methods, our approach reduces the number of trainable parameters by approximately 80% while maintaining the quality of the generated model.
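The two architectural ingredients named in the abstract can be illustrated with a small, dependency-free sketch. This is not the authors' network: it is a toy 1D dilated convolution and a single-head cross-covariance attention in its commonly described general form, with the learned projections, multiple heads, and learnable temperature omitted. All function names (`dilated_conv1d`, `xca_map`, `cross_covariance_attention`) are hypothetical and chosen only for illustration.

```python
import math


def dilated_conv1d(signal, kernel, dilation=1):
    """Valid (unpadded) 1D cross-correlation with `dilation` spacing between taps."""
    span = (len(kernel) - 1) * dilation + 1  # receptive field of one output sample
    return [
        sum(w * signal[start + j * dilation] for j, w in enumerate(kernel))
        for start in range(len(signal) - span + 1)
    ]


def _l2_normalize_columns(rows):
    """Scale each column (one channel across all tokens) to unit L2 norm."""
    norms = [math.sqrt(sum(x * x for x in col)) or 1.0 for col in zip(*rows)]
    return [[x / n for x, n in zip(row, norms)] for row in rows]


def _softmax(values):
    peak = max(values)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in values]
    total = sum(exps)
    return [e / total for e in exps]


def xca_map(q, k, temperature=1.0):
    """Channel-to-channel attention map of size d x d for N x d inputs."""
    qn, kn = _l2_normalize_columns(q), _l2_normalize_columns(k)
    d = len(q[0])
    return [
        _softmax([
            sum(qr[i] * kr[j] for qr, kr in zip(qn, kn)) / temperature
            for j in range(d)
        ])
        for i in range(d)
    ]


def cross_covariance_attention(q, k, v, temperature=1.0):
    """Mix the channels of every token in v using the d x d map; returns N x d."""
    attn = xca_map(q, k, temperature)
    d = len(attn)
    return [[sum(attn[i][j] * row[j] for j in range(d)) for i in range(d)] for row in v]


if __name__ == "__main__":
    signal = [1, 2, 3, 4, 5, 6, 7]
    # Same three weights in both calls; only the spacing between taps changes.
    print(dilated_conv1d(signal, [1, 1, 1], dilation=1))  # [6, 9, 12, 15, 18]
    print(dilated_conv1d(signal, [1, 1, 1], dilation=2))  # [9, 12, 15]

    tokens = [[1.0, 0.0, 2.0], [0.5, 1.0, 0.0], [0.0, 2.0, 1.0], [1.5, 0.5, 0.5]]
    attn = xca_map(tokens, tokens)
    print(len(attn), len(attn[0]))  # 3 3: the map depends on channels, not tokens
```

With dilation 2, each output sample covers five input samples instead of three while the kernel still holds three weights, which is how dilation widens the receptive field without adding parameters. The attention map is d x d regardless of how many tokens are fed in, so its cost grows linearly with the number of tokens rather than quadratically as in token-to-token self-attention.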

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/997e/11902492/2dfb0cf43359/sensors-25-01513-g001.jpg
