• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

面向透视投影的三维人脸重建:从单目图像估计 6DoF 人脸姿态。

Toward 3D Face Reconstruction in Perspective Projection: Estimating 6DoF Face Pose From Monocular Image.

出版信息

IEEE Trans Image Process. 2023;32:3080-3091. doi: 10.1109/TIP.2023.3275535. Epub 2023 May 30.

DOI:10.1109/TIP.2023.3275535
PMID:37192029
Abstract

In 3D face reconstruction, orthogonal projection has been widely employed to substitute perspective projection to simplify the fitting process. This approximation performs well when the distance between camera and face is far enough. However, in some scenarios that the face is very close to camera or moving along the camera axis, the methods suffer from the inaccurate reconstruction and unstable temporal fitting due to the distortion under the perspective projection. In this paper, we aim to address the problem of single-image 3D face reconstruction under perspective projection. Specifically, a deep neural network, Perspective Network (PerspNet), is proposed to simultaneously reconstruct 3D face shape in canonical space and learn the correspondence between 2D pixels and 3D points, by which the 6DoF (6 Degrees of Freedom) face pose can be estimated to represent perspective projection. Besides, we contribute a large ARKitFace dataset to enable the training and evaluation of 3D face reconstruction solutions under the scenarios of perspective projection, which has 902,724 2D facial images with ground-truth 3D face mesh and annotated 6DoF pose parameters. Experimental results show that our approach outperforms current state-of-the-art methods by a significant margin. The code and data are available at https://github.com/cbsropenproject/6dof_face.

摘要

在 3D 人脸重建中,正交投影已被广泛用于替代透视投影,以简化拟合过程。当相机和人脸之间的距离足够远时,这种近似效果很好。然而,在一些人脸非常靠近相机或沿相机轴移动的场景中,由于透视投影下的失真,这些方法会导致重建不准确和时间拟合不稳定。在本文中,我们旨在解决透视投影下的单幅 3D 人脸重建问题。具体来说,我们提出了一种深度神经网络,即透视网络(PerspNet),通过同时在规范空间中重建 3D 人脸形状并学习 2D 像素和 3D 点之间的对应关系,来估计 6DoF(6 自由度)人脸姿态,以表示透视投影。此外,我们还贡献了一个大型 ARKitFace 数据集,用于在透视投影场景下训练和评估 3D 人脸重建解决方案,该数据集包含 902724 张具有地面真实 3D 人脸网格和注释 6DoF 姿态参数的 2D 面部图像。实验结果表明,我们的方法显著优于当前最先进的方法。代码和数据可在 https://github.com/cbsropenproject/6dof_face 上获得。

相似文献

1
Toward 3D Face Reconstruction in Perspective Projection: Estimating 6DoF Face Pose From Monocular Image.面向透视投影的三维人脸重建:从单目图像估计 6DoF 人脸姿态。
IEEE Trans Image Process. 2023;32:3080-3091. doi: 10.1109/TIP.2023.3275535. Epub 2023 May 30.
2
6DoF Object Pose and Focal Length Estimation from Single RGB Images in Uncontrolled Environments.在不受控环境下从单张RGB图像估计6自由度物体姿态和焦距
Sensors (Basel). 2024 Aug 23;24(17):5474. doi: 10.3390/s24175474.
3
An Efficient 3D Human Pose Retrieval and Reconstruction from 2D Image-Based Landmarks.基于二维图像特征点的高效三维人体姿态检索与重建。
Sensors (Basel). 2021 Apr 1;21(7):2415. doi: 10.3390/s21072415.
4
Head Pose Estimation through Keypoints Matching between Reconstructed 3D Face Model and 2D Image.基于重建 3D 人脸模型和 2D 图像关键点匹配的头部姿势估计
Sensors (Basel). 2021 Mar 6;21(5):1841. doi: 10.3390/s21051841.
5
Single-Camera Multi-View 6DoF pose estimation for robotic grasping.用于机器人抓取的单相机多视图6自由度姿态估计
Front Neurorobot. 2023 Jun 13;17:1136882. doi: 10.3389/fnbot.2023.1136882. eCollection 2023.
6
EPro-PnP: Generalized End-to-End Probabilistic Perspective-N-Points for Monocular Object Pose Estimation.EPro-PnP:用于单目物体位姿估计的广义端到端概率视角N点法
IEEE Trans Pattern Anal Mach Intell. 2024 Jan 16;PP. doi: 10.1109/TPAMI.2024.3354997.
7
SynPo-Net-Accurate and Fast CNN-Based 6DoF Object Pose Estimation Using Synthetic Training.基于 SynPo-Net 的准确、快速的 CNN 六自由度物体位姿估计方法,使用合成训练。
Sensors (Basel). 2021 Jan 5;21(1):300. doi: 10.3390/s21010300.
8
On Learning 3D Face Morphable Model from In-the-Wild Images.从自然图像中学习3D人脸可变形模型
IEEE Trans Pattern Anal Mach Intell. 2021 Jan;43(1):157-171. doi: 10.1109/TPAMI.2019.2927975. Epub 2020 Dec 4.
9
MeshLifter: Weakly Supervised Approach for 3D Human Mesh Reconstruction from a Single 2D Pose Based on Loop Structure.基于循环结构的基于单张二维姿态的弱监督 3D 人体网格重建方法 MeshLifter。
Sensors (Basel). 2020 Jul 30;20(15):4257. doi: 10.3390/s20154257.
10
Dual Networks Based 3D Multi-Person Pose Estimation From Monocular Video.基于双网络的单目视频3D多人姿态估计
IEEE Trans Pattern Anal Mach Intell. 2023 Feb;45(2):1636-1651. doi: 10.1109/TPAMI.2022.3170353. Epub 2023 Jan 6.

引用本文的文献

1
An Open Source Framework for Free Precise Digital Facial Analysis.用于免费精确数字面部分析的开源框架。
World J Plast Surg. 2024;13(3):111-114. doi: 10.61186/wjps.13.3.111.