A Simple, Fast and Highly-Accurate Algorithm to Recover 3D Shape from 2D Landmarks on a Single Image.

Publication information

IEEE Trans Pattern Anal Mach Intell. 2018 Dec;40(12):3059-3066. doi: 10.1109/TPAMI.2017.2772922. Epub 2017 Nov 13.

Abstract

Three-dimensional shape reconstruction from 2D landmark points on a single image is a hallmark of human vision, but it is a task that has proven difficult for computer vision algorithms. We define a feed-forward deep neural network algorithm that can reconstruct 3D shapes from 2D landmark points almost perfectly (i.e., with extremely small reconstruction errors), even when these 2D landmarks are from a single image. Our experimental results show an improvement of up to two-fold over state-of-the-art computer vision algorithms; the 3D shape reconstruction error (measured as the Procrustes distance between the reconstructed shape and the ground truth) of human faces is , cars is .0022, human bodies is .022, and highly deformable flags is .0004. Our algorithm was also a top performer at the 2016 3D Face Alignment in the Wild Challenge (held in conjunction with the European Conference on Computer Vision, ECCV), which required the reconstruction of 3D face shape from a single image. The derived algorithm can be trained in a couple of hours, and testing runs at more than 1,000 frames/s on an i7 desktop. We also present an innovative data augmentation approach that allows us to train the system efficiently with a small number of samples. The system is also robust to noise (e.g., imprecise landmark points) and missing data (e.g., occluded or undetected landmark points).
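
The abstract reports its errors as a Procrustes distance between the reconstructed and ground-truth shapes. As a rough illustration of that kind of metric, the sketch below removes translation, scale and rotation before measuring the residual. The function name `procrustes_distance`, the unit-norm scaling and the use of NumPy are assumptions made for this example; the abstract does not state which normalization the authors used, so the values produced here are not directly comparable to the reported figures.

```python
import numpy as np


def procrustes_distance(pred, truth):
    """Residual between two (n_landmarks, 3) shapes after alignment.

    Hypothetical helper: assumes both shapes list the same landmarks
    in the same order and are not degenerate (not all points equal).
    """
    pred = np.asarray(pred, dtype=float)
    truth = np.asarray(truth, dtype=float)

    # Remove translation by centering each shape on its centroid.
    a = pred - pred.mean(axis=0)
    b = truth - truth.mean(axis=0)

    # Remove scale by normalizing to unit Frobenius norm (an assumption).
    a = a / np.linalg.norm(a)
    b = b / np.linalg.norm(b)

    # Remove rotation: find the rotation r minimizing ||a @ r - b||
    # from the SVD of the 3x3 cross-covariance matrix.
    u, _, vt = np.linalg.svd(a.T @ b)
    # Flip the last axis if needed so r is a proper rotation (det = +1).
    d = np.sign(np.linalg.det(u @ vt))
    r = u @ np.diag([1.0, 1.0, d]) @ vt

    # Frobenius norm of what remains after alignment.
    return np.linalg.norm(a @ r - b)


# Usage: a rotated, scaled and shifted copy of a shape should give a
# distance near zero (up to floating-point round-off).
rng = np.random.default_rng(0)
shape = rng.normal(size=(68, 3))  # 68 landmarks is an arbitrary choice
angle = np.deg2rad(30.0)
rot_z = np.array([
    [np.cos(angle), -np.sin(angle), 0.0],
    [np.sin(angle), np.cos(angle), 0.0],
    [0.0, 0.0, 1.0],
])
moved = 2.5 * shape @ rot_z + np.array([1.0, -2.0, 0.5])
print(procrustes_distance(moved, shape))
```

Because both shapes are scaled to unit norm, the result is independent of the units of the landmark coordinates, which is one reason this family of metrics is convenient for comparing objects as different in size as faces and cars.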
