IEEE Trans Image Process. 2016 Feb;25(2):658-72. doi: 10.1109/TIP.2015.2492826. Epub 2015 Oct 19.
In this paper, a novel approach to local 3D surface matching representation suitable for a range of 3D vision applications is introduced. Local 3D surface patches around key points on the 3D surface are represented by 2D images such that the representing 2D images enjoy certain characteristics which positively impact the matching accuracy, robustness, and speed. First, the proposed representation is complete, in the sense, there is no information loss during their computation. Second, the 3DoF 2D representations are strictly invariant to all the 3DoF rotations. To optimally avail surface information, the sensitivity of the representations to surface information is adjustable. This also provides the proposed matching representation with the means to optimally adjust to a particular class of problems/applications or an acquisition technology. Each 2D matching representation is a sequence of adjustable integral kernels, where each kernel is efficiently computed from a triple of precise 3D curves (profiles) formed by intersecting three concentric spheres with the 3D surface. Robust techniques for sampling the profiles and establishing correspondences among them were devised. Based on the proposed matching representation, two techniques for the detection of key points were presented. The first is suitable for static images, while the second is suitable for 3D videos. The approach was tested on the face recognition grand challenge v2.0, the 3D twins expression challenge, and the Bosphorus data sets, and a superior face recognition performance was achieved. In addition, the proposed approach was used in object class recognition and tested on a Kinect data set.
本文提出了一种新的适用于多种三维视觉应用的局部三维表面匹配表示方法。通过二维图像表示三维表面关键点周围的局部三维表面补丁,使得表示二维图像具有某些特征,这些特征对匹配精度、鲁棒性和速度有积极的影响。首先,所提出的表示是完整的,即在计算过程中没有信息丢失。其次,3DoF 的二维表示严格不变于所有的 3DoF 旋转。为了最优地利用表面信息,对表示的表面信息敏感性是可调的。这也为所提出的匹配表示提供了根据特定问题/应用程序或采集技术进行最优调整的手段。每个二维匹配表示都是一系列可调积分核,其中每个核都可以从三个同心球体与三维表面相交形成的三对精确三维曲线(轮廓)高效计算得到。设计了用于采样轮廓和建立它们之间对应关系的稳健技术。基于所提出的匹配表示,提出了两种用于关键点检测的技术。第一种技术适用于静态图像,而第二种技术适用于三维视频。该方法在人脸识别大挑战 v2.0、3D 双胞胎表情挑战和博斯普鲁斯数据集上进行了测试,取得了卓越的人脸识别性能。此外,所提出的方法还用于对象类别识别,并在 Kinect 数据集上进行了测试。