IEEE Trans Cybern. 2015 Sep;45(9):1717-30. doi: 10.1109/TCYB.2014.2359056. Epub 2014 Oct 9.
We present a method for the automatic localization of facial landmarks that integrates nonrigid deformation with the ability to handle missing points. The algorithm generates sets of candidate locations from feature detectors and performs combinatorial search constrained by a flexible shape model. A key assumption of our approach is that for some landmarks there might not be an accurate candidate in the input set. This is tackled by detecting partial subsets of landmarks and inferring those that are missing, so that the probability of the flexible model is maximized. The ability of the model to work with incomplete information makes it possible to limit the number of candidates that need to be retained, drastically reducing the number of combinations to be tested with respect to the alternative of trying to always detect the complete set of landmarks. We demonstrate the accuracy of the proposed method in the face recognition grand challenge database, where we obtain average errors of approximately 3.5 mm when targeting 14 prominent facial landmarks. For the majority of these our method produces the most accurate results reported to date in this database. Handling of occlusions and surfaces with missing parts is demonstrated with tests on the Bosphorus database, where we achieve an overall error of 4.81 and 4.25 mm for data with and without occlusions, respectively. To investigate potential limits in the accuracy that could be reached, we also report experiments on a database of 144 facial scans acquired in the context of clinical research, with manual annotations performed by experts, where we obtain an overall error of 2.3 mm, with averages per landmark below 3.4 mm for all 14 targeted points and within 2 mm for half of them. The coordinates of automatically located landmarks are made available on-line.
我们提出了一种自动定位人脸特征点的方法,该方法将非刚性变形与处理缺失点的能力相结合。该算法从特征检测器生成候选位置集,并通过灵活的形状模型进行组合搜索。我们方法的一个关键假设是,对于某些特征点,输入集中可能没有准确的候选点。通过检测特征点的部分子集并推断缺失的特征点来解决这个问题,从而最大化灵活模型的概率。该模型能够处理不完整信息,使得可以限制需要保留的候选者数量,从而大大减少需要测试的组合数量,与尝试始终检测完整特征点集的替代方案相比。我们在人脸识别大挑战数据库中展示了所提出方法的准确性,在该数据库中,当目标是 14 个显著的人脸特征点时,我们获得了约 3.5 毫米的平均误差。对于其中大多数特征点,我们的方法产生了迄今为止在该数据库中报告的最准确结果。通过在博斯普鲁斯数据库上的测试演示了对遮挡和具有缺失部分的表面的处理,在该数据库中,对于有遮挡和无遮挡的数据,我们分别实现了 4.81 和 4.25 毫米的总体误差。为了研究可能达到的精度的潜在限制,我们还报告了在临床研究背景下获取的 144 个面部扫描数据库上的实验结果,由专家进行手动注释,我们获得了 2.3 毫米的总体误差,对于所有 14 个目标点,每个点的平均值低于 3.4 毫米,其中一半的点在 2 毫米以内。自动定位的特征点的坐标可在线获取。