Wen Tiancheng, Ding Zhonggan, Yao Yongqiang, Wang Yaxiong, Qian Xueming
IEEE Trans Neural Netw Learn Syst. 2023 Dec;34(12):10516-10527. doi: 10.1109/TNNLS.2022.3167743. Epub 2023 Nov 30.
Since recent facial landmark localization methods achieve satisfying accuracy, few of them enable fast inference speed, which, however, is critical in many real-world facial applications. Existing methods typically employ complicated network structure and predict all the key points through uniform computation, which is inefficient since individual facial part might take different computation to obtain the best performance. Taking both accuracy and efficiency into consideration, we propose the PicassoNet, a lightweight cascaded facial landmark detector with adaptive computation for individual facial part. Different from the conventional cascaded methods, PicassoNet integrates refinement submodules into a single network with group convolution, where each convolution group predicts landmarks from an individual facial part. Note that the groups' structures are flexible in the training process. Then, a novel grouping search algorithm is proposed to optimize the group division. With formulating the optimization as a network architecture search (NAS) problem, the grouping search adaptively allocates computation to each group and obtains an efficient structure. In addition, we propose a boundary-aware loss to optimize along tangent and normal of facial boundaries, instead of optimizing along horizontal and vertical as the conventional loss (L2, SmoothL1, WingLoss, and so on) do. The novel loss improves the joint locations of predicted keypoints. Experiments on three benchmark datasets AFLW, 300W, and WFLW show that the proposed method runs over 6× times faster than the state of the arts and meanwhile achieves comparable accuracy.
由于最近的面部 landmark 定位方法取得了令人满意的精度,但其中很少有方法能实现快速推理速度,而这在许多实际面部应用中至关重要。现有方法通常采用复杂的网络结构,并通过统一计算预测所有关键点,这是低效的,因为单个面部部分可能需要不同的计算来获得最佳性能。综合考虑准确性和效率,我们提出了 PicassoNet,一种针对单个面部部分具有自适应计算的轻量级级联面部 landmark 检测器。与传统的级联方法不同,PicassoNet 通过分组卷积将细化子模块集成到单个网络中,其中每个卷积组从单个面部部分预测 landmark。请注意,组的结构在训练过程中是灵活的。然后,提出了一种新颖的分组搜索算法来优化组划分。通过将优化表述为网络架构搜索(NAS)问题,分组搜索自适应地将计算分配给每个组并获得高效的结构。此外,我们提出了一种边界感知损失,沿着面部边界的切线和法线进行优化,而不是像传统损失(L2、SmoothL1、WingLoss 等)那样沿着水平和垂直方向进行优化。这种新颖的损失提高了预测关键点的联合位置。在三个基准数据集 AFLW、300W 和 WFLW 上的实验表明,所提出的方法比现有技术快 6 倍以上,同时实现了可比的精度。