Eleftheriadis Stefanos, Rudovic Ognjen, Pantic Maja
IEEE Trans Image Process. 2016 Dec;25(12):5727-5742. doi: 10.1109/TIP.2016.2615288. Epub 2016 Oct 5.
Automated analysis of facial expressions can benefit many domains, from marketing to clinical diagnosis of neurodevelopmental disorders. Facial expressions are typically encoded as a combination of facial muscle activations, i.e., action units. Depending on context, these action units co-occur in specific patterns, and rarely in isolation. Yet, most existing methods for automatic action unit detection fail to exploit dependencies among them, and the corresponding facial features. To address this, we propose a novel multi-conditional latent variable model for simultaneous fusion of facial features and joint action unit detection. Specifically, the proposed model performs feature fusion in a generative fashion via a low-dimensional shared subspace, while simultaneously performing action unit detection using a discriminative classification approach. We show that by combining the merits of both approaches, the proposed methodology outperforms existing purely discriminative/generative methods for the target task. To reduce the number of parameters, and avoid overfitting, a novel Bayesian learning approach based on Monte Carlo sampling is proposed, to integrate out the shared subspace. We validate the proposed method on posed and spontaneous data from three publicly available datasets (CK+, DISFA and Shoulder-pain), and show that both feature fusion and joint learning of action units leads to improved performance compared to the state-of-the-art methods for the task.
面部表情的自动分析可使许多领域受益,从市场营销到神经发育障碍的临床诊断。面部表情通常被编码为面部肌肉激活的组合,即动作单元。根据上下文,这些动作单元以特定模式同时出现,很少单独出现。然而,大多数现有的自动动作单元检测方法未能利用它们之间的依赖性以及相应的面部特征。为了解决这个问题,我们提出了一种新颖的多条件潜在变量模型,用于同时融合面部特征和联合动作单元检测。具体而言,所提出的模型通过低维共享子空间以生成方式执行特征融合,同时使用判别式分类方法执行动作单元检测。我们表明,通过结合这两种方法的优点,所提出的方法在目标任务上优于现有的纯判别式/生成式方法。为了减少参数数量并避免过拟合,提出了一种基于蒙特卡罗采样的新颖贝叶斯学习方法,以整合共享子空间。我们在来自三个公开可用数据集(CK+、DISFA 和 Shoulder-pain)的摆拍数据和自然数据上验证了所提出的方法,并表明与该任务的现有最先进方法相比,动作单元的特征融合和联合学习都能提高性能。