IEEE Trans Pattern Anal Mach Intell. 2020 Sep;42(9):2082-2095. doi: 10.1109/TPAMI.2019.2911937. Epub 2019 Apr 17.
Facial expression analysis could be greatly improved by incorporating spatial and temporal patterns present in facial behavior, but the patterns have not yet been utilized to their full advantage. We remedy this via a novel dynamic model-an interval temporal restricted Boltzmann machine (IT-RBM) - that is able to capture both universal spatial patterns and complicated temporal patterns in facial behavior for facial expression analysis. We regard a facial expression as a multifarious activity composed of sequential or overlapping primitive facial events. Allen's interval algebra is implemented to portray these complicated temporal patterns via a two-layer Bayesian network. The nodes in the upper-most layer are representative of the primitive facial events, and the nodes in the lower layer depict the temporal relationships between those events. Our model also captures inherent universal spatial patterns via a multi-value restricted Boltzmann machine in which the visible nodes are facial events, and the connections between hidden and visible nodes model intrinsic spatial patterns. Efficient learning and inference algorithms are proposed. Experiments on posed and spontaneous expression distinction and expression recognition demonstrate that our proposed IT-RBM achieves superior performance compared to state-of-the art research due to its ability to incorporate these facial behavior patterns.
通过结合面部行为中存在的空间和时间模式,可以极大地改进面部表情分析,但这些模式尚未被充分利用。我们通过一种新颖的动态模型——区间时间限制玻尔兹曼机(IT-RBM)来解决这个问题,该模型能够捕捉面部行为中的通用空间模式和复杂的时间模式,从而进行面部表情分析。我们将面部表情视为由连续或重叠的基本面部事件组成的多样化活动。通过两层贝叶斯网络,使用 Allen 的区间代数来描绘这些复杂的时间模式。最上层的节点代表基本面部事件,而下层节点则描述了这些事件之间的时间关系。我们的模型还通过多值限制玻尔兹曼机捕捉内在的通用空间模式,其中可见节点是面部事件,隐藏节点和可见节点之间的连接则模拟内在的空间模式。提出了有效的学习和推理算法。在姿态和自然表情区分以及表情识别方面的实验表明,由于能够整合这些面部行为模式,我们提出的 IT-RBM 相较于现有技术的研究方法具有更优的性能。