National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, P.R. China.
IEEE Trans Pattern Anal Mach Intell. 2012 Mar;34(3):436-50. doi: 10.1109/TPAMI.2011.157.
Slow Feature Analysis (SFA) extracts slowly varying features from a quickly varying input signal. It has been successfully applied to modeling the visual receptive fields of the cortical neurons. Sufficient experimental results in neuroscience suggest that the temporal slowness principle is a general learning principle in visual perception. In this paper, we introduce the SFA framework to the problem of human action recognition by incorporating the discriminative information with SFA learning and considering the spatial relationship of body parts. In particular, we consider four kinds of SFA learning strategies, including the original unsupervised SFA (U-SFA), the supervised SFA (S-SFA), the discriminative SFA (D-SFA), and the spatial discriminative SFA (SD-SFA), to extract slow feature functions from a large amount of training cuboids which are obtained by random sampling in motion boundaries. Afterward, to represent action sequences, the squared first order temporal derivatives are accumulated over all transformed cuboids into one feature vector, which is termed the Accumulated Squared Derivative (ASD) feature. The ASD feature encodes the statistical distribution of slow features in an action sequence. Finally, a linear support vector machine (SVM) is trained to classify actions represented by ASD features. We conduct extensive experiments, including two sets of control experiments, two sets of large scale experiments on the KTH and Weizmann databases, and two sets of experiments on the CASIA and UT-interaction databases, to demonstrate the effectiveness of SFA for human action recognition. Experimental results suggest that the SFA-based approach (1) is able to extract useful motion patterns and improves the recognition performance, (2) requires less intermediate processing steps but achieves comparable or even better performance, and (3) has good potential to recognize complex multiperson activities.
慢特征分析(SFA)从快速变化的输入信号中提取缓慢变化的特征。它已成功应用于模拟皮质神经元的视觉感受野。神经科学的充分实验结果表明,时间缓慢原理是视觉感知中的一般学习原理。在本文中,我们通过将判别信息与 SFA 学习相结合,并考虑身体部位的空间关系,将 SFA 框架引入到人体动作识别问题中。具体来说,我们考虑了四种 SFA 学习策略,包括原始无监督 SFA(U-SFA)、监督 SFA(S-SFA)、判别 SFA(D-SFA)和空间判别 SFA(SD-SFA),从通过在运动边界中随机采样获得的大量训练长方体中提取慢特征函数。之后,为了表示动作序列,将所有变换后的长方体上的一阶时间导数平方相加到一个特征向量中,称为累积平方导数(ASD)特征。ASD 特征编码了动作序列中慢特征的统计分布。最后,训练一个线性支持向量机(SVM)来对由 ASD 特征表示的动作进行分类。我们进行了广泛的实验,包括两组对照实验、两组在 KTH 和 Weizmann 数据库上的大规模实验以及两组在 CASIA 和 UT-interaction 数据库上的实验,以证明 SFA 对人体动作识别的有效性。实验结果表明,基于 SFA 的方法(1)能够提取有用的运动模式并提高识别性能,(2)需要较少的中间处理步骤,但能达到可比甚至更好的性能,(3)具有识别复杂多人活动的良好潜力。