Wu Qingqiang, Xu Guanghua, Chen Longting, Luo Ailing, Zhang Sicong
School of Mechanical Engineering, Xi'an Jiaotong University, Xi'an, China.
PLoS One. 2017 Oct 26;12(10):e0185719. doi: 10.1371/journal.pone.0185719. eCollection 2017.
Human action recognition using 3D pose data has gained a growing interest in the field of computer robotic interfaces and pattern recognition since the availability of hardware to capture human pose. In this paper, we propose a fast, simple, and powerful method of human action recognition based on human kinematic similarity. The key to this method is that the action descriptor consists of joints position, angular velocity and angular acceleration, which can meet the different individual sizes and eliminate the complex normalization. The angular parameters of joints within a short sliding time window (approximately 5 frames) around the current frame are used to express each pose frame of human action sequence. Moreover, three modified KNN (k-nearest-neighbors algorithm) classifiers are employed in our method: one for achieving the confidence of every frame in the training step, one for estimating the frame label of each descriptor, and one for classifying actions. Additional estimating of the frame's time label makes it possible to address single input frames. This approach can be used on difficult, unsegmented sequences. The proposed method is efficient and can be run in real time. The research shows that many public datasets are irregularly segmented, and a simple method is provided to regularize the datasets. The approach is tested on some challenging datasets such as MSR-Action3D, MSRDailyActivity3D, and UTD-MHAD. The results indicate our method achieves a higher accuracy.
自从有了用于捕捉人体姿态的硬件以来,利用三维姿态数据进行人体动作识别在计算机机器人接口和模式识别领域越来越受到关注。在本文中,我们提出了一种基于人体运动学相似性的快速、简单且强大的人体动作识别方法。该方法的关键在于动作描述符由关节位置、角速度和角加速度组成,这能够适应不同的个体尺寸并消除复杂的归一化处理。围绕当前帧在短滑动时间窗口(约5帧)内的关节角参数用于表示人体动作序列的每个姿态帧。此外,我们的方法采用了三个改进的KNN(k近邻算法)分类器:一个用于在训练步骤中获得每一帧的置信度,一个用于估计每个描述符的帧标签,还有一个用于对动作进行分类。对帧的时间标签进行额外估计使得处理单个输入帧成为可能。这种方法可用于处理困难的、未分割的序列。所提出的方法效率高且能实时运行。研究表明许多公共数据集的分割不规则,并提供了一种简单方法来对数据集进行规范化。该方法在一些具有挑战性的数据集上进行了测试,如MSR - Action3D、MSRDailyActivity3D和UTD - MHAD。结果表明我们的方法实现了更高的准确率。