Higher School of Communications of Tunis (Sup’Com), University of Carthage, Ariana, Tunisia.
IEEE Trans Pattern Anal Mach Intell. 2012 Nov;34(11):2247-58. doi: 10.1109/TPAMI.2012.19.
We introduce a new gesture recognition framework based on learning local motion signatures (LMSs) of HOG descriptors introduced by [1]. Our main contribution is to propose a new probabilistic learning-classification scheme based on a reliable tracking of local features. After the generation of these LMSs computed on one individual by tracking Histograms of Oriented Gradient (HOG) [2] descriptor, we learn a codebook of video-words (i.e., clusters of LMSs) using k-means algorithm on a learning gesture video database. Then, the video-words are compacted to a code-book of codewords by the Maximization of Mutual Information (MMI) algorithm. At the final step, we compare the LMSs generated for a new gesture w.r.t. the learned code-book via the k-nearest neighbors (k-NN) algorithm and a novel voting strategy. Our main contribution is the handling of the N to N mapping between codewords and gesture labels within the proposed voting strategy. Experiments have been carried out on two public gesture databases: KTH [3] and IXMAS [4]. Results show that the proposed method outperforms recent state-of-the-art methods.
我们介绍了一种新的基于学习局部运动特征(LMS)的手势识别框架,这些 LMS 是由 [1] 提出的方向梯度直方图(HOG)描述符引入的。我们的主要贡献是提出了一种新的基于可靠跟踪局部特征的概率学习分类方案。在通过跟踪方向梯度直方图(HOG)[2]描述符生成这些 LMS 之后,我们使用 k-均值算法在学习手势视频数据库上学习视频词(即 LMS 聚类)的代码本。然后,通过最大化互信息(MMI)算法将视频词压缩到代码词的代码本中。在最后一步,我们通过 k-最近邻(k-NN)算法和一种新的投票策略比较为新手势生成的 LMS 与学习的代码本。我们的主要贡献是在提出的投票策略中处理代码词和手势标签之间的 N 到 N 映射。实验在两个公开的手势数据库上进行:KTH [3]和 IXMAS [4]。结果表明,所提出的方法优于最新的最先进的方法。