Suppr超能文献

学习人体关节三维运动稀疏码字典,实现实时人体活动理解。

Learning dictionaries of sparse codes of 3D movements of body joints for real-time human activity understanding.

机构信息

Brain and Behavior Discovery Institute, James and Jean Culver Vision Discovery Institute, Department of Ophthalmology, Georgia Regents University, Augusta, Georgia, 30912, United States of America.

出版信息

PLoS One. 2014 Dec 4;9(12):e114147. doi: 10.1371/journal.pone.0114147. eCollection 2014.

Abstract

Real-time human activity recognition is essential for human-robot interactions for assisted healthy independent living. Most previous work in this area is performed on traditional two-dimensional (2D) videos and both global and local methods have been used. Since 2D videos are sensitive to changes of lighting condition, view angle, and scale, researchers begun to explore applications of 3D information in human activity understanding in recently years. Unfortunately, features that work well on 2D videos usually don't perform well on 3D videos and there is no consensus on what 3D features should be used. Here we propose a model of human activity recognition based on 3D movements of body joints. Our method has three steps, learning dictionaries of sparse codes of 3D movements of joints, sparse coding, and classification. In the first step, space-time volumes of 3D movements of body joints are obtained via dense sampling and independent component analysis is then performed to construct a dictionary of sparse codes for each activity. In the second step, the space-time volumes are projected to the dictionaries and a set of sparse histograms of the projection coefficients are constructed as feature representations of the activities. Finally, the sparse histograms are used as inputs to a support vector machine to recognize human activities. We tested this model on three databases of human activities and found that it outperforms the state-of-the-art algorithms. Thus, this model can be used for real-time human activity recognition in many applications.

摘要

实时人体活动识别对于辅助健康独立生活的人机交互至关重要。该领域的大多数先前工作都是在传统的二维(2D)视频上进行的,并且已经使用了全局和局部方法。由于 2D 视频对光照条件、视角和比例的变化很敏感,研究人员近年来开始探索在人体活动理解中应用 3D 信息。不幸的是,在 2D 视频上效果很好的特征在 3D 视频上效果不佳,并且对于应该使用哪些 3D 特征还没有共识。在这里,我们提出了一种基于人体关节 3D 运动的人体活动识别模型。我们的方法有三个步骤,学习关节 3D 运动的稀疏码字典、稀疏编码和分类。在第一步中,通过密集采样获取身体关节 3D 运动的时空体,然后进行独立成分分析,为每个活动构造一个稀疏码字典。在第二步中,将时空体投影到字典上,并构建一组投影系数的稀疏直方图作为活动的特征表示。最后,将稀疏直方图作为输入提供给支持向量机以识别人体活动。我们在三个人体活动数据库上测试了该模型,发现它优于最新的算法。因此,该模型可用于许多应用中的实时人体活动识别。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/613e/4256388/73a9673deab7/pone.0114147.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验