学习人体关节三维运动稀疏码字典，实现实时人体活动理解。

Learning dictionaries of sparse codes of 3D movements of body joints for real-time human activity understanding.

机构信息

Brain and Behavior Discovery Institute, James and Jean Culver Vision Discovery Institute, Department of Ophthalmology, Georgia Regents University, Augusta, Georgia, 30912, United States of America.

出版信息

PLoS One. 2014 Dec 4;9(12):e114147. doi: 10.1371/journal.pone.0114147. eCollection 2014.

DOI:10.1371/journal.pone.0114147

PMID:25473850

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4256388/

Abstract

Real-time human activity recognition is essential for human-robot interactions for assisted healthy independent living. Most previous work in this area is performed on traditional two-dimensional (2D) videos and both global and local methods have been used. Since 2D videos are sensitive to changes of lighting condition, view angle, and scale, researchers begun to explore applications of 3D information in human activity understanding in recently years. Unfortunately, features that work well on 2D videos usually don't perform well on 3D videos and there is no consensus on what 3D features should be used. Here we propose a model of human activity recognition based on 3D movements of body joints. Our method has three steps, learning dictionaries of sparse codes of 3D movements of joints, sparse coding, and classification. In the first step, space-time volumes of 3D movements of body joints are obtained via dense sampling and independent component analysis is then performed to construct a dictionary of sparse codes for each activity. In the second step, the space-time volumes are projected to the dictionaries and a set of sparse histograms of the projection coefficients are constructed as feature representations of the activities. Finally, the sparse histograms are used as inputs to a support vector machine to recognize human activities. We tested this model on three databases of human activities and found that it outperforms the state-of-the-art algorithms. Thus, this model can be used for real-time human activity recognition in many applications.

摘要

实时人体活动识别对于辅助健康独立生活的人机交互至关重要。该领域的大多数先前工作都是在传统的二维（2D）视频上进行的，并且已经使用了全局和局部方法。由于 2D 视频对光照条件、视角和比例的变化很敏感，研究人员近年来开始探索在人体活动理解中应用 3D 信息。不幸的是，在 2D 视频上效果很好的特征在 3D 视频上效果不佳，并且对于应该使用哪些 3D 特征还没有共识。在这里，我们提出了一种基于人体关节 3D 运动的人体活动识别模型。我们的方法有三个步骤，学习关节 3D 运动的稀疏码字典、稀疏编码和分类。在第一步中，通过密集采样获取身体关节 3D 运动的时空体，然后进行独立成分分析，为每个活动构造一个稀疏码字典。在第二步中，将时空体投影到字典上，并构建一组投影系数的稀疏直方图作为活动的特征表示。最后，将稀疏直方图作为输入提供给支持向量机以识别人体活动。我们在三个人体活动数据库上测试了该模型，发现它优于最新的算法。因此，该模型可用于许多应用中的实时人体活动识别。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/613e/4256388/73a9673deab7/pone.0114147.g001.jpg

相似文献

Learning dictionaries of sparse codes of 3D movements of body joints for real-time human activity understanding.

PLoS One. 2014 Dec 4;9(12):e114147. doi: 10.1371/journal.pone.0114147. eCollection 2014.

Learning sparse representations for human action recognition.

IEEE Trans Pattern Anal Mach Intell. 2012 Aug;34(8):1576-88. doi: 10.1109/TPAMI.2011.253.

Multiple kernel sparse representations for supervised and unsupervised learning.

IEEE Trans Image Process. 2014 Jul;23(7):2905-15. doi: 10.1109/TIP.2014.2322938. Epub 2014 May 9.

A Human Activity Recognition System Using Skeleton Data from RGBD Sensors.

Comput Intell Neurosci. 2016;2016:4351435. doi: 10.1155/2016/4351435. Epub 2016 Mar 16.

Image transformation based on learning dictionaries across image spaces.

IEEE Trans Pattern Anal Mach Intell. 2013 Feb;35(2):367-80. doi: 10.1109/TPAMI.2012.95.

Learning Low-Rank Class-Specific Dictionary and Sparse Intra-Class Variant Dictionary for Face Recognition.

PLoS One. 2015 Nov 16;10(11):e0142403. doi: 10.1371/journal.pone.0142403. eCollection 2015.

Compositional Dictionaries for Domain Adaptive Face Recognition.

IEEE Trans Image Process. 2015 Dec;24(12):5152-65. doi: 10.1109/TIP.2015.2479456. Epub 2015 Sep 16.

Multimodal Task-Driven Dictionary Learning for Image Classification.

IEEE Trans Image Process. 2016 Jan;25(1):24-38. doi: 10.1109/TIP.2015.2496275. Epub 2015 Oct 30.

Surgical gesture classification from video and kinematic data.

Med Image Anal. 2013 Oct;17(7):732-45. doi: 10.1016/j.media.2013.04.007. Epub 2013 Apr 28.

Label consistent K-SVD: learning a discriminative dictionary for recognition.

IEEE Trans Pattern Anal Mach Intell. 2013 Nov;35(11):2651-64. doi: 10.1109/TPAMI.2013.88.

引用本文的文献

Human action recognition based on kinematic similarity in real time.

PLoS One. 2017 Oct 26;12(10):e0185719. doi: 10.1371/journal.pone.0185719. eCollection 2017.

A Real-Time Kinect Signature-Based Patient Home Monitoring System.

Sensors (Basel). 2016 Nov 23;16(11):1965. doi: 10.3390/s16111965.

本文引用的文献

Learning Actionlet Ensemble for 3D Human Action Recognition.

IEEE Trans Pattern Anal Mach Intell. 2014 May;36(5):914-27. doi: 10.1109/TPAMI.2013.198.

PUCK: An Automated Prompting System for Smart Environments: Towards achieving automated prompting; Challenges involved.

Pers Ubiquitous Comput. 2012 Oct 1;16(7):859-873. doi: 10.1007/s00779-011-0445-6.

Multilevel depth and image fusion for human activity detection.

IEEE Trans Cybern. 2013 Oct;43(5):1383-94. doi: 10.1109/TCYB.2013.2276433. Epub 2013 Aug 27.

Robust action recognition using multi-scale spatial-temporal concatenations of local features as natural action structures.

PLoS One. 2012;7(10):e46686. doi: 10.1371/journal.pone.0046686. Epub 2012 Oct 4.

Efficient additive kernels via explicit feature maps.

IEEE Trans Pattern Anal Mach Intell. 2012 Mar;34(3):480-92. doi: 10.1109/TPAMI.2011.153.

Characterizing multiple memory deficits and their relation to everyday functioning in individuals with mild cognitive impairment.

Neuropsychology. 2009 Mar;23(2):168-77. doi: 10.1037/a0014186.

User-adaptive reminders for home-based medical tasks. A case study.

Methods Inf Med. 2008;47(3):203-7.

Mild cognitive impairment and everyday function: evidence of reduced speed in performing instrumental activities of daily living.

Am J Geriatr Psychiatry. 2008 May;16(5):416-24. doi: 10.1097/JGP.0b013e31816b7303.

MCI is associated with deficits in everyday functioning.

Alzheimer Dis Assoc Disord. 2006 Oct-Dec;20(4):217-23. doi: 10.1097/01.wad.0000213849.51495.d9.

A fast learning algorithm for deep belief nets.

Neural Comput. 2006 Jul;18(7):1527-54. doi: 10.1162/neco.2006.18.7.1527.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

学习人体关节三维运动稀疏码字典，实现实时人体活动理解。

Learning dictionaries of sparse codes of 3D movements of body joints for real-time human activity understanding.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献