Suppr超能文献

用于识别传统中国功法中视频记录动作的 CNN-LSTM 模型。

CNN-LSTM Model for Recognizing Video-Recorded Actions Performed in a Traditional Chinese Exercise.

机构信息

School of Electronic and Information EngineeringSuzhou University of Science and Technology Suzhou 215009 China.

Suzhou Institute of Biomedical Engineering and Technology Suzhou 215000 China.

出版信息

IEEE J Transl Eng Health Med. 2023 Jun 2;11:351-359. doi: 10.1109/JTEHM.2023.3282245. eCollection 2023.

Abstract

Identifying human actions from video data is an important problem in the fields of intelligent rehabilitation assessment. Motion feature extraction and pattern recognition are the two key procedures to achieve such goals. Traditional action recognition models are usually based on the geometric features manually extracted from video frames, which are however difficult to adapt to complex scenarios and cannot achieve high-precision recognition and robustness. We investigate a motion recognition model and apply it to recognize the sequence of complicated actions of a traditional Chinese exercise (ie, Baduanjin). We first developed a combined convolutional neural network (CNN) and long short-term memory (LSTM) model for recognizing the sequence of actions captured in video frames, and applied it to recognize the actions of Baduanjin. Moreover, this method has been compared with the traditional action recognition model based on geometric motion features in which Openpose is used to identify the joint positions in the skeletons. Its performance of high recognition accuracy has been verified on the testing video dataset, containing the video clips from 18 different practicers. The CNN-LSTM recognition model achieved 96.43% accuracy on the testing set; while those manually extracted features in the traditional action recognition model were only able to achieve 66.07% classification accuracy on the testing video dataset. The abstract image features extracted by the CNN module are more effective on improving the classification accuracy of the LSTM model. The proposed CNN-LSTM based method can be a useful tool in recognizing the complicated actions.

摘要

从视频数据中识别人类动作是智能康复评估领域的一个重要问题。运动特征提取和模式识别是实现这一目标的两个关键步骤。传统的动作识别模型通常基于从视频帧中手动提取的几何特征,但这些特征难以适应复杂的场景,无法实现高精度的识别和鲁棒性。我们研究了一种运动识别模型,并将其应用于识别传统中国功法(如八段锦)的复杂动作序列。我们首先开发了一个组合卷积神经网络(CNN)和长短期记忆(LSTM)模型,用于识别视频帧中捕获的动作序列,并将其应用于识别八段锦的动作。此外,该方法与基于几何运动特征的传统动作识别模型进行了比较,该模型使用 Openpose 识别骨骼中的关节位置。在包含 18 位不同练习者视频片段的测试视频数据集上验证了其具有高识别精度的性能。CNN-LSTM 识别模型在测试集上的准确率达到 96.43%;而传统动作识别模型中手动提取的特征在测试视频数据集上的分类准确率仅为 66.07%。CNN 模块提取的抽象图像特征更有效地提高了 LSTM 模型的分类准确率。基于 CNN-LSTM 的方法可以成为识别复杂动作的有用工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7eb8/10332470/605174310774/yang1-3282245.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验