School of Mathematics and Computer Science, Northwest Minzu University, Lanzhou, 730030, People's Republic of China.
Key Laboratory of Streaming Data Computing Technologies and Application, Northwest Minzu University, Lanzhou, 730030, People's Republic of China.
Sci Rep. 2022 Nov 30;12(1):20620. doi: 10.1038/s41598-022-24887-y.
Human Activity Recognition (HAR) is an important research area in human-computer interaction and pervasive computing. In recent years, many deep learning (DL) methods have been widely used for HAR, and due to their powerful automatic feature extraction capabilities, they achieve better recognition performance than traditional methods and are applicable to more general scenarios. However, the problem is that DL methods increase the computational cost of the system and take up more system resources while achieving higher recognition accuracy, which is more challenging for its operation in small memory terminal devices such as smartphones. So, we need to reduce the model size as much as possible while taking into account the recognition accuracy. To address this problem, we propose a multi-scale feature extraction fusion model combining Convolutional Neural Network (CNN) and Gated Recurrent Unit (GRU). The model uses different convolutional kernel sizes combined with GRU to accomplish the automatic extraction of different local features and long-term dependencies of the original data to obtain a richer feature representation. In addition, the proposed model uses separable convolution instead of classical convolution to meet the requirement of reducing model parameters while improving recognition accuracy. The accuracy of the proposed model is 97.18%, 96.71%, and 96.28% on the WISDM, UCI-HAR, and PAMAP2 datasets respectively. The experimental results show that the proposed model not only obtains higher recognition accuracy but also costs lower computational resources compared with other methods.
人体活动识别(HAR)是人机交互和普适计算中的一个重要研究领域。近年来,许多深度学习(DL)方法已被广泛应用于 HAR,由于其强大的自动特征提取能力,它们比传统方法具有更好的识别性能,并且适用于更一般的场景。然而,问题在于 DL 方法提高了系统的计算成本,占用了更多的系统资源,同时实现了更高的识别精度,这对其在智能手机等小型内存终端设备中的运行更加具有挑战性。因此,在考虑识别精度的同时,我们需要尽可能地减小模型的大小。为了解决这个问题,我们提出了一种结合卷积神经网络(CNN)和门控循环单元(GRU)的多尺度特征提取融合模型。该模型使用不同的卷积核大小与 GRU 相结合,自动提取原始数据的不同局部特征和长期依赖关系,以获得更丰富的特征表示。此外,所提出的模型使用可分离卷积代替经典卷积,以满足在提高识别精度的同时减少模型参数的要求。所提出的模型在 WISDM、UCI-HAR 和 PAMAP2 数据集上的准确率分别为 97.18%、96.71%和 96.28%。实验结果表明,与其他方法相比,所提出的模型不仅获得了更高的识别精度,而且还消耗了更低的计算资源。