基于自监督学习和自注意力机制的智能家居中人类活动识别增强。

Enhancing Human Activity Recognition in Smart Homes with Self-Supervised Learning and Self-Attention.

机构信息

Department of Computer Science, Université de Sherbrooke, 2500 Bd de l'Université, Sherbrooke, QC J1K 2R1, Canada.

Department of Computer Science and Mathematics, Université du Québec à Chicoutimi, 555 Bd de l'Université, Chicoutimi, QC G7H 2B1, Canada.

出版信息

Sensors (Basel). 2024 Jan 29;24(3):884. doi: 10.3390/s24030884.

DOI:10.3390/s24030884

PMID:38339601

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10857438/

Abstract

Deep learning models have gained prominence in human activity recognition using ambient sensors, particularly for telemonitoring older adults' daily activities in real-world scenarios. However, collecting large volumes of annotated sensor data presents a formidable challenge, given the time-consuming and costly nature of traditional manual annotation methods, especially for extensive projects. In response to this challenge, we propose a novel AttCLHAR model rooted in the self-supervised learning framework SimCLR and augmented with a self-attention mechanism. This model is designed for human activity recognition utilizing ambient sensor data, tailored explicitly for scenarios with limited or no annotations. AttCLHAR encompasses unsupervised pre-training and fine-tuning phases, sharing a common encoder module with two convolutional layers and a long short-term memory (LSTM) layer. The output is further connected to a self-attention layer, allowing the model to selectively focus on different input sequence segments. The incorporation of sharpness-aware minimization (SAM) aims to enhance model generalization by penalizing loss sharpness. The pre-training phase focuses on learning representative features from abundant unlabeled data, capturing both spatial and temporal dependencies in the sensor data. It facilitates the extraction of informative features for subsequent fine-tuning tasks. We extensively evaluated the AttCLHAR model using three CASAS smart home datasets (Aruba-1, Aruba-2, and Milan). We compared its performance against the SimCLR framework, SimCLR with SAM, and SimCLR with the self-attention layer. The experimental results demonstrate the superior performance of our approach, especially in semi-supervised and transfer learning scenarios. It outperforms existing models, marking a significant advancement in using self-supervised learning to extract valuable insights from unlabeled ambient sensor data in real-world environments.

摘要

深度学习模型在使用环境传感器进行人类活动识别方面已经取得了显著的成就，特别是在远程监测老年人的日常活动方面。然而，由于传统的手动标注方法耗时且昂贵，收集大量标注传感器数据是一项艰巨的挑战，特别是对于大规模项目而言。针对这一挑战，我们提出了一种新颖的 AttCLHAR 模型，该模型基于自监督学习框架 SimCLR，并增强了自注意力机制。该模型旨在利用环境传感器数据进行人类活动识别，专门针对有限或无标注的场景进行设计。AttCLHAR 包括无监督预训练和微调阶段，与两个卷积层和一个长短期记忆 (LSTM) 层共享一个通用编码器模块。输出进一步连接到自注意力层，使模型能够选择性地关注不同的输入序列段。引入 sharpness-aware minimization (SAM) 旨在通过惩罚损失的锐度来增强模型的泛化能力。预训练阶段专注于从大量未标注数据中学习代表性特征，捕捉传感器数据中的空间和时间依赖性。它促进了后续微调任务中信息特征的提取。我们使用三个 CASAS 智能家居数据集（Aruba-1、Aruba-2 和 Milan）对 AttCLHAR 模型进行了广泛评估。我们将其性能与 SimCLR 框架、带 SAM 的 SimCLR 和带自注意力层的 SimCLR 进行了比较。实验结果表明了我们方法的优越性能，特别是在半监督和迁移学习场景中。它优于现有的模型，标志着在使用自监督学习从现实环境中的未标注环境传感器数据中提取有价值的见解方面取得了重大进展。