Department of Environmental Sciences, Jožef Stefan Institute, 1000 Ljubljana, Slovenia.
Ecotechnologies Programme, Jožef Stefan International Postgraduate School, 1000 Ljubljana, Slovenia.
Sensors (Basel). 2023 Dec 18;23(24):9890. doi: 10.3390/s23249890.
Participatory exposure research, which tracks behaviour and assesses exposure to stressors like air pollution, traditionally relies on time-activity diaries. This study introduces a novel approach, employing machine learning (ML) to empower laypersons in human activity recognition (HAR), aiming to reduce dependence on manual recording by leveraging data from wearable sensors. Recognising complex activities such as smoking and cooking presents unique challenges due to specific environmental conditions. In this research, we combined wearable environment/ambient and wrist-worn activity/biometric sensors for complex activity recognition in an urban stressor exposure study, measuring parameters like particulate matter concentrations, temperature, and humidity. Two groups, Group H (88 individuals) and Group M (18 individuals), wore the devices and manually logged their activities hourly and minutely, respectively. Prioritising accessibility and inclusivity, we selected three classification algorithms: k-nearest neighbours (IBk), decision trees (J48), and random forests (RF), based on: (1) proven efficacy in existing literature, (2) understandability and transparency for laypersons, (3) availability on user-friendly platforms like WEKA, and (4) efficiency on basic devices such as office laptops or smartphones. Accuracy improved with finer temporal resolution and detailed activity categories. However, when compared to other published human activity recognition research, our accuracy rates, particularly for less complex activities, were not as competitive. Misclassifications were higher for vague activities (resting, playing), while well-defined activities (smoking, cooking, running) had few errors. Including environmental sensor data increased accuracy for all activities, especially playing, smoking, and running. Future work should consider exploring other explainable algorithms available on diverse tools and platforms. Our findings underscore ML's potential in exposure studies, emphasising its adaptability and significance for laypersons while also highlighting areas for improvement.
参与式暴露研究通过跟踪行为并评估暴露于空气污染等应激源的情况,传统上依赖于时间活动日记。本研究引入了一种新方法,利用机器学习 (ML) 使非专业人员能够进行人类活动识别 (HAR),旨在通过利用可穿戴传感器的数据减少对手动记录的依赖。由于特定的环境条件,识别吸烟和烹饪等复杂活动具有独特的挑战。在这项研究中,我们将可穿戴环境/环境和手腕佩戴活动/生物识别传感器结合起来,用于城市应激源暴露研究中的复杂活动识别,测量颗粒物浓度、温度和湿度等参数。两组,H 组(88 人)和 M 组(18 人)分别佩戴设备并分别每小时和每分钟手动记录他们的活动。为了优先考虑可访问性和包容性,我们选择了三种分类算法:k-最近邻 (IBk)、决策树 (J48) 和随机森林 (RF),基于:(1) 在现有文献中的有效性,(2) 非专业人员的可理解性和透明度,(3) 在 WEKA 等用户友好的平台上的可用性,以及 (4) 在基本设备(如办公笔记本电脑或智能手机)上的效率。随着时间分辨率的提高和详细活动类别的增加,准确性也得到了提高。然而,与其他已发表的人类活动识别研究相比,我们的准确率,特别是对于不太复杂的活动,并没有那么有竞争力。对于模糊活动(休息、玩耍),误分类更高,而对于定义明确的活动(吸烟、烹饪、跑步),错误较少。包含环境传感器数据可提高所有活动的准确性,尤其是玩耍、吸烟和跑步。未来的工作应该考虑探索不同工具和平台上可用的其他可解释算法。我们的研究结果强调了 ML 在暴露研究中的潜力,强调了它对非专业人员的适应性和重要性,同时也突出了改进的领域。