Nakandala Supun, Jankowska Marta M, Tuz-Zahra Fatima, Bellettiere John, Carlson Jordan A, LaCroix Andrea Z, Hartman Sheri J, Rosenberg Dori E, Zou Jingjing, Kumar Arun, Natarajan Loki
Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA, USA.
Qualcomm Institute/Calit2, University of California San Diego, La Jolla, CA, USA.
J Meas Phys Behav. 2021 Jun;4(2):102-110. doi: 10.1123/jmpb.2020-0016. Epub 2021 Feb 25.
Machine learning has been used for classification of physical behavior bouts from hip-worn accelerometers; however, this research has been limited due to the challenges of directly observing and coding human behavior "in the wild." Deep learning algorithms, such as convolutional neural networks (CNNs), may offer better representation of data than other machine learning algorithms without the need for engineered features and may be better suited to dealing with free-living data. The purpose of this study was to develop a modeling pipeline for evaluation of a CNN model on a free-living data set and compare CNN inputs and results with the commonly used machine learning random forest and logistic regression algorithms.
Twenty-eight free-living women wore an ActiGraph GT3X+accelerometer on their right hip for 7 days. A concurrently worn thigh-mounted activPAL device captured ground truth activity labels. The authors evaluated logistic regression, random forest, and CNN models for classifying sitting, standing, and stepping bouts. The authors also assessed the benefit of performing feature engineering for this task.
The CNN classifier performed best (average balanced accuracy for bout classification of sitting, standing, and stepping was 84%) compared with the other methods (56% for logistic regression and 76% for random forest), even without performing any feature engineering.
Using the recent advancements in deep neural networks, the authors showed that a CNN model can outperform other methods even without feature engineering. This has important implications for both the model's ability to deal with the complexity of free-living data and its potential transferability to new populations.
机器学习已被用于通过佩戴在髋部的加速度计对身体行为发作进行分类;然而,由于在“自然环境”中直接观察和编码人类行为存在挑战,这项研究受到了限制。深度学习算法,如卷积神经网络(CNN),可能比其他机器学习算法能更好地表示数据,且无需人工设计特征,可能更适合处理自由生活数据。本研究的目的是开发一个建模流程,用于在自由生活数据集上评估CNN模型,并将CNN的输入和结果与常用的机器学习随机森林和逻辑回归算法进行比较。
28名自由生活的女性在右髋部佩戴ActiGraph GT3X +加速度计7天。同时佩戴在大腿上的activPAL设备记录实际的活动标签。作者评估了逻辑回归、随机森林和CNN模型对坐姿、站姿和行走发作进行分类的情况。作者还评估了针对此任务进行特征工程的益处。
与其他方法(逻辑回归为56%,随机森林为76%)相比,即使不进行任何特征工程,CNN分类器的表现最佳(坐姿、站姿和行走发作分类的平均平衡准确率为84%)。
利用深度神经网络的最新进展,作者表明,即使不进行特征工程,CNN模型也能优于其他方法。这对于该模型处理自由生活数据复杂性的能力及其向新人群的潜在可转移性都具有重要意义。