Smiley Aref, Finkelstein Joseph
Department of Biomedical Informatics, University of Utah, Salt Lake City, UT 84108, USA.
Diagnostics (Basel). 2024 Dec 28;15(1):52. doi: 10.3390/diagnostics15010052.
: This study aimed to explore machine learning approaches for predicting physical exertion using physiological signals collected from wearable devices. : Both traditional machine learning and deep learning methods for classification and regression were assessed. The research involved 27 healthy participants engaged in controlled cycling exercises. Physiological data, including ECG, heart rate, oxygen saturation, and pedal speed (RPM), were collected during these sessions, which were divided into eight two-minute segments. Heart rate variability (HRV) was also calculated to serve as a predictive indicator. We employed two feature selection algorithms to identify the most relevant features for model training: Minimum Redundancy Maximum Relevance (MRMR) for both classification and regression, and Univariate Feature Ranking for Classification. A total of 34 traditional models were developed using MATLAB's Classification Learner App, utilizing 20% of the data for testing. In addition, Long Short-Term Memory (LSTM) networks were trained on the top features selected by the MRMR and Univariate Feature Ranking algorithms to enhance model performance. Finally, the MRMR-selected features were used for regression to train the LSTM model for predicting continuous outcomes. : The LSTM model for regression demonstrated robust predictive capabilities, achieving a mean squared error (MSE) of 0.8493 and an R-squared value of 0.7757. The classification models also showed promising results, with the highest testing accuracy reaching 89.2% and an F1 score of 91.7%. : These results underscore the effectiveness of combining feature selection algorithms with advanced machine learning (ML) and deep learning techniques for predicting physical exertion levels using wearable sensor data.
本研究旨在探索利用从可穿戴设备收集的生理信号来预测体力消耗的机器学习方法。评估了用于分类和回归的传统机器学习和深度学习方法。该研究涉及27名健康参与者进行的受控自行车运动。在这些运动过程中收集了生理数据,包括心电图、心率、血氧饱和度和踏板速度(每分钟转速),这些运动被分为八个两分钟的时间段。还计算了心率变异性(HRV)作为预测指标。我们采用了两种特征选择算法来识别模型训练中最相关的特征:用于分类和回归的最小冗余最大相关性(MRMR)算法,以及用于分类的单变量特征排名算法。使用MATLAB的分类学习器应用程序开发了总共34个传统模型,利用20%的数据进行测试。此外,基于MRMR和单变量特征排名算法选择的顶级特征训练了长短期记忆(LSTM)网络,以提高模型性能。最后,将MRMR选择的特征用于回归,训练LSTM模型以预测连续结果。回归的LSTM模型表现出强大的预测能力,均方误差(MSE)为0.8493,决定系数(R平方)值为0.7757。分类模型也显示出有希望的结果,最高测试准确率达到89.2%,F1分数为91.7%。这些结果强调了将特征选择算法与先进的机器学习(ML)和深度学习技术相结合,利用可穿戴传感器数据预测体力消耗水平的有效性。