Wang Lin, Luo Zizhang, Zhang Tianle
Faculty of Health and Life Sciences, University of Exeter, Heavitree Road, Exeter, EX1 2LU, UK.
Engineering & Technology College, Yangtze University, Jingzhou, 434023, China.
BMC Biomed Eng. 2025 Feb 1;7(1):2. doi: 10.1186/s42490-025-00088-2.
The aim of this study is to apply a novel hybrid framework incorporating a Vision Transformer (ViT) and bidirectional long short-term memory (Bi-LSTM) model for classifying physical activity intensity (PAI) in adults using gravity-based acceleration. Additionally, it further investigates how PAI and temporal window (TW) impacts the model' s accuracy.
This research used the Capture-24 dataset, consisting of raw accelerometer data from 151 participants aged 18 to 91. Gravity-based acceleration was utilised to generate images encoding various PAIs. These images were subsequently analysed using the ViT-BiLSTM model, with results presented in confusion matrices and compared with baseline models. The model's robustness was evaluated through temporal stability testing and examination of accuracy and loss curves.
The ViT-BiLSTM model excelled in PAI classification task, achieving an overall accuracy of 98.5% ± 1.48% across five TWs-98.7% for 1s, 98.1% for 5s, 98.2% for 10s, 99% for 15s, and 98.65% for 30s of TW. The model consistently exhibited superior accuracy in predicting sedentary (98.9% ± 1%) compared to light physical activity (98.2% ± 2%) and moderate-to-vigorous physical activity (98.2% ± 3%). ANOVA showed no significant accuracy variation across PAIs (F = 2.18, p = 0.13) and TW (F = 0.52, p = 0.72). Accuracy and loss curves show the model consistently improves its performance across epochs, demonstrating its excellent robustness.
This study demonstrates the ViT-BiLSTM model's efficacy in classifying PAI using gravity-based acceleration, with performance remaining consistent across diverse TWs and intensities. However, PAI and TW could result in slight variations in the model's performance. Future research should concern and investigate the impact of gravity-based acceleration on PAI thresholds, which may influence model's robustness and reliability.
本研究旨在应用一种新型混合框架,该框架结合视觉Transformer(ViT)和双向长短期记忆(Bi-LSTM)模型,用于使用基于重力的加速度对成年人的身体活动强度(PAI)进行分类。此外,它还进一步研究了PAI和时间窗口(TW)如何影响模型的准确性。
本研究使用了Capture-24数据集,该数据集由151名年龄在18至91岁之间的参与者的原始加速度计数据组成。利用基于重力的加速度来生成编码各种PAI的图像。随后使用ViT-BiLSTM模型对这些图像进行分析,结果以混淆矩阵呈现,并与基线模型进行比较。通过时间稳定性测试以及对准确性和损失曲线的检查来评估模型的稳健性。
ViT-BiLSTM模型在PAI分类任务中表现出色,在五个TW下的总体准确率达到98.5%±1.48%——1秒的TW为98.7%,5秒的TW为98.1%,10秒的TW为98.2%,15秒的TW为99%,30秒的TW为98.65%。与轻度身体活动(98.2%±2%)和中度至剧烈身体活动(98.2%±3%)相比,该模型在预测久坐行为(98.9%±1%)方面始终表现出更高的准确性。方差分析表明,不同PAI(F = 2.18,p = 0.13)和TW(F = 0.52,p = 0.72)之间的准确性没有显著差异。准确性和损失曲线表明,该模型在各个训练轮次中持续提高其性能,证明了其出色的稳健性。
本研究证明了ViT-BiLSTM模型在使用基于重力的加速度对PAI进行分类方面的有效性,其性能在不同的TW和强度下保持一致。然而,PAI和TW可能会导致模型性能出现轻微变化。未来的研究应关注并研究基于重力的加速度对PAI阈值的影响,这可能会影响模型的稳健性和可靠性。