Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan.
Institute of Health Data Analytics and Statistics, College of Public Health, National Taiwan University, Taipei, Taiwan.
J Med Internet Res. 2024 Sep 11;26:e59497. doi: 10.2196/59497.
Monitoring free-living physical activity (PA) through wearable devices enables the real-time assessment of activity features associated with health outcomes and provision of treatment recommendations and adjustments. The conclusions of studies on PA and health depend crucially on reliable statistical analyses of digital data. Data analytics, however, are challenging due to the various metrics adopted for measuring PA, different aims of studies, and complex temporal variations within variables. The application, interpretation, and appropriateness of these analytical tools have yet to be summarized.
This research aimed to review studies that used analytical methods for analyzing PA monitored by accelerometers. Specifically, this review addressed three questions: (1) What metrics are used to describe an individual's free-living daily PA? (2) What are the current analytical tools for analyzing PA data, particularly under the aims of classification, association with health outcomes, and prediction of health events? and (3) What challenges exist in the analyses, and what recommendations for future research are suggested regarding the use of statistical methods in various research tasks?
This scoping review was conducted following an existing framework to map research studies by exploring the information about PA. Three databases, PubMed, IEEE Xplore, and the ACM Digital Library, were searched in February 2024 to identify related publications. Eligible articles were classification, association, or prediction studies involving human PA monitored through wearable accelerometers.
After screening 1312 articles, 428 (32.62%) eligible studies were identified and categorized into at least 1 of the following 3 thematic categories: classification (75/428, 17.5%), association (342/428, 79.9%), and prediction (32/428, 7.5%). Most articles (414/428, 96.7%) derived PA variables from 3D acceleration, rather than 1D acceleration. All eligible articles (428/428, 100%) considered PA metrics represented in the time domain, while a small fraction (16/428, 3.7%) also considered PA metrics in the frequency domain. The number of studies evaluating the influence of PA on health conditions has increased greatly. Among the studies in our review, regression-type models were the most prevalent (373/428, 87.1%). The machine learning approach for classification research is also gaining popularity (32/75, 43%). In addition to summary statistics of PA, several recent studies used tools to incorporate PA trajectories and account for temporal patterns, including longitudinal data analysis with repeated PA measurements and functional data analysis with PA as a continuum for time-varying association (68/428, 15.9%).
Summary metrics can quickly provide descriptions of the strength, frequency, and duration of individuals' overall PA. When the distribution and profile of PA need to be evaluated or detected, considering PA metrics as longitudinal or functional data can provide detailed information and improve the understanding of the role PA plays in health. Depending on the research goal, appropriate analytical tools can ensure the reliability of the scientific findings.
通过可穿戴设备监测自由活动体力活动(PA),可以实时评估与健康结果相关的活动特征,并提供治疗建议和调整。PA 与健康相关的研究结论在很大程度上取决于对数字数据进行可靠的统计分析。然而,数据分析具有挑战性,因为用于测量 PA 的各种指标、研究目的不同,以及变量内的复杂时间变化。这些分析工具的应用、解释和适当性尚未得到总结。
本研究旨在回顾使用加速度计监测 PA 的分析方法的研究。具体而言,本综述提出了三个问题:(1)用于描述个体日常自由活动 PA 的指标有哪些?(2)目前用于分析 PA 数据的分析工具是什么,特别是在分类、与健康结果的关联以及健康事件预测方面?(3)分析中存在哪些挑战,以及对于在各种研究任务中使用统计方法,未来研究有哪些建议?
本范围综述按照现有的框架进行,通过探索有关 PA 的信息来绘制研究研究。于 2024 年 2 月在 PubMed、IEEE Xplore 和 ACM 数字图书馆三个数据库中搜索相关文献,以确定相关出版物。合格的文章是涉及通过可穿戴式加速度计监测的人类 PA 的分类、关联或预测研究。
经过筛选 1312 篇文章,确定了 428 篇(32.62%)符合条件的文章,并分为以下三个主题类别之一:分类(75/428,17.5%)、关联(342/428,79.9%)和预测(32/428,7.5%)。大多数文章(414/428,96.7%)从 3D 加速度中得出 PA 变量,而不是 1D 加速度。所有合格文章(428/428,100%)均考虑了时间域中表示的 PA 指标,而一小部分(16/428,3.7%)也考虑了频率域中的 PA 指标。评估 PA 对健康状况影响的研究数量大大增加。在我们的综述中,回归型模型最为普遍(373/428,87.1%)。分类研究中的机器学习方法也越来越受欢迎(32/75,43%)。除了 PA 汇总统计数据外,最近的几项研究还使用了工具来纳入 PA 轨迹并考虑时间模式,包括具有重复 PA 测量的纵向数据分析和具有 PA 作为时变关联连续体的功能数据分析(68/428,15.9%)。
总结指标可以快速提供个体整体 PA 强度、频率和持续时间的描述。当需要评估或检测 PA 的分布和特征时,将 PA 指标视为纵向或功能数据可以提供详细信息,并提高对 PA 在健康中的作用的理解。根据研究目标,适当的分析工具可以确保科学发现的可靠性。