Khalil Rana M, Shulman Lisa M, Gruber-Baldini Ann L, Shakya Sunita, Hausdorff Jeffrey M, von Coelln Rainer, Cummings Michael P
Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA.
Department of Neurology, University of Maryland School of Medicine, Baltimore, MD 21201, USA.
Sensors (Basel). 2024 Dec 19;24(24):8096. doi: 10.3390/s24248096.
Mobility tasks like the Timed Up and Go test (TUG), cognitive TUG (cogTUG), and walking with turns provide insights into the impact of Parkinson's disease (PD) on motor control, balance, and cognitive function. We assess the test-retest reliability of these tasks in 262 PD participants and 50 controls by evaluating machine learning models based on wearable-sensor-derived measures and statistical metrics. This evaluation examines total duration, subtask duration, and other quantitative measures across two trials. We show that the diagnostic accuracy for distinguishing PD from controls decreases by a mean of 1.8% between the first and the second trial, suggesting that task repetition may not be necessary for accurate diagnosis. Although the total duration remains relatively consistent between trials (intraclass correlation coefficient (ICC) = 0.62 to 0.95), greater variability is seen in subtask duration and sensor-derived measures, reflected in machine learning performance and statistical differences. Our findings also show that this variability differs not only between controls and PD participants but also among groups with varying levels of PD severity, indicating the need to consider population characteristics. Relying solely on total task duration and conventional statistical metrics to gauge the reliability of mobility tasks may fail to reveal nuanced variations in movement.
诸如定时起立行走测试(TUG)、认知TUG(cogTUG)以及转弯行走等移动任务,能让我们深入了解帕金森病(PD)对运动控制、平衡和认知功能的影响。我们通过评估基于可穿戴传感器测量数据和统计指标的机器学习模型,对262名帕金森病患者和50名对照者进行这些任务的重测信度评估。此次评估考察了两次测试中的总时长、子任务时长以及其他量化指标。我们发现,在第一次和第二次测试之间,区分帕金森病患者与对照者的诊断准确率平均下降了1.8%,这表明准确诊断可能无需重复任务。尽管两次测试之间总时长相对一致(组内相关系数(ICC)=0.62至0.95),但子任务时长和传感器测量数据的变异性更大,这在机器学习性能和统计差异中有所体现。我们的研究结果还表明,这种变异性不仅在对照者和帕金森病患者之间存在差异,在帕金森病严重程度不同的组之间也存在差异,这表明需要考虑人群特征。仅依靠总任务时长和传统统计指标来衡量移动任务的可靠性,可能无法揭示运动中细微的变化。