Wakim Nicole I, Braun Thomas M, Kaye Jeffrey A, Dodge Hiroko H
Department of Biostatistics University of Michigan Ann Arbor Michigan USA.
Department of Neurology Oregon Health & Science University Portland Oregon USA.
Alzheimers Dement (N Y). 2020 Dec 18;6(1):e12094. doi: 10.1002/trc2.12094. eCollection 2020.
The use of digital biomarker data in dementia research provides the opportunity for frequent cognitive and functional assessments that was not previously available using conventional approaches. Assessing high-frequency digital biomarker data can potentially increase the opportunities for early detection of cognitive and functional decline because of improved precision of person-specific trajectories. However, we often face a decision to condense time-stamped data into a coarser time granularity, defined as the frequency at which measurements are observed or summarized, for statistical analyses. It is important to find a balance between ease of analysis by condensing data and the integrity of the data, which is reflected in a chosen time granularity.
In this paper, we discuss factors that need to be considered when faced with a time granularity decision. These factors include follow-up time, variables of interest, pattern detection, and signal-to-noise ratio.
We applied our procedure to real-world data which include longitudinal in-home monitored walking speed. The example shed lights on typical problems that data present and how we could use the above factors in exploratory analysis to choose an appropriate time granularity.
Further work is required to explore issues with missing data and computational efficiency.
在痴呆症研究中使用数字生物标志物数据,为频繁进行认知和功能评估提供了机会,而这是以前使用传统方法无法实现的。评估高频数字生物标志物数据可能会增加早期发现认知和功能衰退的机会,因为个体特定轨迹的精度有所提高。然而,在进行统计分析时,我们常常面临将带时间戳的数据浓缩为更粗时间粒度的决策,这里的时间粒度定义为观察或汇总测量值的频率。在通过浓缩数据便于分析与数据完整性之间找到平衡很重要,而这种平衡体现在所选的时间粒度上。
在本文中,我们讨论了面临时间粒度决策时需要考虑的因素。这些因素包括随访时间、感兴趣的变量、模式检测和信噪比。
我们将我们的方法应用于实际数据,这些数据包括纵向在家中监测的步行速度。该示例揭示了数据呈现的典型问题,以及我们如何在探索性分析中使用上述因素来选择合适的时间粒度。
需要进一步开展工作来探索缺失数据和计算效率方面的问题。