Ma Qianheng, Dunton Genevieve F, Hedeker Donald
Department of Psychiatry and Behavioral Sciences, Stanford University, Palo Alto, CA 94304, United States.
Departments of Preventive Medicine and Psychology, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, United States.
Biometrics. 2025 Jul 3;81(3). doi: 10.1093/biomtc/ujaf099.
In recent years, the use of wearable devices, for example, accelerometers, have become increasingly prevalent. Wearable devices enable more accurate real-time tracking of a subject's physical activity (PA) level, such as steps, number of activity bouts, or time in moderate-to-vigorous intensity PA (MVPA), which are important general health markers and can often be represented as counts. These intensive within-subject count data provided by wearable devices, for example, minutes in MVPA summarized per hour across days and even months, allow the possibility for modeling not only the mean PA level, but also the dispersion level for each subject. Especially in the context of daily PA, subjects' dispersion levels are potentially informative in reflecting their exercise patterns: some subjects might exhibit consistent PA across time and can be considered "less dispersed" subjects; while others might have a large amount of PA at a particular time point, while being sedentary for most of the day, and can be considered "more dispersed" subjects. Thus, we propose a negative binomial mixed effects location-scale model to model these intensive longitudinal PA counts and to account for the heterogeneity in both the mean and dispersion level across subjects. Further, to handle the issue of inflated numbers of zeros in the PA data, we also propose a hurdle/zero-inflated version which additionally includes the modeling of the probability of having $>$0 PA levels.
近年来,可穿戴设备(例如加速度计)的使用越来越普遍。可穿戴设备能够更准确地实时跟踪受试者的身体活动(PA)水平,如步数、活动次数或中度至剧烈强度身体活动(MVPA)的时间,这些都是重要的总体健康指标,通常可以表示为计数。可穿戴设备提供的这些密集的个体内计数数据,例如,跨天甚至数月每小时汇总的MVPA分钟数,不仅使建模每个受试者的平均PA水平成为可能,而且使建模其离散程度成为可能。特别是在日常PA的背景下,受试者的离散程度在反映他们的运动模式方面可能具有信息价值:一些受试者可能在不同时间表现出一致的PA水平,可以被视为“离散程度较低”的受试者;而另一些受试者可能在特定时间点有大量的PA,而一天中的大部分时间都久坐不动,可以被视为“离散程度较高”的受试者。因此,我们提出了一个负二项混合效应位置尺度模型,以对这些密集的纵向PA计数进行建模,并考虑受试者之间平均水平和离散程度的异质性。此外,为了解决PA数据中零值过多的问题,我们还提出了一个障碍/零膨胀版本,该版本还包括对PA水平大于0的概率进行建模。