Center for Health Sciences, SRI International, Menlo Park, California, USA.
Department of General Psychology, University of Padova, Padova, Italy.
Chronobiol Int. 2021 Jul;38(7):1010-1022. doi: 10.1080/07420528.2021.1903481. Epub 2021 Apr 1.
We evaluated the performance of Fitbit Charge 3™ (FC3), a multi-sensor commercial sleep-tracker, for measuring sleep in adolescents against gold-standard laboratory polysomnography (PSG). Single-night PSG and FC3 sleep outcomes were compared in thirty-nine adolescents (22 girls; 16-19 years), 12 of whom presented with clinical/subclinical DSM-5 insomnia symptoms (7 girls). Discrepancy analysis, Bland-Altman plots, and epoch-by-epoch analyses were used to evaluate FC3 performance. The influence of several factors potentially affecting FC3 performance (e.g., sex, age, body mass index, firmware version, and magnitude of heart rate changes between consecutive PSG epochs) was also tested. In the sample of healthy adolescents, FC3 systematically underestimated PSG total sleep time by about 11 min and sleep efficiency by 2.5%, and overestimated wake after sleep onset by 9 min. Proportional biases were detected for "light" and "deep" sleep duration, resulting in significant underestimation of these parameters for those participants having longer PSG N1+ N2 and N3 durations, respectively. No significant systematic bias was detected for sleep efficiency and sleep onset latency. Epoch-by-epoch analysis showed sleep-stage sensitivity (average proportion of PSG epochs correctly classified by the device for a given sleep stage) of 68% for wake, 78% for "light" sleep, 59% for "deep" sleep, and 69% for rapid eye movement (REM) sleep in healthy sleepers. Similar results were found in the sample of adolescents with insomnia symptoms. Body mass index was positively associated with FC3-PSG discrepancies in wake after sleep onset (R = .16, = .048). The magnitude of the heart rate acceleration/deceleration between consecutive PSG epochs was an important factor affecting FC3 classifications of sleep stages. Our results are in line with a general trend in the literature, suggesting better performance for the recently introduced multi-sensor devices compared to motion-only devices, although further developments are needed to improve accuracy in sleep stage classification and wake detection. Further insight is needed to determine factors potentially affecting device performance, such as accuracy and reliability (consistency of performance over time), in different samples and conditions.
我们评估了 Fitbit Charge 3™(FC3)作为一款多传感器商业睡眠追踪器,在测量青少年睡眠方面的表现,以对比金标准实验室多导睡眠图(PSG)。我们比较了 39 名青少年(22 名女性;16-19 岁)的单晚 PSG 和 FC3 睡眠结果,其中 12 名患有临床/亚临床 DSM-5 失眠症状(7 名女性)。我们使用差异分析、Bland-Altman 图和逐时分析来评估 FC3 的性能。还测试了几个可能影响 FC3 性能的因素(例如,性别、年龄、体重指数、固件版本以及连续 PSG 时相中心率变化的幅度)。在健康青少年样本中,FC3 系统地低估了 PSG 总睡眠时间约 11 分钟,睡眠效率降低 2.5%,并高估了睡眠后觉醒时间 9 分钟。对于“浅”睡眠和“深”睡眠时间,检测到比例偏差,导致这些参数对于 PSG N1+N2 和 N3 持续时间较长的参与者显著低估。对于睡眠效率和睡眠潜伏期,未检测到显著的系统偏差。逐时分析显示,对于健康睡眠者,清醒状态的睡眠分期敏感性(设备对特定睡眠分期的 PSG 时相的正确分类比例)平均为 68%,“浅”睡眠为 78%,“深”睡眠为 59%,快速眼动(REM)睡眠为 69%。在患有失眠症状的青少年样本中也发现了类似的结果。体重指数与睡眠后觉醒时间的 FC3-PSG 差异呈正相关(R =.16, =.048)。连续 PSG 时相中心率加速/减速的幅度是影响 FC3 睡眠分期分类的一个重要因素。我们的结果与文献中的一般趋势一致,表明与仅运动的设备相比,最近推出的多传感器设备具有更好的性能,尽管需要进一步的发展来提高睡眠分期分类和唤醒检测的准确性。需要进一步的研究来确定可能影响设备性能的因素,例如准确性和可靠性(随着时间的推移性能的一致性),在不同的样本和条件下。