Chee Nicholas I Y N, Ghorbani Shohreh, Golkashani Hosein Aghayan, Leong Ruth L F, Ong Ju Lynn, Chee Michael W L
Centre for Sleep and Cognition, Yong Loo Lin School of Medicine, National University of Singapore, Singapore.
Nat Sci Sleep. 2021 Feb 15;13:177-190. doi: 10.2147/NSS.S286070. eCollection 2021.
Wearable devices have tremendous potential for large-scale longitudinal measurement of sleep, but their accuracy needs to be validated. We compared the performance of the multisensor Oura ring (Oura Health Oy, Oulu, Finland) to polysomnography (PSG) and a research actigraph in healthy adolescents.
Fifty-three adolescents (28 females; aged 15-19 years) underwent overnight PSG monitoring while wearing both an Oura ring and Actiwatch 2 (Philips Respironics, USA). Measurements were made over multiple nights and across three levels of sleep opportunity (5 nights with either 6.5 or 8h, and 3 nights with 9h). Actiwatch data at two sensitivity settings were analyzed. Discrepancies in estimated sleep measures as well as sleep-wake, and sleep stage agreements were evaluated using Bland-Altman plots and epoch-by-epoch (EBE) analyses.
Compared with PSG, Oura consistently underestimated TST by an average of 32.8 to 47.3 minutes (s < 0.001) across the different TIB conditions; Actiwatch 2 at its default setting underestimated TST by 25.8 to 33.9 minutes. Oura significantly overestimated WASO by an average of 30.7 to 46.3 minutes. It was comparable to Actiwatch 2 at default sensitivity in the 6.5, and 8h TIB conditions. Relative to PSG, Oura significantly underestimated REM sleep (12.8 to 19.5 minutes) and light sleep (51.1 to 81.2 minutes) but overestimated N3 by 31.5 to 46.8 minutes (s < 0.01). EBE analyses demonstrated excellent sleep-wake accuracies, specificities, and sensitivities - between 0.88 and 0.89 across all TIBs.
The Oura ring yielded comparable sleep measurement to research grade actigraphy at the latter's default settings. Sleep staging needs improvement. However, the device appears adequate for characterizing the effect of sleep duration manipulation on adolescent sleep macro-architecture.
可穿戴设备在大规模纵向睡眠测量方面具有巨大潜力,但其准确性有待验证。我们比较了多传感器Oura戒指(芬兰奥卢市Oura Health Oy公司)与多导睡眠图(PSG)以及健康青少年研究用活动记录仪的性能。
53名青少年(28名女性;年龄15 - 19岁)在佩戴Oura戒指和Actiwatch 2(美国飞利浦伟康公司)的同时接受整夜PSG监测。测量在多个夜晚进行,涵盖三种睡眠机会水平(6.5小时或8小时的5个夜晚,以及9小时的3个夜晚)。分析了Actiwatch在两种灵敏度设置下的数据。使用Bland - Altman图和逐时段(EBE)分析评估估计睡眠指标以及睡眠 - 觉醒和睡眠阶段一致性的差异。
与PSG相比,在不同的卧床时间(TIB)条件下,Oura始终低估总睡眠时间(TST),平均低估32.8至47.3分钟(s < 0.001);Actiwatch 2在默认设置下低估TST 25.8至33.9分钟。Oura显著高估清醒时间(WASO),平均高估30.7至46.3分钟。在6.5小时和8小时的TIB条件下,其在默认灵敏度时与Actiwatch 2相当。相对于PSG,Oura显著低估快速眼动睡眠(12.8至19.5分钟)和浅睡眠(51.1至81.2分钟),但高估N3睡眠31.5至46.8分钟(s < 0.01)。EBE分析显示在所有TIB条件下睡眠 - 觉醒的准确性、特异性和敏感性都非常出色,在0.88至0.89之间。
Oura戒指在研究级活动记录仪的默认设置下产生了可比的睡眠测量结果。睡眠分期需要改进。然而,该设备似乎足以表征睡眠时间操纵对青少年睡眠宏观结构的影响。