Ghorbani Shohreh, Golkashani Hosein Aghayan, Chee Nicholas I Y N, Teo Teck Boon, Dicom Andrew Roshan, Yilmaz Gizem, Leong Ruth L F, Ong Ju Lynn, Chee Michael W L
Centre for Sleep and Cognition, Yong Loo Lin School of Medicine, National University of Singapore, Singapore.
Nat Sci Sleep. 2022 Apr 14;14:645-660. doi: 10.2147/NSS.S359789. eCollection 2022.
To evaluate the benefits of applying an improved sleep detection and staging algorithm on minimally processed multi-sensor wearable data collected from older generation hardware.
58 healthy, East Asian adults aged 23-69 years (M = 37.10, SD = 13.03, 32 males), each underwent 3 nights of PSG at home, wearing 2 Generation Oura Rings equipped with additional memory to store raw data from accelerometer, infra-red photoplethysmography and temperature sensors. 2-stage and 4-stage sleep classifications using a new machine-learning algorithm (Gen3) trained on a diverse and independent dataset were compared to the existing consumer algorithm (Gen2) for whole-night and epoch-by-epoch metrics.
Gen 3 outperformed its predecessor with a mean (SD) accuracy of 92.6% (0.04), sensitivity of 94.9% (0.03), and specificity of 78.5% (0.11); corresponding to a 3%, 2.8% and 6.2% improvement from Gen2 across the three nights, with Cohen's d values >0.39, t values >2.69, and p values <0.01. Notably, Gen 3 showed robust performance comparable to PSG in its assessment of sleep latency, light sleep, rapid eye movement (REM), and wake after sleep onset (WASO) duration. Participants <40 years of age benefited more from the upgrade with less measurement bias for total sleep time (TST), WASO, light sleep and sleep efficiency compared to those ≥40 years. Males showed greater improvements on TST and REM sleep measurement bias compared to females, while females benefitted more for deep sleep measures compared to males.
These results affirm the benefits of applying machine learning and a diverse training dataset to improve sleep measurement of a consumer wearable device. Importantly, collecting raw data with appropriate hardware allows for future advancements in algorithm development or sleep physiology to be retrospectively applied to enhance the value of longitudinal sleep studies.
评估将改进的睡眠检测与分期算法应用于从旧一代硬件收集的最少处理的多传感器可穿戴数据的益处。
58名年龄在23 - 69岁之间的健康东亚成年人(M = 37.10,SD = 13.03,男性32名),每人在家中进行3晚的多导睡眠图(PSG)监测,佩戴2个配备额外内存以存储来自加速度计、红外光电容积脉搏波描记术和温度传感器原始数据的初代欧若环。使用在多样且独立的数据集上训练的新机器学习算法(Gen3)进行的两阶段和四阶段睡眠分类,与现有的消费者算法(Gen2)在全夜和逐段指标上进行比较。
Gen3的表现优于其前身,平均(标准差)准确率为92.6%(0.04),灵敏度为94.9%(0.03),特异性为78.5%(0.11);在三个晚上分别比Gen2提高了3%、2.8%和6.2%,科恩d值>0.39,t值>2.69,p值<0.01。值得注意的是,Gen3在评估睡眠潜伏期、浅睡眠、快速眼动(REM)和睡眠中觉醒后(WASO)时长方面表现出与PSG相当的稳健性能。与40岁及以上的参与者相比,40岁以下的参与者从升级中受益更多,在总睡眠时间(TST)、WASO、浅睡眠和睡眠效率方面的测量偏差更小。与女性相比,男性在TST和REM睡眠测量偏差方面有更大改善,而与男性相比,女性在深度睡眠测量方面受益更多。
这些结果证实了应用机器学习和多样的训练数据集来改善消费者可穿戴设备睡眠测量的益处。重要的是,使用适当的硬件收集原始数据有助于未来算法开发或睡眠生理学的进展能够被追溯应用,以提高纵向睡眠研究的价值。