Department of Neurology, University of Michigan, Ann Arbor, MI.
Department of Mathematics, Dartmouth College, Hanover, NH.
Sleep. 2019 Dec 24;42(12). doi: 10.1093/sleep/zsz180.
Wearable, multisensor, consumer devices that estimate sleep are now commonplace, but the algorithms used by these devices to score sleep are not open source, and the raw sensor data is rarely accessible for external use. As a result, these devices are limited in their usefulness for clinical and research applications, despite holding much promise. We used a mobile application of our own creation to collect raw acceleration data and heart rate from the Apple Watch worn by participants undergoing polysomnography, as well as during the ambulatory period preceding in lab testing. Using this data, we compared the contributions of multiple features (motion, local standard deviation in heart rate, and "clock proxy") to performance across several classifiers. Best performance was achieved using neural nets, though the differences across classifiers were generally small. For sleep-wake classification, our method scored 90% of epochs correctly, with 59.6% of true wake epochs (specificity) and 93% of true sleep epochs (sensitivity) scored correctly. Accuracy for differentiating wake, NREM sleep, and REM sleep was approximately 72% when all features were used. We generalized our results by testing the models trained on Apple Watch data using data from the Multi-ethnic Study of Atherosclerosis (MESA), and found that we were able to predict sleep with performance comparable to testing on our own dataset. This study demonstrates, for the first time, the ability to analyze raw acceleration and heart rate data from a ubiquitous wearable device with accepted, disclosed mathematical methods to improve accuracy of sleep and sleep stage prediction.
现在,可穿戴、多传感器、用于估计睡眠的消费类设备已经很常见,但这些设备用于评分睡眠的算法不是开源的,并且很少可以访问原始传感器数据以供外部使用。因此,尽管这些设备有很大的应用前景,但它们在临床和研究应用中的实用性有限。我们使用自己创建的移动应用程序从接受多导睡眠图检查的参与者佩戴的 Apple Watch 以及在实验室测试前的日间期间收集原始加速度数据和心率数据。使用这些数据,我们比较了多个特征(运动、心率的局部标准差和“时钟代理”)对多个分类器性能的贡献。使用神经网络可以实现最佳性能,尽管分类器之间的差异通常很小。对于睡眠-觉醒分类,我们的方法正确分类了 90%的时段,正确分类了 59.6%的真正觉醒时段(特异性)和 93%的真正睡眠时段(敏感性)。当使用所有特征时,区分清醒、非快速眼动睡眠和快速眼动睡眠的准确性约为 72%。我们通过使用来自多民族动脉粥样硬化研究(MESA)的数据测试在 Apple Watch 数据上训练的模型来推广我们的结果,并发现我们能够使用与在自己的数据集上测试相当的性能来预测睡眠。这项研究首次证明了使用可接受的、公开的数学方法分析无处不在的可穿戴设备的原始加速度和心率数据的能力,从而提高了睡眠和睡眠阶段预测的准确性。