Multidisciplinary Sleep Disorders Centre, Antwerp University Hospital, Edegem, Belgium.
Laboratory of Experimental Medicine and Pediatrics, University of Antwerp, Wilrijk, Belgium.
JMIR Mhealth Uhealth. 2024 Mar 27;12:e52192. doi: 10.2196/52192.
Despite being the gold-standard method for objectively assessing sleep, polysomnography (PSG) faces several limitations as it is expensive, time-consuming, and labor-intensive; requires various equipment and technical expertise; and is impractical for long-term or in-home use. Consumer wrist-worn wearables are able to monitor sleep parameters and thus could be used as an alternative for PSG. Consequently, wearables gained immense popularity over the past few years, but their accuracy has been a major concern.
A systematic review of the literature was conducted to appraise the performance of 3 recent-generation wearable devices (Fitbit Charge 4, Garmin Vivosmart 4, and WHOOP) in determining sleep parameters and sleep stages.
Per the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) statement, a comprehensive search was conducted using the PubMed, Web of Science, Google Scholar, Scopus, and Embase databases. Eligible publications were those that (1) involved the validity of sleep data of any marketed model of the candidate wearables and (2) used PSG or an ambulatory electroencephalogram monitor as a reference sleep monitoring device. Exclusion criteria were as follows: (1) incorporated a sleep diary or survey method as a reference, (2) review paper, (3) children as participants, and (4) duplicate publication of the same data and findings.
The search yielded 504 candidate articles. After eliminating duplicates and applying the eligibility criteria, 8 articles were included. WHOOP showed the least disagreement relative to PSG and Sleep Profiler for total sleep time (-1.4 min), light sleep (-9.6 min), and deep sleep (-9.3 min) but showed the largest disagreement for rapid eye movement (REM) sleep (21.0 min). Fitbit Charge 4 and Garmin Vivosmart 4 both showed moderate accuracy in assessing sleep stages and total sleep time compared to PSG. Fitbit Charge 4 showed the least disagreement for REM sleep (4.0 min) relative to PSG. Additionally, Fitbit Charge 4 showed higher sensitivities to deep sleep (75%) and REM sleep (86.5%) compared to Garmin Vivosmart 4 and WHOOP.
The findings of this systematic literature review indicate that the devices with higher relative agreement and sensitivities to multistate sleep (ie, Fitbit Charge 4 and WHOOP) seem appropriate for deriving suitable estimates of sleep parameters. However, analyses regarding the multistate categorization of sleep indicate that all devices can benefit from further improvement in the assessment of specific sleep stages. Although providers are continuously developing new versions and variants of wearables, the scientific research on these wearables remains considerably limited. This scarcity in literature not only reduces our ability to draw definitive conclusions but also highlights the need for more targeted research in this domain. Additionally, future research endeavors should strive for standardized protocols including larger sample sizes to enhance the comparability and power of the results across studies.
尽管多导睡眠图(PSG)是客观评估睡眠的金标准方法,但它存在一些局限性,因为它昂贵、耗时且劳动强度大;需要各种设备和技术专业知识;并且不适合长期或家庭使用。消费者腕戴可穿戴设备能够监测睡眠参数,因此可以作为 PSG 的替代方法。因此,可穿戴设备在过去几年中获得了极大的普及,但准确性一直是一个主要关注点。
对文献进行系统评价,评估 3 款新一代可穿戴设备(Fitbit Charge 4、Garmin Vivosmart 4 和 WHOOP)在确定睡眠参数和睡眠阶段方面的性能。
根据 PRISMA(系统评价和荟萃分析的首选报告项目)声明,使用 PubMed、Web of Science、Google Scholar、Scopus 和 Embase 数据库进行全面搜索。符合条件的出版物包括:(1)涉及候选可穿戴设备任何市售型号的睡眠数据有效性,以及(2)使用 PSG 或动态脑电图监测仪作为参考睡眠监测设备。排除标准如下:(1)包含睡眠日记或调查方法作为参考,(2)综述文章,(3)参与者为儿童,以及(4)同一数据和发现的重复发表。
搜索产生了 504 篇候选文章。在消除重复项并应用合格标准后,纳入了 8 篇文章。与 PSG 和 Sleep Profiler 相比,WHOOP 显示出总睡眠时间(-1.4 分钟)、浅睡(-9.6 分钟)和深睡(-9.3 分钟)的差异最小,但 REM 睡眠(21.0 分钟)的差异最大。Fitbit Charge 4 和 Garmin Vivosmart 4 与 PSG 相比,在评估睡眠阶段和总睡眠时间方面均显示出中等准确性。与 PSG 相比,Fitbit Charge 4 显示出 REM 睡眠(4.0 分钟)的差异最小。此外,与 Garmin Vivosmart 4 和 WHOOP 相比,Fitbit Charge 4 对深睡(75%)和 REM 睡眠(86.5%)的敏感性更高。
本系统文献综述的结果表明,与多状态睡眠具有更高相对一致性和敏感性的设备(即 Fitbit Charge 4 和 WHOOP)似乎适合推导出睡眠参数的合适估计值。然而,关于睡眠多状态分类的分析表明,所有设备都可以从特定睡眠阶段评估的进一步改进中受益。尽管提供者不断开发可穿戴设备的新版本和变体,但关于这些可穿戴设备的科学研究仍然相当有限。这种文献的缺乏不仅降低了我们得出明确结论的能力,而且突出了在该领域进行更有针对性研究的必要性。此外,未来的研究工作应努力制定包括更大样本量在内的标准化方案,以提高研究之间结果的可比性和效力。