Dean Dennis A, Goldberger Ary L, Mueller Remo, Kim Matthew, Rueschman Michael, Mobley Daniel, Sahoo Satya S, Jayapandian Catherine P, Cui Licong, Morrical Michael G, Surovec Susan, Zhang Guo-Qiang, Redline Susan
Brigham and Women's Hospital, Boston, MA.
Harvard Medical School, Boston, MA.
Sleep. 2016 May 1;39(5):1151-64. doi: 10.5665/sleep.5774.
Professional sleep societies have identified a need for strategic research in multiple areas that may benefit from access to and aggregation of large, multidimensional datasets. Technological advances provide opportunities to extract and analyze physiological signals and other biomedical information from datasets of unprecedented size, heterogeneity, and complexity. The National Institutes of Health has implemented a Big Data to Knowledge (BD2K) initiative that aims to develop and disseminate state of the art big data access tools and analytical methods. The National Sleep Research Resource (NSRR) is a new National Heart, Lung, and Blood Institute resource designed to provide big data resources to the sleep research community. The NSRR is a web-based data portal that aggregates, harmonizes, and organizes sleep and clinical data from thousands of individuals studied as part of cohort studies or clinical trials and provides the user a suite of tools to facilitate data exploration and data visualization. Each deidentified study record minimally includes the summary results of an overnight sleep study; annotation files with scored events; the raw physiological signals from the sleep record; and available clinical and physiological data. NSRR is designed to be interoperable with other public data resources such as the Biologic Specimen and Data Repository Information Coordinating Center Demographics (BioLINCC) data and analyzed with methods provided by the Research Resource for Complex Physiological Signals (PhysioNet). This article reviews the key objectives, challenges and operational solutions to addressing big data opportunities for sleep research in the context of the national sleep research agenda. It provides information to facilitate further interactions of the user community with NSRR, a community resource.
专业睡眠协会已经确定,在多个领域开展战略研究很有必要,这些领域可能会受益于获取和整合大型多维数据集。技术进步为从前所未有的规模、异质性和复杂性的数据集中提取和分析生理信号及其他生物医学信息提供了机会。美国国立卫生研究院实施了一项“大数据到知识”(BD2K)计划,旨在开发和传播先进的大数据访问工具及分析方法。国家睡眠研究资源(NSRR)是美国国立心肺血液研究所的一项新资源,旨在为睡眠研究界提供大数据资源。NSRR是一个基于网络的数据门户,它汇总、协调并整理来自数千名作为队列研究或临床试验一部分进行研究的个体的睡眠和临床数据,并为用户提供一套工具以促进数据探索和数据可视化。每条经过去标识化处理的研究记录至少包括一项夜间睡眠研究的总结结果;带有评分事件的注释文件;睡眠记录中的原始生理信号;以及可用的临床和生理数据。NSRR旨在与其他公共数据资源(如生物样本和数据存储库信息协调中心人口统计学(BioLINCC)数据)实现互操作,并使用复杂生理信号研究资源(PhysioNet)提供的方法进行分析。本文回顾了在国家睡眠研究议程背景下应对睡眠研究大数据机遇的关键目标、挑战和操作解决方案。它提供信息以促进用户社区与作为社区资源的NSRR的进一步互动。