Dean Dennis A, Adler Gail K, Nguyen David P, Klerman Elizabeth B
Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, Massachusetts, United States of America; Neuroscience Statistical Research Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America; Biomedical Engineering and Biotechnology Program, University of Massachusetts, Lowell, Massachusetts, United States of America; Harvard Medical School, Boston, Massachusetts, United States of America.
Harvard Medical School, Boston, Massachusetts, United States of America; Division of Endocrinology, Diabetes and Hypertension, Brigham and Women's Hospital, Boston, Massachusetts, United States of America.
PLoS One. 2014 Sep 3;9(9):e104087. doi: 10.1371/journal.pone.0104087. eCollection 2014.
We present a novel approach for analyzing biological time-series data using a context-free language (CFL) representation that allows the extraction and quantification of important features from the time-series. This representation results in Hierarchically AdaPtive (HAP) analysis, a suite of multiple complementary techniques that enable rapid analysis of data and does not require the user to set parameters. HAP analysis generates hierarchically organized parameter distributions that allow multi-scale components of the time-series to be quantified and includes a data analysis pipeline that applies recursive analyses to generate hierarchically organized results that extend traditional outcome measures such as pharmacokinetics and inter-pulse interval. Pulsicons, a novel text-based time-series representation also derived from the CFL approach, are introduced as an objective qualitative comparison nomenclature. We apply HAP to the analysis of 24 hours of frequently sampled pulsatile cortisol hormone data, which has known analysis challenges, from 14 healthy women. HAP analysis generated results in seconds and produced dozens of figures for each participant. The results quantify the observed qualitative features of cortisol data as a series of pulse clusters, each consisting of one or more embedded pulses, and identify two ultradian phenotypes in this dataset. HAP analysis is designed to be robust to individual differences and to missing data and may be applied to other pulsatile hormones. Future work can extend HAP analysis to other time-series data types, including oscillatory and other periodic physiological signals.
我们提出了一种新颖的方法,用于使用上下文无关语言(CFL)表示来分析生物时间序列数据,该表示允许从时间序列中提取和量化重要特征。这种表示产生了分层自适应(HAP)分析,这是一套多种互补技术,能够快速分析数据,并且不需要用户设置参数。HAP分析生成分层组织的参数分布,使时间序列的多尺度成分能够被量化,并且包括一个数据分析管道,该管道应用递归分析来生成分层组织的结果,扩展了传统的结果测量,如药代动力学和脉冲间期。脉冲图标(Pulsicons),一种同样源自CFL方法的基于文本的新颖时间序列表示,被引入作为一种客观的定性比较命名法。我们将HAP应用于对14名健康女性的24小时频繁采样的搏动性皮质醇激素数据的分析,该数据存在已知的分析挑战。HAP分析在数秒内生成结果,并为每个参与者生成了数十个图表。结果将皮质醇数据观察到的定性特征量化为一系列脉冲簇,每个脉冲簇由一个或多个嵌入脉冲组成,并在该数据集中识别出两种超日节律表型。HAP分析旨在对个体差异和缺失数据具有鲁棒性,并且可以应用于其他搏动性激素。未来的工作可以将HAP分析扩展到其他时间序列数据类型,包括振荡和其他周期性生理信号。