Ramanathan Arvind, Pullum Laura L, Hobson Tanner C, Steed Chad A, Quinn Shannon P, Chennubhotla Chakra S, Valkova Silvia
BMC Bioinformatics. 2015;16 Suppl 17(Suppl 17):S4. doi: 10.1186/1471-2105-16-S17-S4. Epub 2015 Dec 7.
The digitization of health-related information through electronic health records (EHR) and electronic healthcare reimbursement claims and the continued growth of self-reported health information through social media provides both tremendous opportunities and challenges in developing effective biosurveillance tools. With novel emerging infectious diseases being reported across different parts of the world, there is a need to build systems that can track, monitor and report such events in a timely manner. Further, it is also important to identify susceptible geographic regions and populations where emerging diseases may have a significant impact.
In this paper, we present an overview of Oak Ridge Biosurveillance Toolkit (ORBiT), which we have developed specifically to address data analytic challenges in the realm of public health surveillance. In particular, ORBiT provides an extensible environment to pull together diverse, large-scale datasets and analyze them to identify spatial and temporal patterns for various biosurveillance-related tasks.
We demonstrate the utility of ORBiT in automatically extracting a small number of spatial and temporal patterns during the 2009-2010 pandemic H1N1 flu season using claims data. These patterns provide quantitative insights into the dynamics of how the pandemic flu spread across different parts of the country. We discovered that the claims data exhibits multi-scale patterns from which we could identify a small number of states in the United States (US) that act as "bridge regions" contributing to one or more specific influenza spread patterns. Similar to previous studies, the patterns show that the south-eastern regions of the US were widely affected by the H1N1 flu pandemic. Several of these south-eastern states act as bridge regions, which connect the north-east and central US in terms of flu occurrences.
These quantitative insights show how the claims data combined with novel analytical techniques can provide important information to decision makers when an epidemic spreads throughout the country. Taken together ORBiT provides a scalable and extensible platform for public health surveillance.
通过电子健康记录(EHR)和电子医疗报销申请实现的与健康相关信息的数字化,以及通过社交媒体自我报告健康信息的持续增长,在开发有效的生物监测工具方面既带来了巨大机遇,也带来了挑战。随着世界各地不断报告新型传染病,有必要建立能够及时跟踪、监测和报告此类事件的系统。此外,识别新兴疾病可能产生重大影响的易感地理区域和人群也很重要。
在本文中,我们概述了橡树岭生物监测工具包(ORBiT),我们专门开发该工具包以应对公共卫生监测领域的数据分析挑战。特别是,ORBiT提供了一个可扩展的环境,用于整合各种大规模数据集并对其进行分析,以识别各种与生物监测相关任务的时空模式。
我们展示了ORBiT在利用报销申请数据自动提取2009 - 2010年甲型H1N1流感大流行季节期间的少量时空模式方面的效用。这些模式为大流行性流感在该国不同地区的传播动态提供了定量见解。我们发现报销申请数据呈现出多尺度模式,从中我们可以识别出美国的少数几个州,它们作为“桥梁区域”促成了一种或多种特定的流感传播模式。与先前的研究类似,这些模式表明美国东南部地区受到H1N1流感大流行的广泛影响。其中几个东南部州充当桥梁区域,在流感发生方面连接了美国东北部和中部地区。
这些定量见解表明,当疫情在全国蔓延时,报销申请数据与新颖的分析技术相结合如何能够为决策者提供重要信息。总体而言,ORBiT为公共卫生监测提供了一个可扩展且可延伸的平台。