Suppr超能文献

利用新型互联网数据流进行准确的流感监测与预测:以波士顿都会区为例

Accurate Influenza Monitoring and Forecasting Using Novel Internet Data Streams: A Case Study in the Boston Metropolis.

作者信息

Lu Fred Sun, Hou Suqin, Baltrusaitis Kristin, Shah Manan, Leskovec Jure, Sosic Rok, Hawkins Jared, Brownstein John, Conidi Giuseppe, Gunn Julia, Gray Josh, Zink Anna, Santillana Mauricio

机构信息

Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, United States.

Harvard Chan School of Public Health, Harvard University, Boston, MA, United States.

出版信息

JMIR Public Health Surveill. 2018 Jan 9;4(1):e4. doi: 10.2196/publichealth.8950.

Abstract

BACKGROUND

Influenza outbreaks pose major challenges to public health around the world, leading to thousands of deaths a year in the United States alone. Accurate systems that track influenza activity at the city level are necessary to provide actionable information that can be used for clinical, hospital, and community outbreak preparation.

OBJECTIVE

Although Internet-based real-time data sources such as Google searches and tweets have been successfully used to produce influenza activity estimates ahead of traditional health care-based systems at national and state levels, influenza tracking and forecasting at finer spatial resolutions, such as the city level, remain an open question. Our study aimed to present a precise, near real-time methodology capable of producing influenza estimates ahead of those collected and published by the Boston Public Health Commission (BPHC) for the Boston metropolitan area. This approach has great potential to be extended to other cities with access to similar data sources.

METHODS

We first tested the ability of Google searches, Twitter posts, electronic health records, and a crowd-sourced influenza reporting system to detect influenza activity in the Boston metropolis separately. We then adapted a multivariate dynamic regression method named ARGO (autoregression with general online information), designed for tracking influenza at the national level, and showed that it effectively uses the above data sources to monitor and forecast influenza at the city level 1 week ahead of the current date. Finally, we presented an ensemble-based approach capable of combining information from models based on multiple data sources to more robustly nowcast as well as forecast influenza activity in the Boston metropolitan area. The performances of our models were evaluated in an out-of-sample fashion over 4 influenza seasons within 2012-2016, as well as a holdout validation period from 2016 to 2017.

RESULTS

Our ensemble-based methods incorporating information from diverse models based on multiple data sources, including ARGO, produced the most robust and accurate results. The observed Pearson correlations between our out-of-sample flu activity estimates and those historically reported by the BPHC were 0.98 in nowcasting influenza and 0.94 in forecasting influenza 1 week ahead of the current date.

CONCLUSIONS

We show that information from Internet-based data sources, when combined using an informed, robust methodology, can be effectively used as early indicators of influenza activity at fine geographic resolutions.

摘要

背景

流感爆发给全球公共卫生带来重大挑战,仅在美国每年就导致数千人死亡。准确的城市层面流感活动追踪系统对于提供可用于临床、医院及社区疫情防控准备的可操作信息至关重要。

目的

尽管基于互联网的实时数据源(如谷歌搜索和推文)已成功用于在国家和州层面的传统医疗保健系统之前进行流感活动估计,但在更精细的空间分辨率(如城市层面)上进行流感追踪和预测仍是一个悬而未决的问题。我们的研究旨在提出一种精确的、近乎实时的方法,能够在波士顿公共卫生委员会(BPHC)收集和发布波士顿大都市区的数据之前进行流感估计。这种方法有很大潜力扩展到其他可获取类似数据源的城市。

方法

我们首先分别测试了谷歌搜索、推特帖子、电子健康记录和一个众包流感报告系统检测波士顿大都市流感活动的能力。然后我们采用了一种名为ARGO(带一般在线信息的自回归)的多变量动态回归方法,该方法专为国家层面的流感追踪而设计,并表明它能有效利用上述数据源在当前日期前1周对城市层面的流感进行监测和预测。最后,我们提出了一种基于集成的方法,能够结合来自基于多个数据源的模型的信息,以更稳健地进行实时流感预测以及预测波士顿大都市区的流感活动。我们模型的性能在2012 - 2016年的4个流感季节以及2016 - 2017年的一个验证期内以样本外的方式进行了评估。

结果

我们基于集成的方法结合了来自包括ARGO在内的基于多个数据源的不同模型的信息,产生了最稳健和准确的结果。我们样本外流感活动估计与BPHC历史报告之间的观察到的皮尔逊相关性在实时流感预测中为0.98,在当前日期前1周的流感预测中为0.94。

结论

我们表明,当使用明智、稳健的方法进行组合时,基于互联网数据源的信息可有效用作精细地理分辨率下流感活动的早期指标。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f282/5780615/dddfe0323ed6/publichealth_v4i1e4_fig1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验