利用疾病预防控制中心网络流量数据监测流感发病率：使用新数据集进行的演示。

Surveilling Influenza Incidence With Centers for Disease Control and Prevention Web Traffic Data: Demonstration Using a Novel Dataset.

机构信息

X Computational Physics Division, Los Alamos National Laboratory, Los Alamos, NM, United States.

School of Mathematical and Statistical Sciences, Arizona State University, Tempe, AZ, United States.

出版信息

J Med Internet Res. 2020 Jul 3;22(7):e14337. doi: 10.2196/14337.

DOI:10.2196/14337

PMID:32437327

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7367534/

Abstract

BACKGROUND

Influenza epidemics result in a public health and economic burden worldwide. Traditional surveillance techniques, which rely on doctor visits, provide data with a delay of 1 to 2 weeks. A means of obtaining real-time data and forecasting future outbreaks is desirable to provide more timely responses to influenza epidemics.

OBJECTIVE

This study aimed to present the first implementation of a novel dataset by demonstrating its ability to supplement traditional disease surveillance at multiple spatial resolutions.

METHODS

We used internet traffic data from the Centers for Disease Control and Prevention (CDC) website to determine the potential usability of this data source. We tested the traffic generated by 10 influenza-related pages in 8 states and 9 census divisions within the United States and compared it against clinical surveillance data.

RESULTS

Our results yielded an r value of 0.955 in the most successful case, promising results for some cases, and unsuccessful results for other cases. In the interest of scientific transparency to further the understanding of when internet data streams are an appropriate supplemental data source, we also included negative results (ie, unsuccessful models). Models that focused on a single influenza season were more successful than those that attempted to model multiple influenza seasons. Geographic resolution appeared to play a key role, with national and regional models being more successful, overall, than models at the state level.

CONCLUSIONS

These results demonstrate that internet data may be able to complement traditional influenza surveillance in some cases but not in others. Specifically, our results show that the CDC website traffic may inform national- and division-level models but not models for each individual state. In addition, our results show better agreement when the data were broken up by seasons instead of aggregated over several years. We anticipate that this work will lead to more complex nowcasting and forecasting models using this data stream.

摘要

背景

流感疫情在全球范围内造成了公共卫生和经济负担。传统的监测技术依赖于医生就诊，数据延迟 1 至 2 周。获得实时数据并预测未来疫情的方法是及时应对流感疫情的理想选择。

目的

本研究旨在通过展示其补充传统疾病监测的能力，介绍一种新型数据集的首次实施，以多个空间分辨率呈现。

方法

我们使用疾病预防控制中心（CDC）网站的互联网流量数据来确定该数据源的潜在可用性。我们测试了美国 8 个州和 9 个普查区的 10 个与流感相关页面产生的流量，并将其与临床监测数据进行了比较。

结果

在最成功的情况下，我们的结果产生了 0.955 的 r 值，对于一些情况有很好的结果，而对于其他情况则没有成功。为了科学透明，进一步了解何时互联网数据流是适当的补充数据源，我们还包括了负面结果（即不成功的模型）。专注于单个流感季节的模型比试图模拟多个流感季节的模型更成功。地理分辨率似乎起着关键作用，总体而言，国家和地区模型比州级模型更成功。

结论

这些结果表明，互联网数据在某些情况下可能能够补充传统的流感监测，但在其他情况下则不行。具体来说，我们的结果表明，CDC 网站的流量可能会为国家和地区模型提供信息，但不能为每个州的模型提供信息。此外，当数据按季节划分而不是多年汇总时，我们的结果显示出更好的一致性。我们预计这项工作将使用此数据流带来更复杂的实时预测和预测模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c644/7367534/b7836ae9ab13/jmir_v22i7e14337_fig1.jpg

相似文献

Surveilling Influenza Incidence With Centers for Disease Control and Prevention Web Traffic Data: Demonstration Using a Novel Dataset.

J Med Internet Res. 2020 Jul 3;22(7):e14337. doi: 10.2196/14337.

Use of daily Internet search query data improves real-time projections of influenza epidemics.

J R Soc Interface. 2018 Oct 10;15(147):20180220. doi: 10.1098/rsif.2018.0220.

Accuracy of real-time multi-model ensemble forecasts for seasonal influenza in the U.S.

PLoS Comput Biol. 2019 Nov 22;15(11):e1007486. doi: 10.1371/journal.pcbi.1007486. eCollection 2019 Nov.

Using electronic health records and Internet search information for accurate influenza forecasting.

BMC Infect Dis. 2017 May 8;17(1):332. doi: 10.1186/s12879-017-2424-7.

Accurate influenza forecasts using type-specific incidence data for small geographic units.

PLoS Comput Biol. 2021 Jul 29;17(7):e1009230. doi: 10.1371/journal.pcbi.1009230. eCollection 2021 Jul.

Use Internet search data to accurately track state level influenza epidemics.

Sci Rep. 2021 Feb 17;11(1):4023. doi: 10.1038/s41598-021-83084-5.

Google Flu Trends Spatial Variability Validated Against Emergency Department Influenza-Related Visits.

J Med Internet Res. 2016 Jun 28;18(6):e175. doi: 10.2196/jmir.5585.

Optimal multi-source forecasting of seasonal influenza.

PLoS Comput Biol. 2018 Sep 4;14(9):e1006236. doi: 10.1371/journal.pcbi.1006236. eCollection 2018 Sep.

Forecasting the 2013-2014 influenza season using Wikipedia.

PLoS Comput Biol. 2015 May 14;11(5):e1004239. doi: 10.1371/journal.pcbi.1004239. eCollection 2015 May.

Applying infectious disease forecasting to public health: a path forward using influenza forecasting examples.

BMC Public Health. 2019 Dec 10;19(1):1659. doi: 10.1186/s12889-019-7966-8.

引用本文的文献

Prediction of influenza outbreaks in Fuzhou, China: comparative analysis of forecasting models.

BMC Public Health. 2024 May 25;24(1):1399. doi: 10.1186/s12889-024-18583-x.

Assessing health human resource structure at Urumqi's center for disease control and prevention.

Medicine (Baltimore). 2023 Dec 1;102(48):e36209. doi: 10.1097/MD.0000000000036209.

Influenza surveillance systems using traditional and alternative sources of data: A scoping review.

Influenza Other Respir Viruses. 2022 Nov;16(6):965-974. doi: 10.1111/irv.13037. Epub 2022 Sep 8.

Role of Participatory Health Informatics in Detecting and Managing Pandemics: Literature Review.

Yearb Med Inform. 2021 Aug;30(1):200-209. doi: 10.1055/s-0041-1726486. Epub 2021 Apr 21.

本文引用的文献

Comparison of crowd-sourced, electronic health records based, and traditional health-care based influenza-tracking systems at multiple spatial resolutions in the United States of America.

BMC Infect Dis. 2018 Aug 15;18(1):403. doi: 10.1186/s12879-018-3322-3.

Economic burden of seasonal influenza in the United States.

Vaccine. 2018 Jun 22;36(27):3960-3966. doi: 10.1016/j.vaccine.2018.05.057. Epub 2018 May 22.

Combining Participatory Influenza Surveillance with Modeling and Forecasting: Three Alternative Approaches.

JMIR Public Health Surveill. 2017 Nov 1;3(4):e83. doi: 10.2196/publichealth.7344.

Measuring Global Disease with Wikipedia: Success, Failure, and a Research Agenda.

CSCW Conf Comput Support Coop Work. 2017 Feb-Mar;2017:1812-1834. doi: 10.1145/2998181.2998183.

Results from the centers for disease control and prevention's predict the 2013-2014 Influenza Season Challenge.

BMC Infect Dis. 2016 Jul 22;16:357. doi: 10.1186/s12879-016-1669-x.

Accurate estimation of influenza epidemics using Google search data via ARGO.

Proc Natl Acad Sci U S A. 2015 Nov 24;112(47):14473-8. doi: 10.1073/pnas.1515373112. Epub 2015 Nov 9.

Combining Search, Social Media, and Traditional Data Sources to Improve Influenza Surveillance.

PLoS Comput Biol. 2015 Oct 29;11(10):e1004513. doi: 10.1371/journal.pcbi.1004513. eCollection 2015 Oct.

Advances in nowcasting influenza-like illness rates using search query logs.

Sci Rep. 2015 Aug 3;5:12760. doi: 10.1038/srep12760.

Forecasting the 2013-2014 influenza season using Wikipedia.

PLoS Comput Biol. 2015 May 14;11(5):e1004239. doi: 10.1371/journal.pcbi.1004239. eCollection 2015 May.

Twitter improves influenza forecasting.

PLoS Curr. 2014 Oct 28;6:ecurrents.outbreaks.90b9ed0f59bae4ccaa683a39865d9117. doi: 10.1371/currents.outbreaks.90b9ed0f59bae4ccaa683a39865d9117.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用疾病预防控制中心网络流量数据监测流感发病率：使用新数据集进行的演示。

Surveilling Influenza Incidence With Centers for Disease Control and Prevention Web Traffic Data: Demonstration Using a Novel Dataset.

机构信息

出版信息

BACKGROUND

OBJECTIVE

METHODS

RESULTS

CONCLUSIONS

背景

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献