流感预警：通过谷歌流感趋势进行早期疫情检测。

FluBreaks: early epidemic detection from Google flu trends.

作者信息

Pervaiz Fahad, Pervaiz Mansoor, Abdur Rehman Nabeel, Saif Umar

机构信息

School of Science and Engineering, Computer Science Department, Lahore University of Management Sciences, Lahore, Pakistan.

出版信息

J Med Internet Res. 2012 Oct 4;14(5):e125. doi: 10.2196/jmir.2102.

DOI:10.2196/jmir.2102

PMID:23037553

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3510767/

Abstract

BACKGROUND

The Google Flu Trends service was launched in 2008 to track changes in the volume of online search queries related to flu-like symptoms. Over the last few years, the trend data produced by this service has shown a consistent relationship with the actual number of flu reports collected by the US Centers for Disease Control and Prevention (CDC), often identifying increases in flu cases weeks in advance of CDC records. However, contrary to popular belief, Google Flu Trends is not an early epidemic detection system. Instead, it is designed as a baseline indicator of the trend, or changes, in the number of disease cases.

OBJECTIVE

To evaluate whether these trends can be used as a basis for an early warning system for epidemics.

METHODS

We present the first detailed algorithmic analysis of how Google Flu Trends can be used as a basis for building a fully automated system for early warning of epidemics in advance of methods used by the CDC. Based on our work, we present a novel early epidemic detection system, called FluBreaks (dritte.org/flubreaks), based on Google Flu Trends data. We compared the accuracy and practicality of three types of algorithms: normal distribution algorithms, Poisson distribution algorithms, and negative binomial distribution algorithms. We explored the relative merits of these methods, and related our findings to changes in Internet penetration and population size for the regions in Google Flu Trends providing data.

RESULTS

Across our performance metrics of percentage true-positives (RTP), percentage false-positives (RFP), percentage overlap (OT), and percentage early alarms (EA), Poisson- and negative binomial-based algorithms performed better in all except RFP. Poisson-based algorithms had average values of 99%, 28%, 71%, and 76% for RTP, RFP, OT, and EA, respectively, whereas negative binomial-based algorithms had average values of 97.8%, 17.8%, 60%, and 55% for RTP, RFP, OT, and EA, respectively. Moreover, the EA was also affected by the region's population size. Regions with larger populations (regions 4 and 6) had higher values of EA than region 10 (which had the smallest population) for negative binomial- and Poisson-based algorithms. The difference was 12.5% and 13.5% on average in negative binomial- and Poisson-based algorithms, respectively.

CONCLUSIONS

We present the first detailed comparative analysis of popular early epidemic detection algorithms on Google Flu Trends data. We note that realizing this opportunity requires moving beyond the cumulative sum and historical limits method-based normal distribution approaches, traditionally employed by the CDC, to negative binomial- and Poisson-based algorithms to deal with potentially noisy search query data from regions with varying population and Internet penetrations. Based on our work, we have developed FluBreaks, an early warning system for flu epidemics using Google Flu Trends.

摘要

背景

谷歌流感趋势服务于2008年推出，用于追踪与流感样症状相关的在线搜索查询量的变化。在过去几年中，该服务生成的趋势数据与美国疾病控制与预防中心（CDC）收集的实际流感报告数量呈现出一致的关系，常常能在CDC记录之前数周就识别出流感病例的增加。然而，与普遍看法相反，谷歌流感趋势并非一个早期疫情检测系统。相反，它被设计为疾病病例数量趋势或变化的基线指标。

目的

评估这些趋势能否作为疫情早期预警系统的基础。

方法

我们首次详细分析了如何将谷歌流感趋势用作构建一个全自动疫情早期预警系统的基础，该系统比CDC所使用的方法更早。基于我们的工作，我们提出了一种基于谷歌流感趋势数据的新型早期疫情检测系统，名为FluBreaks（dritte.org/flubreaks）。我们比较了三种算法的准确性和实用性：正态分布算法、泊松分布算法和负二项分布算法。我们探讨了这些方法的相对优点，并将我们的发现与谷歌流感趋势提供数据的地区的互联网普及率和人口规模变化相关联。

结果

在我们的真阳性百分比（RTP）、假阳性百分比（RFP）、重叠百分比（OT）和早期警报百分比（EA）等性能指标方面，基于泊松和负二项分布的算法在除RFP之外的所有指标上表现更好。基于泊松的算法在RTP、RFP、OT和EA方面的平均值分别为99%、28%、71%和76%，而基于负二项分布的算法在RTP、RFP、OT和EA方面的平均值分别为97.8%、17.8%、60%和55%。此外，EA也受地区人口规模的影响。对于基于负二项分布和泊松的算法，人口较多的地区（地区4和6）的EA值高于地区10（人口最少）。基于负二项分布和泊松的算法的差异平均分别为12.5%和13.5%。

结论

我们首次对基于谷歌流感趋势数据的流行早期疫情检测算法进行了详细的比较分析。我们指出，要实现这一机遇，需要超越CDC传统采用的基于累积求和和历史极限方法的正态分布方法，转而采用基于负二项分布和泊松的算法，以处理来自人口和互联网普及率不同地区的潜在噪声搜索查询数据。基于我们的工作，我们开发了FluBreaks，这是一种利用谷歌流感趋势的流感疫情早期预警系统。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/24c1/3510767/af1cb1177cf9/jmir_v14i5e125_fig1.jpg

相似文献

FluBreaks: early epidemic detection from Google flu trends.流感预警：通过谷歌流感趋势进行早期疫情检测。

J Med Internet Res. 2012 Oct 4;14(5):e125. doi: 10.2196/jmir.2102.

Monitoring influenza activity in the United States: a comparison of traditional surveillance systems with Google Flu Trends.监测美国的流感活动：传统监测系统与谷歌流感趋势的比较。

PLoS One. 2011 Apr 27;6(4):e18687. doi: 10.1371/journal.pone.0018687.

Influenza forecasting with Google Flu Trends.利用谷歌流感趋势进行流感预测。

PLoS One. 2013;8(2):e56176. doi: 10.1371/journal.pone.0056176. Epub 2013 Feb 14.

Using electronic health records and Internet search information for accurate influenza forecasting.利用电子健康记录和互联网搜索信息进行准确的流感预测。

BMC Infect Dis. 2017 May 8;17(1):332. doi: 10.1186/s12879-017-2424-7.

Improving Google Flu Trends estimates for the United States through transformation.通过数据转换改进谷歌流感趋势对美国的预测。

PLoS One. 2014 Dec 31;9(12):e109209. doi: 10.1371/journal.pone.0109209. eCollection 2014.

Reassessing Google Flu Trends data for detection of seasonal and pandemic influenza: a comparative epidemiological study at three geographic scales.重新评估谷歌流感趋势数据在季节性和大流行性流感检测中的作用：三个地理尺度的比较流行病学研究。

PLoS Comput Biol. 2013;9(10):e1003256. doi: 10.1371/journal.pcbi.1003256. Epub 2013 Oct 17.

Google trends: a web-based tool for real-time surveillance of disease outbreaks.谷歌趋势：一种基于网络的疾病暴发实时监测工具。

Clin Infect Dis. 2009 Nov 15;49(10):1557-64. doi: 10.1086/630200.

Using Google Flu Trends data in forecasting influenza-like-illness related ED visits in Omaha, Nebraska.利用谷歌流感趋势数据预测内布拉斯加州奥马哈市与流感样疾病相关的急诊就诊情况。

Am J Emerg Med. 2014 Sep;32(9):1016-23. doi: 10.1016/j.ajem.2014.05.052. Epub 2014 Jun 12.

Using Google Trends for influenza surveillance in South China.利用谷歌趋势监测中国南方地区的流感疫情。

PLoS One. 2013;8(1):e55205. doi: 10.1371/journal.pone.0055205. Epub 2013 Jan 25.

United States Influenza Search Patterns Since the Emergence of COVID-19: Infodemiology Study.美国自新冠肺炎疫情出现以来的流感搜索模式：信息流行病学研究。

JMIR Public Health Surveill. 2022 Mar 3;8(3):e32364. doi: 10.2196/32364.

引用本文的文献

[Piloting an electronic death certificate (eTB app)-physicians' user experiences].[试用电子死亡证明（eTB应用程序）——医生的用户体验]

Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz. 2025 Jul;68(7):818-826. doi: 10.1007/s00103-025-04082-w. Epub 2025 Jun 2.

Incorporating connectivity among Internet search data for enhanced influenza-like illness tracking.利用互联网搜索数据的连通性提高流感样疾病监测能力。

PLoS One. 2024 Aug 26;19(8):e0305579. doi: 10.1371/journal.pone.0305579. eCollection 2024.

Influenza surveillance with Baidu index and attention-based long short-term memory model.基于百度指数和注意力机制长短期记忆模型的流感监测。

PLoS One. 2023 Jan 23;18(1):e0280834. doi: 10.1371/journal.pone.0280834. eCollection 2023.

Explanation of hand, foot, and mouth disease cases in Japan using Google Trends before and during the COVID-19: infodemiology study.在COVID-19疫情之前及期间利用谷歌趋势对日本手足口病病例进行的解释：信息流行病学研究

BMC Infect Dis. 2022 Oct 29;22(1):806. doi: 10.1186/s12879-022-07790-9.

Global monitoring of public interest in preventive measures against COVID-19 via analysis of Google Trends: an infodemiology and infoveillance study.通过分析谷歌趋势对 COVID-19 预防措施的公众关注度进行全球监测：一项信息流行病学和信息监测研究。

BMJ Open. 2022 Aug 11;12(8):e060715. doi: 10.1136/bmjopen-2021-060715.

Associations between Google Search Trends for Symptoms and COVID-19 Confirmed and Death Cases in the United States.美国症状谷歌搜索趋势与 COVID-19 确诊和死亡病例的关联。

Int J Environ Res Public Health. 2021 Apr 25;18(9):4560. doi: 10.3390/ijerph18094560.

Regional Infoveillance of COVID-19 Case Rates: Analysis of Search-Engine Query Patterns.新型冠状病毒肺炎病例发生率的区域信息监测：搜索引擎查询模式分析

J Med Internet Res. 2020 Jul 30;22(7):e19483. doi: 10.2196/19483.

Are online searches for the novel coronavirus (COVID-19) related to media or epidemiology? A cross-sectional study.针对新型冠状病毒（COVID-19）的在线搜索是否与媒体或流行病学有关？一项横断面研究。

Int J Infect Dis. 2020 Aug;97:386-390. doi: 10.1016/j.ijid.2020.06.028. Epub 2020 Jun 12.

Infodemiology and Infoveillance: Scoping Review.信息流行病学与信息监测：范围综述

J Med Internet Res. 2020 Apr 28;22(4):e16206. doi: 10.2196/16206.

The Application of Internet-Based Sources for Public Health Surveillance (Infoveillance): Systematic Review.基于互联网的公共卫生监测资源应用（信息监测）：系统评价

J Med Internet Res. 2020 Mar 13;22(3):e13680. doi: 10.2196/13680.

本文引用的文献

Use of Google Insights for Search to track seasonal and geographic kidney stone incidence in the United States.利用 Google 搜索趋势来追踪美国季节性和地域性肾结石发病率。

Urology. 2011 Aug;78(2):267-71. doi: 10.1016/j.urology.2011.01.010. Epub 2011 Apr 3.

Outbreak detection algorithms for seasonal disease data: a case study using Ross River virus disease.季节性疾病数据的爆发检测算法：以罗斯河病毒病为例的研究

BMC Med Inform Decis Mak. 2010 Nov 24;10:74. doi: 10.1186/1472-6947-10-74.

Trends and directions of global public health surveillance.全球公共卫生监测的趋势和方向。

Epidemiol Rev. 2010;32:93-109. doi: 10.1093/epirev/mxq008. Epub 2010 Jun 9.

The utility of "Google Trends" for epidemiological research: Lyme disease as an example.“谷歌趋势”在流行病学研究中的应用：以莱姆病为例。

Geospat Health. 2010 May;4(2):135-7. doi: 10.4081/gh.2010.195.

Early detection of disease outbreaks using the Internet.利用互联网早期发现疾病爆发。

CMAJ. 2009 Apr 14;180(8):829-31. doi: 10.1503/cmaj.090215.

Infodemiology and infoveillance: framework for an emerging set of public health informatics methods to analyze search, communication and publication behavior on the Internet.信息流行病学与信息监测：一套新兴的公共卫生信息学方法的框架，用于分析互联网上的搜索、交流和出版行为。

J Med Internet Res. 2009 Mar 27;11(1):e11. doi: 10.2196/jmir.1157.

The delivery of public health interventions via the Internet: actualizing their potential.通过互联网提供公共卫生干预措施：发挥其潜力。

Annu Rev Public Health. 2009;30:273-92. doi: 10.1146/annurev.publhealth.031308.100235.

Detecting influenza epidemics using search engine query data.利用搜索引擎查询数据检测流感疫情。

Nature. 2009 Feb 19;457(7232):1012-4. doi: 10.1038/nature07634.

Applying cusum-based methods for the detection of outbreaks of Ross River virus disease in Western Australia.应用基于累积和的方法检测西澳大利亚州罗斯河病毒病疫情。

BMC Med Inform Decis Mak. 2008 Aug 13;8:37. doi: 10.1186/1472-6947-8-37.

Comparing syndromic surveillance detection methods: EARS' versus a CUSUM-based methodology.比较症状监测检测方法：急诊室监测系统（EARS）与基于累积和（CUSUM）的方法。

Stat Med. 2008 Jul 30;27(17):3407-29. doi: 10.1002/sim.3197.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

流感预警：通过谷歌流感趋势进行早期疫情检测。

FluBreaks: early epidemic detection from Google flu trends.

作者信息

机构信息

出版信息

BACKGROUND

OBJECTIVE

METHODS

RESULTS

CONCLUSIONS

背景

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献