• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用维基百科进行全球疾病监测与预测。

Global disease monitoring and forecasting with Wikipedia.

作者信息

Generous Nicholas, Fairchild Geoffrey, Deshpande Alina, Del Valle Sara Y, Priedhorsky Reid

机构信息

Defense Systems and Analysis Division, Los Alamos National Laboratory, Los Alamos, New Mexico, United States of America.

出版信息

PLoS Comput Biol. 2014 Nov 13;10(11):e1003892. doi: 10.1371/journal.pcbi.1003892. eCollection 2014 Nov.

DOI:10.1371/journal.pcbi.1003892
PMID:25392913
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4231164/
Abstract

Infectious disease is a leading threat to public health, economic stability, and other key social structures. Efforts to mitigate these impacts depend on accurate and timely monitoring to measure the risk and progress of disease. Traditional, biologically-focused monitoring techniques are accurate but costly and slow; in response, new techniques based on social internet data, such as social media and search queries, are emerging. These efforts are promising, but important challenges in the areas of scientific peer review, breadth of diseases and countries, and forecasting hamper their operational usefulness. We examine a freely available, open data source for this use: access logs from the online encyclopedia Wikipedia. Using linear models, language as a proxy for location, and a systematic yet simple article selection procedure, we tested 14 location-disease combinations and demonstrate that these data feasibly support an approach that overcomes these challenges. Specifically, our proof-of-concept yields models with r2 up to 0.92, forecasting value up to the 28 days tested, and several pairs of models similar enough to suggest that transferring models from one location to another without re-training is feasible. Based on these preliminary results, we close with a research agenda designed to overcome these challenges and produce a disease monitoring and forecasting system that is significantly more effective, robust, and globally comprehensive than the current state of the art.

摘要

传染病是对公共卫生、经济稳定和其他关键社会结构的主要威胁。减轻这些影响的努力依赖于准确及时的监测,以衡量疾病的风险和进展。传统的、以生物学为重点的监测技术准确,但成本高且速度慢;作为回应,基于社交媒体和搜索查询等社会互联网数据的新技术正在兴起。这些努力很有前景,但在科学同行评审、疾病和国家的覆盖范围以及预测等方面的重大挑战阻碍了它们的实际应用。我们研究了一种可免费获取的开放数据源用于此目的:在线百科全书维基百科的访问日志。使用线性模型、以语言作为地点的代理以及系统但简单的文章选择程序,我们测试了14种地点 - 疾病组合,并证明这些数据切实支持一种能够克服这些挑战的方法。具体而言,我们的概念验证产生了决定系数高达0.92的模型、在长达28天的测试期内的预测值,以及几对足够相似的模型,这表明在不重新训练的情况下将模型从一个地点转移到另一个地点是可行的。基于这些初步结果,我们最后提出了一个研究议程,旨在克服这些挑战,并产生一个比当前技术水平显著更有效、更稳健且全球覆盖范围更广的疾病监测和预测系统。

相似文献

1
Global disease monitoring and forecasting with Wikipedia.利用维基百科进行全球疾病监测与预测。
PLoS Comput Biol. 2014 Nov 13;10(11):e1003892. doi: 10.1371/journal.pcbi.1003892. eCollection 2014 Nov.
2
Enhancing COVID-19 Epidemic Forecasting Accuracy by Combining Real-time and Historical Data From Multiple Internet-Based Sources: Analysis of Social Media Data, Online News Articles, and Search Queries.利用多个基于互联网的来源的实时和历史数据提高 COVID-19 疫情预测准确性:社交媒体数据、在线新闻文章和搜索查询分析。
JMIR Public Health Surveill. 2022 Jun 16;8(6):e35266. doi: 10.2196/35266.
3
Data-model fusion to better understand emerging pathogens and improve infectious disease forecasting.数据模型融合以更好地了解新出现的病原体并改进传染病预测。
Ecol Appl. 2011 Jul;21(5):1443-60. doi: 10.1890/09-1409.1.
4
Forecasting the 2013-2014 influenza season using Wikipedia.利用维基百科预测2013 - 2014年流感季节。
PLoS Comput Biol. 2015 May 14;11(5):e1004239. doi: 10.1371/journal.pcbi.1004239. eCollection 2015 May.
5
Epidemic Forecasting is Messier Than Weather Forecasting: The Role of Human Behavior and Internet Data Streams in Epidemic Forecast.疫情预测比天气预报更复杂:人类行为和互联网数据流在疫情预测中的作用
J Infect Dis. 2016 Dec 1;214(suppl_4):S404-S408. doi: 10.1093/infdis/jiw375.
6
Measuring Global Disease with Wikipedia: Success, Failure, and a Research Agenda.利用维基百科衡量全球疾病:成功、失败与研究议程
CSCW Conf Comput Support Coop Work. 2017 Feb-Mar;2017:1812-1834. doi: 10.1145/2998181.2998183.
7
Digital Disease Surveillance for Emerging Infectious Diseases: An Early Warning System Using the Internet and Social Media Data for COVID-19 Forecasting in Canada.利用互联网和社交媒体数据进行数字疾病监测以发现新发传染病:用于加拿大 COVID-19 预测的预警系统。
Stud Health Technol Inform. 2023 May 18;302:861-865. doi: 10.3233/SHTI230290.
8
Dengue prediction by the web: Tweets are a useful tool for estimating and forecasting Dengue at country and city level.网络进行登革热预测:推文是在国家和城市层面估算和预测登革热的有用工具。
PLoS Negl Trop Dis. 2017 Jul 18;11(7):e0005729. doi: 10.1371/journal.pntd.0005729. eCollection 2017 Jul.
9
Forecasting disease risk for increased epidemic preparedness in public health.预测疾病风险以加强公共卫生领域的疫情防范能力。
Adv Parasitol. 2000;47:309-30. doi: 10.1016/s0065-308x(00)47013-2.
10
Digital disease detection: A systematic review of event-based internet biosurveillance systems.数字疾病检测:基于事件的互联网生物监测系统的系统综述
Int J Med Inform. 2017 May;101:15-22. doi: 10.1016/j.ijmedinf.2017.01.019. Epub 2017 Feb 8.

引用本文的文献

1
AI-Enabled Diagnostic Prediction within Electronic Health Records to Enhance Biosurveillance and Early Outbreak Detection.利用电子健康记录中的人工智能进行诊断预测以加强生物监测和早期疫情检测。
Res Sq. 2025 Jun 12:rs.3.rs-6606632. doi: 10.21203/rs.3.rs-6606632/v1.
2
AI-Enabled Diagnostic Prediction within Electronic Health Records to Enhance Biosurveillance and Early Outbreak Detection.利用电子健康记录中的人工智能进行诊断预测以加强生物监测和早期疫情检测。
medRxiv. 2025 May 16:2025.05.14.25327606. doi: 10.1101/2025.05.14.25327606.
3
Changes in Reproductive Health Information-Seeking Behaviors After the Dobbs Decision: Systematic Search of the Wikimedia Database.

本文引用的文献

1
Wikipedia usage estimates prevalence of influenza-like illness in the United States in near real-time.维基百科的使用情况近乎实时地估计了美国流感样疾病的流行程度。
PLoS Comput Biol. 2014 Apr 17;10(4):e1003581. doi: 10.1371/journal.pcbi.1003581. eCollection 2014 Apr.
2
National and local influenza surveillance through Twitter: an analysis of the 2012-2013 influenza epidemic.通过推特进行的国家和地方流感监测:对2012 - 2013年流感疫情的分析
PLoS One. 2013 Dec 9;8(12):e83672. doi: 10.1371/journal.pone.0083672. eCollection 2013.
3
Correlation between national influenza surveillance data and google trends in South Korea.
多布斯案裁决后生殖健康信息寻求行为的变化:维基媒体数据库的系统检索
JMIR Infodemiology. 2024 Dec 16;4:e64577. doi: 10.2196/64577.
4
Transforming Disease Surveillance through Artificial Intelligence.通过人工智能变革疾病监测
Indian J Community Med. 2024 Sep-Oct;49(5):663-664. doi: 10.4103/ijcm.ijcm_459_24. Epub 2024 Aug 14.
5
Predicting Norovirus in England Using Existing and Emerging Syndromic Data: Infodemiology Study.利用现有和新兴综合征数据预测英格兰的诺如病毒:信息流行病学研究。
J Med Internet Res. 2023 May 8;25:e37540. doi: 10.2196/37540.
6
Joint COVID-19 and influenza-like illness forecasts in the United States using internet search information.利用互联网搜索信息对美国新冠病毒病和流感样疾病进行联合预测。
Commun Med (Lond). 2023 Mar 24;3(1):39. doi: 10.1038/s43856-023-00272-2.
7
Internet search data with spatiotemporal analysis in infectious disease surveillance: Challenges and perspectives.利用时空分析进行传染病监测的互联网搜索数据:挑战与展望。
Front Public Health. 2022 Dec 5;10:958835. doi: 10.3389/fpubh.2022.958835. eCollection 2022.
8
What's hot and what's not in lay psychology: Wikipedia's most-viewed articles.大众心理学中的热门与冷门:维基百科浏览量最高的文章。
Curr Psychol. 2022 Oct 12:1-13. doi: 10.1007/s12144-022-03826-0.
9
Surveillance of emerging infectious diseases for biosecurity.生物安全新兴传染病监测。
Sci China Life Sci. 2022 Aug;65(8):1504-1516. doi: 10.1007/s11427-021-2071-x. Epub 2022 Mar 4.
10
Inclusion of environmentally themed search terms improves Elastic net regression nowcasts of regional Lyme disease rates.包含环境主题的搜索词可以提高弹性网络回归对区域性莱姆病发病率的实时预测。
PLoS One. 2022 Mar 10;17(3):e0251165. doi: 10.1371/journal.pone.0251165. eCollection 2022.
韩国国家流感监测数据与谷歌趋势之间的相关性。
PLoS One. 2013 Dec 5;8(12):e81422. doi: 10.1371/journal.pone.0081422. eCollection 2013.
4
Real-time influenza forecasts during the 2012-2013 season.2012-2013 年季节的实时流感预测。
Nat Commun. 2013;4:2837. doi: 10.1038/ncomms3837.
5
Internet search patterns of human immunodeficiency virus and the digital divide in the Russian Federation: infoveillance study.俄罗斯联邦人类免疫缺陷病毒的网络搜索模式与数字鸿沟:信息监测研究
J Med Internet Res. 2013 Nov 12;15(11):e256. doi: 10.2196/jmir.2936.
6
Using search queries for malaria surveillance, Thailand.利用疟疾监测搜索查询,泰国。
Malar J. 2013 Nov 4;12:390. doi: 10.1186/1475-2875-12-390.
7
The complex relationship of realspace events and messages in cyberspace: case study of influenza and pertussis using tweets.现实空间事件与网络空间信息之间的复杂关系:以流感和百日咳为例利用推文进行的案例研究
J Med Internet Res. 2013 Oct 24;15(10):e237. doi: 10.2196/jmir.2705.
8
Reassessing Google Flu Trends data for detection of seasonal and pandemic influenza: a comparative epidemiological study at three geographic scales.重新评估谷歌流感趋势数据在季节性和大流行性流感检测中的作用:三个地理尺度的比较流行病学研究。
PLoS Comput Biol. 2013;9(10):e1003256. doi: 10.1371/journal.pcbi.1003256. Epub 2013 Oct 17.
9
Early prediction of movie box office success based on Wikipedia activity big data.基于维基百科活动大数据的电影票房成功的早期预测。
PLoS One. 2013 Aug 21;8(8):e71226. doi: 10.1371/journal.pone.0071226. eCollection 2013.
10
Quality of information on the Internet about carpal tunnel syndrome: an update.互联网上关于腕管综合征的信息质量:最新情况
Orthopedics. 2013 Aug;36(8):e1038-41. doi: 10.3928/01477447-20130724-20.