• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用搜索引擎大数据预测新的 HIV 诊断。

Using search engine big data for predicting new HIV diagnoses.

机构信息

University of California Institute for Prediction Technology, Department of Family Medicine, University of California Los Angeles, Los Angeles, California, United States of America.

Department of Systems Engineering and Engineering Management, City University of Hong Kong, Kowloon, Hong Kong.

出版信息

PLoS One. 2018 Jul 12;13(7):e0199527. doi: 10.1371/journal.pone.0199527. eCollection 2018.

DOI:10.1371/journal.pone.0199527
PMID:30001360
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6042696/
Abstract

BACKGROUND

A large and growing body of "big data" is generated by internet search engines, such as Google. Because people often search for information about public health and medical issues, researchers may be able to use search engine data to monitor and predict public health problems, such as HIV. We sought to assess the feasibility of using Google search data to analyze and predict new HIV diagnoses cases in the United States.

METHODS AND FINDINGS

From 2007 to 2014, we collected search volume data on HIV-related Google search keywords across the United States. State-level new HIV diagnoses data were collected from the Centers for Disease Control and Prevention (CDC) and AIDSVu.org. We developed a negative binomial model to predict HIV cases using a subset of significant predictor keywords identified by LASSO. The Google search data were combined with state-level HIV case reports provided by the CDC. We use historical data to train the model and predict new HIV diagnoses from 2011 to 2014, with an average R2 value of 0.99 between predicted versus actual cases, and average root-mean-square error (RMSE) of 108.75.

CONCLUSIONS

Results indicate that Google Trends is a feasible tool to predict new cases of HIV at the state level. We discuss the implications of integrating visualization maps and tools based on these models into public health and HIV monitoring and surveillance.

摘要

背景

互联网搜索引擎(如谷歌)生成了大量且不断增长的“大数据”。由于人们经常搜索有关公共卫生和医疗问题的信息,因此研究人员或许能够使用搜索引擎数据来监测和预测公共卫生问题,例如 HIV。我们旨在评估使用谷歌搜索数据来分析和预测美国新的 HIV 诊断病例的可行性。

方法和发现

从 2007 年到 2014 年,我们在美国范围内收集了与 HIV 相关的谷歌搜索关键字的搜索量数据。从疾病预防控制中心(CDC)和 AIDSVu.org 收集了各州新的 HIV 诊断数据。我们开发了一个负二项式模型,使用通过 LASSO 确定的一组显著预测关键字来预测 HIV 病例。将谷歌搜索数据与由 CDC 提供的各州 HIV 病例报告相结合。我们使用历史数据来训练模型,并预测 2011 年至 2014 年的新 HIV 诊断,预测与实际病例之间的平均 R2 值为 0.99,平均均方根误差(RMSE)为 108.75。

结论

结果表明,谷歌趋势是预测州级新 HIV 病例的可行工具。我们讨论了将这些模型的可视化地图和工具集成到公共卫生和 HIV 监测和监测中的意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/229a/6042696/fbf1118068ec/pone.0199527.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/229a/6042696/f104b50aa4c8/pone.0199527.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/229a/6042696/fbf1118068ec/pone.0199527.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/229a/6042696/f104b50aa4c8/pone.0199527.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/229a/6042696/fbf1118068ec/pone.0199527.g002.jpg

相似文献

1
Using search engine big data for predicting new HIV diagnoses.利用搜索引擎大数据预测新的 HIV 诊断。
PLoS One. 2018 Jul 12;13(7):e0199527. doi: 10.1371/journal.pone.0199527. eCollection 2018.
2
Using Search Engine Data as a Tool to Predict Syphilis.利用搜索引擎数据预测梅毒
Epidemiology. 2018 Jul;29(4):574-578. doi: 10.1097/EDE.0000000000000836.
3
Using Search Engine Data to Explore Interest in PrEP and HIV Testing in the United States.利用搜索引擎数据探索美国对 PrEP 和 HIV 检测的兴趣。
AIDS Behav. 2021 Mar;25(3):983-991. doi: 10.1007/s10461-020-03057-z. Epub 2020 Oct 8.
4
FluBreaks: early epidemic detection from Google flu trends.流感预警:通过谷歌流感趋势进行早期疫情检测。
J Med Internet Res. 2012 Oct 4;14(5):e125. doi: 10.2196/jmir.2102.
5
Using internet search data to predict new HIV diagnoses in China: a modelling study.利用互联网搜索数据预测中国的新 HIV 诊断:一项建模研究。
BMJ Open. 2018 Oct 17;8(10):e018335. doi: 10.1136/bmjopen-2017-018335.
6
Surveillance Tools Emerging From Search Engines and Social Media Data for Determining Eye Disease Patterns.从搜索引擎和社交媒体数据中涌现的用于确定眼病模式的监测工具。
JAMA Ophthalmol. 2016 Sep 1;134(9):1024-30. doi: 10.1001/jamaophthalmol.2016.2267.
7
Regional Infoveillance of COVID-19 Case Rates: Analysis of Search-Engine Query Patterns.新型冠状病毒肺炎病例发生率的区域信息监测:搜索引擎查询模式分析
J Med Internet Res. 2020 Jul 30;22(7):e19483. doi: 10.2196/19483.
8
Use Internet search data to accurately track state level influenza epidemics.利用互联网搜索数据准确追踪州级流感疫情。
Sci Rep. 2021 Feb 17;11(1):4023. doi: 10.1038/s41598-021-83084-5.
9
Predicting new diagnoses of HIV infection using internet search engine data.利用互联网搜索引擎数据预测HIV感染的新诊断情况。
Clin Infect Dis. 2013 May;56(9):1352-3. doi: 10.1093/cid/cit022. Epub 2013 Jan 18.
10
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

引用本文的文献

1
Trending ticks: using Google Trends data to understand tickborne disease prevention.热门话题:利用谷歌趋势数据了解蜱传疾病预防。
Front Public Health. 2024 Jun 13;12:1410713. doi: 10.3389/fpubh.2024.1410713. eCollection 2024.
2
Digital Epidemiological Approaches in HIV Research: a Scoping Methodological Review.数字流行病学方法在 HIV 研究中的应用:系统综述方法学。
Curr HIV/AIDS Rep. 2023 Dec;20(6):470-480. doi: 10.1007/s11904-023-00673-x. Epub 2023 Nov 2.
3
Predicting HIV infection in the decade (2005-2015) pre-COVID-19 in Zimbabwe: A supervised classification-based machine learning approach.

本文引用的文献

1
Internet searches for opioids predict future emergency department heroin admissions.互联网对阿片类药物的搜索预测了未来急诊部门的海洛因入院人数。
Drug Alcohol Depend. 2018 Sep 1;190:166-169. doi: 10.1016/j.drugalcdep.2018.05.009. Epub 2018 Jun 19.
2
Future-oriented tweets predict lower county-level HIV prevalence in the United States.面向未来的推文预测美国县级艾滋病毒流行率较低。
Health Psychol. 2015 Dec;34S:1252-60. doi: 10.1037/hea0000279.
3
Accurate estimation of influenza epidemics using Google search data via ARGO.通过ARGO利用谷歌搜索数据准确估计流感疫情。
预测2019年新冠疫情之前十年(2005 - 2015年)津巴布韦的艾滋病毒感染情况:一种基于监督分类的机器学习方法。
PLOS Digit Health. 2023 Jun 7;2(6):e0000260. doi: 10.1371/journal.pdig.0000260. eCollection 2023 Jun.
4
Hangover-Related Internet Searches Before and During the COVID-19 Pandemic in England: Observational Study.英国新冠疫情期间及之前与宿醉相关的网络搜索:观察性研究
JMIR Form Res. 2023 Mar 3;7:e40518. doi: 10.2196/40518.
5
Increased Rate of Fracture Injuries Associated With Alternative Modes of Transportation During COVID-19.与 COVID-19 期间替代交通方式相关的骨折损伤发生率增加。
J Am Acad Orthop Surg Glob Res Rev. 2022 Sep 27;6(9). doi: 10.5435/JAAOSGlobal-D-22-00147. eCollection 2022 Sep 1.
6
Three dimensions of COVID-19 risk perceptions and their socioeconomic correlates in the United States: A social media analysis.美国 COVID-19 风险认知的三个维度及其社会经济相关性:社交媒体分析。
Risk Anal. 2023 Jun;43(6):1174-1186. doi: 10.1111/risa.13993. Epub 2022 Jul 13.
7
COVID-19 hospitalizations forecasts using internet search data.使用互联网搜索数据预测 COVID-19 住院情况。
Sci Rep. 2022 Jun 11;12(1):9661. doi: 10.1038/s41598-022-13162-9.
8
State health policies and interest in PrEP: evidence from Google Trends.州级卫生政策与对 PrEP 的关注:谷歌趋势提供的证据。
AIDS Care. 2022 Mar;34(3):331-339. doi: 10.1080/09540121.2021.1934381. Epub 2021 Jun 30.
9
Technology-Delivered Intervention Strategies to Bolster HIV Testing.利用技术手段强化艾滋病病毒检测的干预策略
Curr HIV/AIDS Rep. 2021 Aug;18(4):391-405. doi: 10.1007/s11904-021-00565-y. Epub 2021 Jun 9.
10
Power of Big Data in ending HIV.大数据终结艾滋病的力量。
AIDS. 2021 May 1;35(Suppl 1):S1-S5. doi: 10.1097/QAD.0000000000002888.
Proc Natl Acad Sci U S A. 2015 Nov 24;112(47):14473-8. doi: 10.1073/pnas.1515373112. Epub 2015 Nov 9.
4
Adaptive nowcasting of influenza outbreaks using Google searches.利用谷歌搜索进行流感疫情的适应性实时预测。
R Soc Open Sci. 2014 Oct 29;1(2):140095. doi: 10.1098/rsos.140095. eCollection 2014 Oct.
5
Using internet search queries for infectious disease surveillance: screening diseases for suitability.利用互联网搜索查询进行传染病监测:筛选适合监测的疾病
BMC Infect Dis. 2014 Dec 31;14:690. doi: 10.1186/s12879-014-0690-1.
6
The reliability of tweets as a supplementary method of seasonal influenza surveillance.推文作为季节性流感监测补充方法的可靠性。
J Med Internet Res. 2014 Nov 14;16(11):e250. doi: 10.2196/jmir.3532.
7
Methods of using real-time social media technologies for detection and remote monitoring of HIV outcomes.利用实时社交媒体技术检测和远程监测艾滋病病毒感染结果的方法。
Prev Med. 2014 Jun;63:112-5. doi: 10.1016/j.ypmed.2014.01.024. Epub 2014 Feb 8.
8
National and local influenza surveillance through Twitter: an analysis of the 2012-2013 influenza epidemic.通过推特进行的国家和地方流感监测:对2012 - 2013年流感疫情的分析
PLoS One. 2013 Dec 9;8(12):e83672. doi: 10.1371/journal.pone.0083672. eCollection 2013.
9
Predicting new diagnoses of HIV infection using internet search engine data.利用互联网搜索引擎数据预测HIV感染的新诊断情况。
Clin Infect Dis. 2013 May;56(9):1352-3. doi: 10.1093/cid/cit022. Epub 2013 Jan 18.
10
Norovirus disease surveillance using Google Internet query share data.利用谷歌互联网查询份额数据进行诺如病毒病监测。
Clin Infect Dis. 2012 Oct;55(8):e75-8. doi: 10.1093/cid/cis579. Epub 2012 Jun 19.