Barros Joana M, Duggan Jim, Rebholz-Schuhmann Dietrich
Insight Centre for Data Analytics, National University of Ireland Galway, Galway, Ireland.
School of Computer Science, National University of Ireland Galway, Galway, Ireland.
J Med Internet Res. 2020 Mar 13;22(3):e13680. doi: 10.2196/13680.
Public health surveillance is based on the continuous and systematic collection, analysis, and interpretation of data. Surveillance informs the development of early warning systems to monitor epidemics and documents the impact of intervention measures. The introduction of digital data sources, specifically sources available on the internet, has changed the field of public health surveillance. The availability and scale of internet-based sources (IBSs) have paved the way for novel approaches to disease surveillance, exploration of health communities, and the study of epidemic dynamics. This field and approach is also known as infodemiology or infoveillance.
This review aimed to assess research findings on the application of IBSs for public health surveillance (infodemiology or infoveillance). To this end, we present a comprehensive systematic literature review focused on these sources and their limitations, the diseases targeted, and the methods commonly applied.
A systematic literature review was conducted targeting publications between 2012 and 2018 that leveraged IBSs for public health surveillance, outbreak forecasting, disease characterization, diagnosis prediction, content analysis, and health-topic identification. The search results were filtered according to previously defined inclusion and exclusion criteria.
Across a total of 162 publications, infectious diseases were the most common case study (108/162, 66.7%). Of the seven categories of IBSs (search queries, social media, news, discussion forums, websites, web encyclopedia, and online obituaries), search queries and social media were applied in 95.1% (154/162) of the reviewed publications. We also identified limitations in representativeness and biased user age groups, as well as a high susceptibility of search queries, social media, and web encyclopedias to media events.
IBSs are a valuable proxy for studying illnesses affecting the general population; however, it is important to characterize which diseases are best suited to the available sources, and the literature shows that the level of engagement on online platforms can be a potential indicator. The population's online behavior needs to be better understood; in addition, the dissemination of health information and its content remains largely unexplored. With this information, we can understand how the population communicates about illnesses online and, in the process, benefit public health.