Alibudbud Rowalt
Department of Sociology and Behavioral Sciences, De La Salle University, Manila, Philippines.
Front Big Data. 2023 Jul 4;6:1199060. doi: 10.3389/fdata.2023.1199060. eCollection 2023.
Wikipedia is an open-source online encyclopedia and one of the most-read sources of online health information. Likewise, Wikipedia page views have also been analyzed to inform public health services and policies. The present review analyzed 29 studies utilizing Wikipedia page views for health research. Most reviewed studies were published in recent years and emanated from high-income countries. Together with Wikipedia page views, most studies also used data from other internet sources, such as Google, Twitter, YouTube, and Reddit. The reviewed studies also explored various non-communicable diseases, infectious diseases, and health interventions to describe changes in the utilization of online health information from Wikipedia, to examine the effect of public events on public interest and information usage about health-related Wikipedia pages, to estimate and predict the incidence and prevalence of diseases, to predict data from other internet data sources, to evaluate the effectiveness of health education activities, and to explore the evolution of a health topic. Given some of the limitations in replicating some of the reviewed studies, future research can specify the specific Wikipedia page or pages analyzed, the language of the Wikipedia pages examined, dates of data collection, dates explored, type of data, and whether page views were limited to Internet users and whether web crawlers and redirects to the Wikipedia page were included. Future research can also explore public interest in other commonly read health topics available in Wikipedia, develop Wikipedia-based models that can be used to predict disease incidence and improve Wikipedia-based health education activities.
维基百科是一个开源在线百科全书,也是在线健康信息阅读量最大的来源之一。同样,维基百科的页面浏览量也被用于分析,以为公共卫生服务和政策提供参考。本综述分析了29项利用维基百科页面浏览量进行健康研究的研究。大多数被综述的研究是近年来发表的,且来自高收入国家。除了维基百科页面浏览量,大多数研究还使用了来自其他互联网来源的数据,如谷歌、推特、优兔和红迪网。被综述的研究还探讨了各种非传染性疾病、传染病和健康干预措施,以描述维基百科在线健康信息使用情况的变化,研究公共事件对与健康相关的维基百科页面的公众兴趣和信息使用的影响,估计和预测疾病的发病率和患病率,从其他互联网数据源预测数据,评估健康教育活动的有效性,以及探索健康主题的演变。鉴于在复制一些被综述的研究时存在一些局限性,未来的研究可以明确所分析的具体维基百科页面、所检查的维基百科页面的语言、数据收集日期、探索日期、数据类型,以及页面浏览量是否仅限于互联网用户,是否包括网络爬虫和指向维基百科页面的重定向。未来的研究还可以探索公众对维基百科中其他常见的健康主题的兴趣,开发基于维基百科的模型,用于预测疾病发病率并改进基于维基百科的健康教育活动。