Amusa Lateef Babatunde, Twinomurinzi Hossana, Phalane Edith, Phaswana-Mafuya Refilwe Nancy
Centre for Applied Data Science, University of Johannesburg, Johannesburg, South Africa.
Department of Statistics, University of Ilorin, Ilorin, Nigeria.
Interact J Med Res. 2023 Mar 31;12:e42292. doi: 10.2196/42292.
Infectious diseases represent a major challenge for health systems worldwide. With the recent global pandemic of COVID-19, the need to research strategies to treat these health problems has become even more pressing. Although the literature on big data and data science in health has grown rapidly, few studies have synthesized these individual studies, and none has identified the utility of big data in infectious disease surveillance and modeling.
The aim of this study was to synthesize research and identify hotspots of big data in infectious disease epidemiology.
Bibliometric data from 3054 documents that satisfied the inclusion criteria retrieved from the Web of Science database over 22 years (2000-2022) were analyzed and reviewed. The search retrieval occurred on October 17, 2022. Bibliometric analysis was performed to illustrate the relationships between research constituents, topics, and key terms in the retrieved documents.
The bibliometric analysis revealed internet searches and social media as the most utilized big data sources for infectious disease surveillance or modeling. The analysis also placed US and Chinese institutions as leaders in this research area. Disease monitoring and surveillance, utility of electronic health (or medical) records, methodology framework for infodemiology tools, and machine/deep learning were identified as the core research themes.
Proposals for future studies are made based on these findings. This study will provide health care informatics scholars with a comprehensive understanding of big data research in infectious disease epidemiology.
传染病是全球卫生系统面临的一项重大挑战。随着近期新冠疫情的全球大流行,研究治疗这些健康问题的策略的需求变得更加紧迫。尽管关于健康领域大数据和数据科学的文献增长迅速,但很少有研究对这些个别研究进行综合,而且没有一项研究确定大数据在传染病监测和建模中的效用。
本研究的目的是综合研究并确定传染病流行病学中大数据的热点。
对从科学网数据库中检索到的在22年(2000 - 2022年)期间满足纳入标准的3054篇文献的文献计量数据进行分析和综述。检索于2022年10月17日进行。进行文献计量分析以阐明检索到的文献中研究要素、主题和关键词之间的关系。
文献计量分析表明,互联网搜索和社交媒体是传染病监测或建模中使用最多的大数据来源。该分析还将美国和中国的机构列为该研究领域的领先者。疾病监测与监视、电子健康(或医疗)记录的效用、信息流行病学工具的方法框架以及机器学习/深度学习被确定为核心研究主题。
基于这些发现提出了未来研究的建议。本研究将为医疗保健信息学学者提供对传染病流行病学中大数据研究的全面理解。