Cheng Yunyun, Cheng Rong, Xu Ting, Tan Xiuhui, Bai Yanping
School of Information and Communication Engineering, North University of China, Taiyuan 030051, China.
School of Mathematics, North University of China, Taiyuan 030051, China.
Bioengineering (Basel). 2025 May 13;12(5):514. doi: 10.3390/bioengineering12050514.
COVID-19 was one of the most serious global public health emergencies in recent years, and its extremely fast spreading speed had a profound negative impact on society. A comprehensive analysis and prediction of COVID-19 could lay a theoretical foundation for monitoring and early warning systems. Since the outbreak of COVID-19, there has been an influx of research on predictive modelling, with artificial intelligence (AI) techniques, particularly machine learning (ML) methods, becoming the dominant research direction due to their superior capability in processing multidimensional datasets and capturing complex nonlinear transmission patterns. We systematically reviewed COVID-19 ML prediction models developed under the background of the epidemic using the PRISMA method. We used the selected keywords to screen the relevant literature of COVID-19 prediction using ML technology from 2020 to 2023 in the Web of Science, Springer and Elsevier databases. Based on predetermined inclusion and exclusion criteria, 136 eligible studies were ultimately selected from 5731 preliminarily screened publications, and the datasets, data preprocessing, ML models, and evaluation metrics used in these studies were assessed. By establishing a multi-level classification framework that included traditional statistical models (such as ARIMA), ML models (such as SVM), deep learning (DL) models (such as CNN, LSTM), ensemble learning methods (such as AdaBoost), and hybrid models (such as the fusion architecture of intelligent optimization algorithms and neural networks), it revealed that the hybrid modelling strategy effectively improved the prediction accuracy of the model through feature combination optimization and model cascade integration. In addition, we compared the performance of ML models with other models in the COVID-19 prediction task. The results showed that the propagation of COVID-19 is affected by multiple factors, including meteorological and socio-economic conditions. Compared to traditional methods, ML methods demonstrated significant advantages in COVID-19 prediction, especially hybrid modelling strategies, which showed great potential in optimizing accuracy. However, these techniques face challenges and limitations despite their strong performance. By reviewing existing research on COVID-19 prediction, this study provided systematic theoretical support for AI applications in infectious disease prediction and promoted technological innovation in public health.
新冠疫情是近年来最严重的全球突发公共卫生事件之一,其极快的传播速度给社会带来了深远的负面影响。对新冠疫情进行全面分析和预测可为监测和预警系统奠定理论基础。自新冠疫情爆发以来,关于预测模型的研究大量涌现,人工智能(AI)技术,特别是机器学习(ML)方法,因其在处理多维数据集和捕捉复杂非线性传播模式方面的卓越能力,成为了主要的研究方向。我们使用PRISMA方法系统回顾了在疫情背景下开发的新冠疫情ML预测模型。我们使用选定的关键词在Web of Science、Springer和Elsevier数据库中筛选了2020年至2023年期间使用ML技术进行新冠疫情预测的相关文献。基于预先确定的纳入和排除标准,最终从5731篇初步筛选的出版物中选出了136项符合条件的研究,并对这些研究中使用的数据集、数据预处理、ML模型和评估指标进行了评估。通过建立一个多层次分类框架,其中包括传统统计模型(如ARIMA)、ML模型(如SVM)、深度学习(DL)模型(如CNN、LSTM)、集成学习方法(如AdaBoost)和混合模型(如智能优化算法与神经网络的融合架构),结果表明混合建模策略通过特征组合优化和模型级联集成有效地提高了模型的预测准确性。此外,我们在新冠疫情预测任务中比较了ML模型与其他模型的性能。结果表明,新冠疫情的传播受到多种因素的影响,包括气象和社会经济条件。与传统方法相比,ML方法在新冠疫情预测中表现出显著优势,特别是混合建模策略,在优化准确性方面显示出巨大潜力。然而,尽管这些技术表现强劲,但它们也面临挑战和局限性。通过回顾现有的新冠疫情预测研究,本研究为AI在传染病预测中的应用提供了系统的理论支持,并推动了公共卫生领域的技术创新。