Sardar Iqra, Akbar Muhammad Azeem, Leiva Víctor, Alsanad Ahmed, Mishra Pradeep
Department of Mathematics and Statistics, International Islamic University Islamabad, Islamabad, Pakistan.
Department of Software Engineering, LUT University, Lappeenranta, Finland.
Stoch Environ Res Risk Assess. 2023;37(1):345-359. doi: 10.1007/s00477-022-02307-x. Epub 2022 Oct 5.
Machine learning (ML) has proved to be a prominent study field while solving complex real-world problems. The whole globe has suffered and continues suffering from Coronavirus disease 2019 (COVID-19), and its projections need to be forecasted. In this article, we propose and derive an autoregressive modeling framework based on ML and statistical methods to predict confirmed cases of COVID-19 in the South Asian Association for Regional Cooperation (SAARC) countries. Automatic forecasting models based on autoregressive integrated moving average (ARIMA) and Prophet time series structures, as well as extreme gradient boosting, generalized linear model elastic net (GLMNet), and random forest ML techniques, are introduced and applied to COVID-19 data from the SAARC countries. Different forecasting models are compared by means of selection criteria. By using evaluation metrics, the best and suitable models are selected. Results prove that the ARIMA model is found to be suitable and ideal for forecasting confirmed infected cases of COVID-19 in these countries. For the confirmed cases in Afghanistan, Bangladesh, India, Maldives, and Sri Lanka, the ARIMA model is superior to the other models. In Bhutan, the Prophet time series model is appropriate for predicting such cases. The GLMNet model is more accurate than other time-series models for Nepal and Pakistan. The random forest model is excluded from forecasting because of its poor fit.
机器学习(ML)在解决复杂的现实世界问题时已被证明是一个突出的研究领域。全球一直在遭受并仍在遭受2019冠状病毒病(COVID-19)的影响,其预测需要进行预估。在本文中,我们提出并推导了一个基于机器学习和统计方法的自回归建模框架,以预测南亚区域合作联盟(SAARC)国家的COVID-19确诊病例。介绍了基于自回归积分移动平均(ARIMA)和先知时间序列结构的自动预测模型,以及极端梯度提升、广义线性模型弹性网(GLMNet)和随机森林机器学习技术,并将其应用于SAARC国家的COVID-19数据。通过选择标准对不同的预测模型进行比较。通过使用评估指标,选择最佳和合适的模型。结果证明,ARIMA模型被发现适用于预测这些国家的COVID-19确诊感染病例。对于阿富汗、孟加拉国、印度、马尔代夫和斯里兰卡的确诊病例,ARIMA模型优于其他模型。在不丹,先知时间序列模型适用于预测此类病例。GLMNet模型对尼泊尔和巴基斯坦的预测比其他时间序列模型更准确。随机森林模型因拟合效果不佳而被排除在预测之外。