Institute for Medical Research (IMR), Ministry of Health Malaysia, Shah Alam 40170, Malaysia.
School of Mathematical and Computer Sciences, Heriot-Watt University Malaysia, Putrajaya 62200, Malaysia.
Int J Environ Res Public Health. 2022 Jan 28;19(3):1504. doi: 10.3390/ijerph19031504.
With many countries experiencing a resurgence in COVID-19 cases, it is important to forecast disease trends to enable effective planning and implementation of control measures. This study aims to develop Seasonal Autoregressive Integrated Moving Average (SARIMA) models using 593 data points and smoothened case and covariate time-series data to generate a 28-day forecast of COVID-19 case trends during the third wave in Malaysia. SARIMA models were developed using COVID-19 case data sourced from the Ministry of Health Malaysia's official website. Model training and validation was conducted from 22 January 2020 to 5 September 2021 using daily COVID-19 case data. The SARIMA model with the lowest root mean square error (RMSE), mean absolute percentage error (MAE) and Bayesian information criterion (BIC) was selected to generate forecasts from 6 September to 3 October 2021. The best SARIMA model with a RMSE = 73.374, MAE = 39.716 and BIC = 8.656 showed a downward trend of COVID-19 cases during the forecast period, wherein the observed daily cases were within the forecast range. The majority (89%) of the difference between the forecasted and observed values was well within a deviation range of 25%. Based on this work, we conclude that SARIMA models developed in this paper using 593 data points and smoothened data and sensitive covariates can generate accurate forecast of COVID-19 case trends.
随着许多国家的 COVID-19 病例再次出现,预测疾病趋势以实现有效规划和实施控制措施非常重要。本研究旨在使用 593 个数据点和平滑的病例和协变量时间序列数据开发季节性自回归综合移动平均 (SARIMA) 模型,以生成马来西亚第三波 COVID-19 病例趋势的 28 天预测。SARIMA 模型使用从马来西亚卫生部官方网站获取的 COVID-19 病例数据开发。使用每日 COVID-19 病例数据,从 2020 年 1 月 22 日至 2021 年 9 月 5 日对模型进行训练和验证。选择具有最低均方根误差 (RMSE)、平均绝对百分比误差 (MAE) 和贝叶斯信息准则 (BIC) 的 SARIMA 模型来生成从 2021 年 9 月 6 日至 10 月 3 日的预测。最佳 SARIMA 模型的 RMSE = 73.374、MAE = 39.716 和 BIC = 8.656,显示出预测期内 COVID-19 病例呈下降趋势,其中观察到的每日病例在预测范围内。预测值和观察值之间的差异中,有 89%在 25%的偏差范围内。基于这项工作,我们得出结论,使用 593 个数据点和平滑数据以及敏感协变量开发的本文中的 SARIMA 模型可以对 COVID-19 病例趋势进行准确预测。