K Abdul Hamid Abdul Aziz, Wan Mohamad Nawi Wan Imanul Aisyah, Lola Muhamad Safiih, Mustafa Wan Azani, Abdul Malik Siti Madhihah, Zakaria Syerrina, Aruchunan Elayaraja, Zainuddin Nurul Hila, Gobithaasan R U, Abdullah Mohd Tajuddin
Faculty of Ocean Engineering Technology and Informatics, Universiti Malaysia Terengganu, Kuala Nerus 21030, Terengganu, Malaysia.
Special Interest Group on Applied Informatics and Intelligent Applications (AINIA), Universiti Malaysia Terengganu, Kuala Nerus 21030, Terengganu, Malaysia.
Diagnostics (Basel). 2023 Mar 15;13(6):1121. doi: 10.3390/diagnostics13061121.
Improving forecasts, particularly the accuracy, efficiency, and precision of time-series forecasts, is becoming critical for authorities to predict, monitor, and prevent the spread of the Coronavirus disease. However, the results obtained from the predictive models are imprecise and inefficient because the dataset contains linear and non-linear patterns, respectively. Linear models such as autoregressive integrated moving average cannot be used effectively to predict complex time series, so nonlinear approaches are better suited for such a purpose. Therefore, to achieve a more accurate and efficient predictive value of COVID-19 that is closer to the true value of COVID-19, a hybrid approach was implemented. Therefore, the objectives of this study are twofold. The first objective is to propose intelligence-based prediction methods to achieve better prediction results called autoregressive integrated moving average-least-squares support vector machine. The second objective is to investigate the performance of these proposed models by comparing them with the autoregressive integrated moving average, support vector machine, least-squares support vector machine, and autoregressive integrated moving average-support vector machine. Our investigation is based on three COVID-19 real datasets, i.e., daily new cases data, daily new death cases data, and daily new recovered cases data. Then, statistical measures such as mean square error, root mean square error, mean absolute error, and mean absolute percentage error were performed to verify that the proposed models are better than the autoregressive integrated moving average, support vector machine model, least-squares support vector machine, and autoregressive integrated moving average-support vector machine. Empirical results using three recent datasets of known the Coronavirus Disease-19 cases in Malaysia show that the proposed model generates the smallest mean square error, root mean square error, mean absolute error, and mean absolute percentage error values for training and testing datasets compared to the autoregressive integrated moving average, support vector machine, least-squares support vector machine, and autoregressive integrated moving average-support vector machine models. This means that the predicted value of the proposed model is closer to the true value. These results demonstrate that the proposed model can generate estimates more accurately and efficiently. Compared to the autoregressive integrated moving average, support vector machine, least-squares support vector machine, and autoregressive integrated moving average-support vector machine models, our proposed models perform much better in terms of percent error reduction for both training and testing all datasets. Therefore, the proposed model is possibly the most efficient and effective way to improve prediction for future pandemic performance with a higher level of accuracy and efficiency.
提高预测,尤其是时间序列预测的准确性、效率和精度,对于当局预测、监测和预防新冠病毒疾病的传播变得至关重要。然而,由于数据集分别包含线性和非线性模式,从预测模型获得的结果并不精确且效率低下。诸如自回归积分移动平均等线性模型无法有效地用于预测复杂的时间序列,因此非线性方法更适合此目的。因此,为了获得更接近新冠病毒真实值的更准确、高效的预测值,实施了一种混合方法。因此,本研究的目标有两个。第一个目标是提出基于智能的预测方法以实现更好的预测结果,即自回归积分移动平均 - 最小二乘支持向量机。第二个目标是通过将这些提出的模型与自回归积分移动平均、支持向量机、最小二乘支持向量机以及自回归积分移动平均 - 支持向量机进行比较,来研究这些模型的性能。我们的研究基于三个新冠病毒真实数据集,即每日新增病例数据、每日新增死亡病例数据和每日新增康复病例数据。然后,进行了诸如均方误差、均方根误差、平均绝对误差和平均绝对百分比误差等统计量的计算,以验证所提出的模型优于自回归积分移动平均、支持向量机模型、最小二乘支持向量机以及自回归积分移动平均 - 支持向量机。使用马来西亚最近三个已知新冠病毒病例数据集的实证结果表明,与自回归积分移动平均、支持向量机、最小二乘支持向量机以及自回归积分移动平均 - 支持向量机模型相比,所提出的模型在训练和测试数据集上产生的均方误差、均方根误差、平均绝对误差和平均绝对百分比误差值最小。这意味着所提出模型的预测值更接近真实值。这些结果表明所提出的模型能够更准确、高效地生成估计值。与自回归积分移动平均、支持向量机、最小二乘支持向量机以及自回归积分移动平均 - 支持向量机模型相比,我们提出的模型在所有数据集的训练和测试中,在百分比误差降低方面表现得更好。因此,所提出的模型可能是提高未来大流行预测准确性和效率的最有效方法。