Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi, China.
Shanxi Centre for Disease Control and Prevention, Taiyuan, 030012, Shanxi, China.
BMC Infect Dis. 2023 Feb 6;23(1):71. doi: 10.1186/s12879-023-08025-1.
Influenza is an acute respiratory infectious disease that is highly infectious and seriously damages human health. Reasonable prediction is of great significance to control the epidemic of influenza.
Our Influenza data were extracted from Shanxi Provincial Center for Disease Control and Prevention. Seasonal-trend decomposition using Loess (STL) was adopted to analyze the season characteristics of the influenza in Shanxi Province, China, from the 1st week in 2010 to the 52nd week in 2019. To handle the insufficient prediction performance of the seasonal autoregressive integrated moving average (SARIMA) model in predicting the nonlinear parts and the poor accuracy of directly predicting the original sequence, this study established the SARIMA model, the combination model of SARIMA and Long-Short Term Memory neural network (SARIMA-LSTM) and the combination model of SARIMA-LSTM based on Singular spectrum analysis (SSA-SARIMA-LSTM) to make predictions and identify the best model. Additionally, the Mean Squared Error (MSE), Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE) were used to evaluate the performance of the models.
The influenza time series in Shanxi Province from the 1st week in 2010 to the 52nd week in 2019 showed a year-by-year decrease with obvious seasonal characteristics. The peak period of the disease mainly concentrated from the end of the year to the beginning of the next year. The best fitting and prediction performance was the SSA-SARIMA-LSTM model. Compared with the SARIMA model, the MSE, MAE and RMSE of the SSA-SARIMA-LSTM model decreased by 38.12, 17.39 and 21.34%, respectively, in fitting performance; the MSE, MAE and RMSE decreased by 42.41, 18.69 and 24.11%, respectively, in prediction performances. Furthermore, compared with the SARIMA-LSTM model, the MSE, MAE and RMSE of the SSA-SARIMA-LSTM model decreased by 28.26, 14.61 and 15.30%, respectively, in fitting performance; the MSE, MAE and RMSE decreased by 36.99, 7.22 and 20.62%, respectively, in prediction performances.
The fitting and prediction performances of the SSA-SARIMA-LSTM model were better than those of the SARIMA and the SARIMA-LSTM models. Generally speaking, we can apply the SSA-SARIMA-LSTM model to the prediction of influenza, and offer a leg-up for public policy.
流感是一种急性呼吸道传染病,传染性强,严重危害人类健康。合理预测对控制流感疫情具有重要意义。
我们从山西省疾病预防控制中心提取流感数据。采用季节趋势分解局部平均(STL)分析方法,分析了 2010 年第 1 周至 2019 年第 52 周山西省流感的季节特征。为了解决季节性自回归综合移动平均(SARIMA)模型在预测非线性部分的预测性能不足和直接预测原始序列精度较差的问题,本研究建立了 SARIMA 模型、SARIMA 和长短时记忆神经网络(SARIMA-LSTM)组合模型以及基于奇异谱分析(SSA)的 SARIMA-LSTM 组合模型(SSA-SARIMA-LSTM)进行预测,并识别最佳模型。此外,采用均方误差(MSE)、平均绝对误差(MAE)和均方根误差(RMSE)来评估模型的性能。
山西省 2010 年第 1 周至 2019 年第 52 周的流感时间序列呈逐年下降趋势,具有明显的季节性特征。发病高峰期主要集中在年底至次年年初。最佳拟合和预测性能是 SSA-SARIMA-LSTM 模型。与 SARIMA 模型相比,SSA-SARIMA-LSTM 模型在拟合性能方面,MSE、MAE 和 RMSE 分别降低了 38.12%、17.39%和 21.34%;在预测性能方面,MSE、MAE 和 RMSE 分别降低了 42.41%、18.69%和 24.11%。此外,与 SARIMA-LSTM 模型相比,SSA-SARIMA-LSTM 模型在拟合性能方面,MSE、MAE 和 RMSE 分别降低了 28.26%、14.61%和 15.30%;在预测性能方面,MSE、MAE 和 RMSE 分别降低了 36.99%、7.22%和 20.62%。
SSA-SARIMA-LSTM 模型的拟合和预测性能均优于 SARIMA 和 SARIMA-LSTM 模型。一般来说,我们可以将 SSA-SARIMA-LSTM 模型应用于流感预测,为公共政策提供帮助。