Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou 450001, China.
Int J Environ Res Public Health. 2022 May 12;19(10):5910. doi: 10.3390/ijerph19105910.
Acquired immune deficiency syndrome (AIDS) is a serious public health problem. This study aims to establish a combined model of seasonal autoregressive integrated moving average (SARIMA) and Prophet models based on an L1-norm to predict the incidence of AIDS in Henan province, China. The monthly incidences of AIDS in Henan province from 2012 to 2020 were obtained from the Health Commission of Henan Province. A SARIMA model, a Prophet model, and two combined models were adopted to fit the monthly incidence of AIDS using the data from January 2012 to December 2019. The data from January 2020 to December 2020 was used to verify. The mean square error (MSE), mean absolute error (MAE), and mean absolute percentage error (MAPE) were used to compare the prediction effect among the models. The results showed that the monthly incidence fluctuated from 0.05 to 0.50 per 100,000 individuals, and the monthly incidence of AIDS had a certain periodicity in Henan province. In addition, the prediction effect of the Prophet model was better than SARIMA model, the combined model was better than the single models, and the combined model based on the L1-norm had the best effect values (MSE = 0.0056, MAE = 0.0553, MAPE = 43.5337). This indicated that, compared with the L2-norm, the L1-norm improved the prediction accuracy of the combined model. The combined model of SARIMA and Prophet based on the L1-norm is a suitable method to predict the incidence of AIDS in Henan. Our findings can provide theoretical evidence for the government to formulate policies regarding AIDS prevention.
获得性免疫缺陷综合征(AIDS)是一个严重的公共卫生问题。本研究旨在建立基于 L1 范数的季节性自回归综合移动平均(SARIMA)和 Prophet 模型的组合模型,以预测中国河南省 AIDS 的发病率。从河南省卫生健康委员会获得了河南省 2012 年至 2020 年每月 AIDS 的发病率。采用 SARIMA 模型、Prophet 模型和两种组合模型,使用 2012 年 1 月至 2019 年 12 月的数据拟合 AIDS 每月发病率。使用 2020 年 1 月至 12 月的数据进行验证。使用均方误差(MSE)、平均绝对误差(MAE)和平均绝对百分比误差(MAPE)来比较模型之间的预测效果。结果表明,每月发病率在每 10 万人中有 0.05 至 0.50 人之间波动,河南省 AIDS 的每月发病率具有一定的周期性。此外,Prophet 模型的预测效果优于 SARIMA 模型,组合模型优于单一模型,基于 L1 范数的组合模型具有最佳的效果值(MSE=0.0056,MAE=0.0553,MAPE=43.5337)。这表明,与 L2 范数相比,L1 范数提高了组合模型的预测精度。基于 L1 范数的 SARIMA 和 Prophet 的组合模型是预测河南省 AIDS 发病率的合适方法。我们的研究结果可以为政府制定艾滋病预防政策提供理论依据。