da Silva Tiago Tiburcio, Francisquini Rodrigo, Nascimento Mariá C V
Instituto de Ciência e Tecnologia, Universidade Federal de São Paulo (UNIFESP), Av. Cesare M. G. Lattes, 1201, Eugênio de Mello, São José dos Campos-SP, CEP: 12247-014, Brazil.
Expert Syst Appl. 2021 Nov 15;182:115190. doi: 10.1016/j.eswa.2021.115190. Epub 2021 May 18.
In 2020, Brazil was the leading country in COVID-19 cases in Latin America, and capital cities were the most severely affected by the outbreak. Climates vary in Brazil due to the territorial extension of the country, its relief, geography, and other factors. Since the most common COVID-19 symptoms are related to the respiratory system, many researchers have studied the correlation between the number of COVID-19 cases with meteorological variables like temperature, humidity, rainfall, etc. Also, due to its high transmission rate, some researchers have analyzed the impact of human mobility on the dynamics of COVID-19 transmission. There is a dearth of literature that considers these two variables when predicting the spread of COVID-19 cases. In this paper, we analyzed the correlation between the number of COVID-19 cases and human mobility, and meteorological data in Brazilian capitals. We found that the correlation between such variables depends on the regions where the cities are located. We employed the variables with a significant correlation with COVID-19 cases to predict the number of COVID-19 infections in all Brazilian capitals and proposed a prediction method combining the Ensemble Empirical Mode Decomposition (EEMD) method with the Autoregressive Integrated Moving Average Exogenous inputs (ARIMAX) method, which we called EEMD-ARIMAX. After analyzing the results poor predictions were further investigated using a signal processing-based anomaly detection method. Computational tests showed that EEMD-ARIMAX achieved a forecast 26.73% better than ARIMAX. Moreover, an improvement of 30.69% in the average root mean squared error (RMSE) was noticed when applying the EEMD-ARIMAX method to the data normalized after the anomaly detection.
2020年,巴西是拉丁美洲新冠肺炎病例数最多的国家,首都受疫情影响最为严重。巴西因国土面积、地形、地理位置等因素,气候多样。由于新冠肺炎最常见的症状与呼吸系统有关,许多研究人员研究了新冠肺炎病例数与温度、湿度、降雨量等气象变量之间的相关性。此外,由于其高传播率,一些研究人员分析了人员流动对新冠肺炎传播动态的影响。在预测新冠肺炎病例传播时,很少有文献考虑这两个变量。在本文中,我们分析了巴西首都新冠肺炎病例数与人员流动以及气象数据之间的相关性。我们发现这些变量之间的相关性取决于城市所在的地区。我们使用与新冠肺炎病例有显著相关性的变量来预测巴西所有首都的新冠肺炎感染人数,并提出了一种将总体经验模态分解(EEMD)方法与自回归积分移动平均外生输入(ARIMAX)方法相结合的预测方法,我们称之为EEMD - ARIMAX。在分析结果后,使用基于信号处理的异常检测方法对较差的预测进行了进一步研究。计算测试表明,EEMD - ARIMAX的预测效果比ARIMAX好26.73%。此外,将EEMD - ARIMAX方法应用于异常检测后归一化的数据时,平均均方根误差(RMSE)提高了30.69%。