de Oliveira Joao F L, Silva Eraylson G, de Mattos Neto Paulo S G
IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):3251-3263. doi: 10.1109/TNNLS.2021.3051384. Epub 2022 Aug 3.
Hybrid systems, which combine statistical and machine learning (ML) techniques using residual (error forecasting) modeling, have been highlighted in the literature due to their accuracy and ability to forecast time series with different characteristics. In these architectures, a crucial task is the proper modeling of the residuals since they may present random fluctuations, complex nonlinear patterns, and heteroscedastic behavior. Hence, the selection, specification, and training of one ML model to forecast the residuals are costly and challenging tasks since issues, such as underfitting, overfitting, and misspecification, can lead to a system with low accuracy or even deteriorate the linear forecast of the time series. This article proposes a hybrid system, named dynamic residual forecasting (DReF), that employs a modified dynamic selection (DS) algorithm to decide: the most suitable ML model to forecast a pattern of the residual series and if it is a promising candidate to increase the accuracy of the time series forecast from the linear combination. Thus, the DReF aims to reduce the uncertainty of the ML model selection and avoid the deterioration of the time series forecast. Furthermore, the proposed system searches for the most suitable parameters of the DS algorithm for each data set. In this article, the proposed method uses a pool of five ML models widely adopted in the literature: multilayer perceptron, support vector regression, radial basis function, long short-term memory, and convolutional neural network. An experimental evaluation was conducted using ten well-known time series. The results show that the DReF obtains superior results for the majority of the data sets compared with single and hybrid models of the literature.
混合系统将统计和机器学习(ML)技术相结合,采用残差(误差预测)建模,因其准确性以及预测具有不同特征的时间序列的能力而在文献中受到关注。在这些架构中,一项关键任务是对残差进行恰当建模,因为它们可能呈现随机波动、复杂的非线性模式和异方差行为。因此,选择、指定和训练一个ML模型来预测残差是代价高昂且具有挑战性的任务,因为诸如欠拟合、过拟合和错误指定等问题可能导致系统准确性低,甚至会使时间序列的线性预测恶化。本文提出了一种名为动态残差预测(DReF)的混合系统,该系统采用一种改进的动态选择(DS)算法来决定:预测残差序列模式的最合适的ML模型,以及它是否是通过线性组合提高时间序列预测准确性的有前景的候选模型。因此,DReF旨在减少ML模型选择的不确定性,并避免时间序列预测的恶化。此外,所提出的系统为每个数据集寻找DS算法的最合适参数。在本文中,所提出的方法使用了文献中广泛采用的五个ML模型的集合:多层感知器、支持向量回归、径向基函数、长短期记忆和卷积神经网络。使用十个著名的时间序列进行了实验评估。结果表明,与文献中的单一模型和混合模型相比,DReF在大多数数据集上都取得了优异的结果。