Division of Computer Science, University of Sevilla, ES-41012 Seville, Spain.
Int J Neural Syst. 2021 Mar;31(3):2130001. doi: 10.1142/S0129065721300011. Epub 2021 Feb 16.
In recent years, deep learning techniques have outperformed traditional models in many machine learning tasks. Deep neural networks have successfully been applied to address time series forecasting problems, which is a very important topic in data mining. They have proved to be an effective solution given their capacity to automatically learn the temporal dependencies present in time series. However, selecting the most convenient type of deep neural network and its parametrization is a complex task that requires considerable expertise. Therefore, there is a need for deeper studies on the suitability of all existing architectures for different forecasting tasks. In this work, we face two main challenges: a comprehensive review of the latest works using deep learning for time series forecasting and an experimental study comparing the performance of the most popular architectures. The comparison involves a thorough analysis of seven types of deep learning models in terms of accuracy and efficiency. We evaluate the rankings and distribution of results obtained with the proposed models under many different architecture configurations and training hyperparameters. The datasets used comprise more than 50,000 time series divided into 12 different forecasting problems. By training more than 38,000 models on these data, we provide the most extensive deep learning study for time series forecasting. Among all studied models, the results show that long short-term memory (LSTM) and convolutional networks (CNN) are the best alternatives, with LSTMs obtaining the most accurate forecasts. CNNs achieve comparable performance with less variability of results under different parameter configurations, while also being more efficient.
近年来,深度学习技术在许多机器学习任务中已经超越了传统模型。深度神经网络已经成功地应用于解决时间序列预测问题,这是数据挖掘中一个非常重要的课题。它们被证明是一种有效的解决方案,因为它们能够自动学习时间序列中存在的时间依赖性。然而,选择最合适的深度神经网络类型及其参数化是一项复杂的任务,需要相当的专业知识。因此,需要更深入地研究所有现有架构对于不同预测任务的适用性。在这项工作中,我们面临两个主要挑战:全面回顾最新使用深度学习进行时间序列预测的工作,以及对最流行架构的性能进行实验研究。比较涉及从准确性和效率两个方面对七种类型的深度学习模型进行深入分析。我们评估了在所提出的模型下,在许多不同的架构配置和训练超参数下获得的结果的排名和分布。所使用的数据集包含超过 50000 个时间序列,分为 12 个不同的预测问题。通过在这些数据上训练超过 38000 个模型,我们提供了最广泛的时间序列预测深度学习研究。在所研究的所有模型中,结果表明长短期记忆(LSTM)和卷积网络(CNN)是最佳选择,其中 LSTM 获得了最准确的预测。CNN 在不同参数配置下具有可比较的性能,并且结果的可变性更小,同时效率更高。