Ponce-Flores Mirna, Frausto-Solís Juan, Santamaría-Bonfil Guillermo, Pérez-Ortega Joaquín, González-Barbosa Juan J
Graduate Program Division, Tecnológico Nacional de México/Instituto Tecnológico de Ciudad Madero, Cd. Madero 89440, Mexico.
Information Technologies Department, Consejo Nacional de Ciencia y Tecnología-Instituto Nacional de Electricidad y Energías Limpias, Cuernavaca 62490, Mexico.
Entropy (Basel). 2020 Jan 10;22(1):89. doi: 10.3390/e22010089.
Entropy is a key concept in the characterization of uncertainty for any given signal, and its extensions such as Spectral Entropy and Permutation Entropy. They have been used to measure the complexity of time series. However, these measures are subject to the discretization employed to study the states of the system, and identifying the relationship between complexity measures and the expected performance of the four selected forecasting methods that participate in the M4 Competition. This relationship allows the decision, in advance, of which algorithm is adequate. Therefore, in this paper, we found the relationships between entropy-based complexity framework and the forecasting error of four selected methods (Smyl, Theta, ARIMA, and ETS). Moreover, we present a framework extension based on the Emergence, Self-Organization, and Complexity paradigm. The experimentation with both synthetic and M4 Competition time series show that the feature space induced by complexities, visually constrains the forecasting method performance to specific regions; where the logarithm of its metric error is poorer, the Complexity based on the emergence and self-organization is maximal.
熵是表征任何给定信号不确定性的关键概念,以及它的扩展,如谱熵和排列熵。它们已被用于测量时间序列的复杂性。然而,这些度量受用于研究系统状态的离散化的影响,并且要确定复杂性度量与参与M4竞赛的四种选定预测方法的预期性能之间的关系。这种关系允许提前决定哪种算法是合适的。因此,在本文中,我们发现了基于熵的复杂性框架与四种选定方法(Smyl、Theta、ARIMA和ETS)的预测误差之间的关系。此外,我们提出了一个基于涌现、自组织和复杂性范式的框架扩展。对合成时间序列和M4竞赛时间序列的实验表明,由复杂性诱导的特征空间在视觉上把预测方法的性能限制在特定区域;在其度量误差的对数较差的地方,基于涌现和自组织的复杂性最大。