Koutlis Christos, Kugiumtzis Dimitris
Information Technologies Institute, Centre of Research and Technology Hellas, 57001 Thessaloniki, Greece.
Department of Electrical and Computer Engineering, Aristotle University of Thessaloniki, University Campus, 54124 Thessaloniki, Greece.
Entropy (Basel). 2021 Feb 8;23(2):208. doi: 10.3390/e23020208.
Many methods of Granger causality, or broadly termed connectivity, have been developed to assess the causal relationships between the system variables based only on the information extracted from the time series. The power of these methods to capture the true underlying connectivity structure has been assessed using simulated dynamical systems where the ground truth is known. Here, we consider the presence of an unobserved variable that acts as a hidden source for the observed high-dimensional dynamical system and study the effect of the hidden source on the estimation of the connectivity structure. In particular, the focus is on estimating the direct causality effects in high-dimensional time series (not including the hidden source) of relatively short length. We examine the performance of a linear and a nonlinear connectivity measure using dimension reduction and compare them to a linear measure designed for latent variables. For the simulations, four systems are considered, the coupled Hénon maps system, the coupled Mackey-Glass system, the neural mass model and the vector autoregressive (VAR) process, each comprising 25 subsystems (variables for VAR) at close chain coupling structure and another subsystem (variable for VAR) driving all others acting as the hidden source. The results show that the direct causality measures estimate, in general terms, correctly the existing connectivity in the absence of the source when its driving is zero or weak, yet fail to detect the actual relationships when the driving is strong, with the nonlinear measure of dimension reduction performing best. An example from finance including and excluding the USA index in the global market indices highlights the different performance of the connectivity measures in the presence of hidden source.
许多格兰杰因果关系方法,或者广义上称为连通性方法,已经被开发出来,用于仅基于从时间序列中提取的信息来评估系统变量之间的因果关系。这些方法捕捉真实潜在连通性结构的能力已经通过已知真实情况的模拟动态系统进行了评估。在这里,我们考虑存在一个未观测变量,它作为观测到的高维动态系统的隐藏源,并研究隐藏源对连通性结构估计的影响。特别地,重点是估计相对较短长度的高维时间序列(不包括隐藏源)中的直接因果效应。我们使用降维方法研究了一种线性和一种非线性连通性度量的性能,并将它们与为潜在变量设计的线性度量进行比较。对于模拟,考虑了四个系统,即耦合的亨农映射系统、耦合的麦基 - 格拉斯系统、神经质量模型和向量自回归(VAR)过程,每个系统在紧密链耦合结构下包含25个子系统(VAR的变量),以及另一个驱动所有其他子系统的子系统(VAR的变量)作为隐藏源。结果表明,一般来说,当隐藏源的驱动为零或较弱时,直接因果度量能够正确估计不存在该源时的现有连通性,但当驱动较强时,它们无法检测到实际关系,其中降维的非线性度量表现最佳。一个来自金融领域的例子,包括和排除全球市场指数中的美国指数,突出了在存在隐藏源时连通性度量的不同表现。