Institute of Computer Science of the Czech Academy of Sciences, Czech Academy of Sciences, Pod vodarenskou vezi 271/2, 182 07 Prague, Czech Republic.
Chaos. 2020 Jan;30(1):013117. doi: 10.1063/1.5115267.
Estimating causal interactions in complex dynamical systems is an important problem encountered in many fields of current science. While a theoretical solution for detecting the causal interactions has been previously formulated in the framework of prediction improvement, it generally requires the computation of high-dimensional information functionals-a situation invoking the curse of dimensionality with increasing network size. Recently, several methods have been proposed to alleviate this problem, based on iterative procedures for the assessment of conditional (in)dependences. In the current work, we bring a comparison of several such prominent approaches. This is done both by theoretical comparison of the algorithms using a formulation in a common framework and by numerical simulations including realistic complex coupling patterns. The theoretical analysis highlights the key similarities and differences between the algorithms, hinting on their comparative strengths and weaknesses. The method assumptions and specific properties such as false positive control and order-dependence are discussed. Numerical simulations suggest that while the accuracy of most of the algorithms is almost indistinguishable, there are substantial differences in their computational demands, ranging theoretically from polynomial to exponential complexity and leading to substantial differences in computation time in realistic scenarios depending on the density and size of networks. Based on the analysis of the algorithms and numerical simulations, we propose a hybrid approach providing competitive accuracy with improved computational efficiency.
在许多当前科学领域中,估计复杂动力系统中的因果相互作用是一个重要问题。虽然之前已经在预测改进的框架中提出了用于检测因果相互作用的理论解决方案,但它通常需要计算高维信息泛函 - 随着网络规模的增加,这种情况会引发维度诅咒。最近,已经提出了几种基于条件(不)独立性评估的迭代程序来缓解此问题的方法。在当前的工作中,我们对几种此类突出方法进行了比较。这是通过使用通用框架中的公式对算法进行理论比较以及通过包括现实复杂耦合模式的数值模拟来完成的。理论分析突出了算法之间的关键相似性和差异,暗示了它们的相对优势和劣势。讨论了方法假设和特定属性,例如假阳性控制和顺序依赖性。数值模拟表明,虽然大多数算法的准确性几乎无法区分,但它们的计算需求存在很大差异,理论上从多项式到指数复杂度不等,并且根据网络的密度和大小,在实际场景中导致计算时间有很大差异。基于对算法和数值模拟的分析,我们提出了一种混合方法,提供了具有改进计算效率的竞争准确性。