Surasinghe Sudam, Bollt Erik M
Department of Mathematics, Clarkson University, Potsdam, NY 13699, USA.
Department of Electrical and Computer Engineering, Clarkson Center for Complex Systems Science (C3S2), Clarkson University, Potsdam, NY 13699, USA.
Entropy (Basel). 2020 Mar 30;22(4):396. doi: 10.3390/e22040396.
Causal inference is perhaps one of the most fundamental concepts in science, beginning originally from the works of some of the ancient philosophers, through today, but also weaved strongly in current work from statisticians, machine learning experts, and scientists from many other fields. This paper takes the perspective of information flow, which includes the Nobel prize winning work on Granger-causality, and the recently highly popular transfer entropy, these being probabilistic in nature. Our main contribution will be to develop analysis tools that will allow a geometric interpretation of information flow as a causal inference indicated by positive transfer entropy. We will describe the effective dimensionality of an underlying manifold as projected into the outcome space that summarizes information flow. Therefore, contrasting the probabilistic and geometric perspectives, we will introduce a new measure of causal inference based on the fractal correlation dimension conditionally applied to competing explanations of future forecasts, which we will write G e o C y → x . This avoids some of the boundedness issues that we show exist for the transfer entropy, T y → x . We will highlight our discussions with data developed from synthetic models of successively more complex nature: these include the Hénon map example, and finally a real physiological example relating breathing and heart rate function.
因果推断或许是科学中最基本的概念之一,它最初源于一些古代哲学家的著作,一直延续至今,并且在统计学家、机器学习专家以及许多其他领域的科学家的当前工作中也有着紧密的联系。本文从信息流的角度出发,其中包括关于格兰杰因果关系的诺贝尔奖获奖成果以及最近非常流行的转移熵,它们本质上都是概率性的。我们的主要贡献将是开发分析工具,这些工具将允许把信息流进行几何解释,作为由正转移熵所表明的因果推断。我们将描述潜在流形投影到总结信息流的结果空间中的有效维度。因此,对比概率性和几何性视角,我们将引入一种基于分形相关维度的因果推断新度量,它有条件地应用于未来预测的竞争性解释,我们将其记为GeoCy→x。这避免了我们所表明的转移熵Ty→x存在的一些有界性问题。我们将通过从性质上越来越复杂的合成模型所生成的数据来突出我们的讨论:这些包括亨农映射示例,最后是一个关于呼吸和心率功能的真实生理示例。