Mahdi Mahmoudi Seyed, Wit Ernst C
Department of Statistics, Faculty of Mathematics, Statistics and Computer Science, Semnan University, Semnan, Iran.
Johann Bernoulli Institute (FWN), Rijksuniversiteit Groningen Faculteit voor Wiskunde en Natuurwetenschappen, Groningen, Netherlands.
Int J Biostat. 2018 Sep 1;14(2):/j/ijb.2018.14.issue-2/ijb-2018-0030/ijb-2018-0030.xml. doi: 10.1515/ijb-2018-0030.
One of the basic aims of science is to unravel the chain of cause and effect of particular systems. Especially for large systems, this can be a daunting task. Detailed interventional and randomized data sampling approaches can be used to resolve the causality question, but for many systems, such interventions are impossible or too costly to obtain. Recently, Maathuis et al. (2010), following ideas from Spirtes et al. (2000), introduced a framework to estimate causal effects in large scale Gaussian systems. By describing the causal network as a directed acyclic graph it is a possible to estimate a class of Markov equivalent systems that describe the underlying causal interactions consistently, even for non-Gaussian systems. In these systems, causal effects stop being linear and cannot be described any more by a single coefficient. In this paper, we derive the general functional form of a causal effect in a large subclass of non-Gaussian distributions, called the non-paranormal. We also derive a convenient approximation, which can be used effectively in estimation. We show that the estimate is consistent under certain conditions and we apply the method to an observational gene expression dataset of the Arabidopsis thaliana circadian clock system.
科学的基本目标之一是揭示特定系统的因果链。尤其是对于大型系统而言,这可能是一项艰巨的任务。详细的干预性和随机数据采样方法可用于解决因果关系问题,但对于许多系统来说,这种干预是不可能的,或者获取成本过高。最近,马休伊斯等人(2010年)借鉴斯皮尔特斯等人(2000年)的观点,引入了一个框架来估计大规模高斯系统中的因果效应。通过将因果网络描述为有向无环图,即使对于非高斯系统,也能够估计出一类能一致描述潜在因果相互作用的马尔可夫等价系统。在这些系统中,因果效应不再是线性的,无法再用单个系数来描述。在本文中,我们推导了非高斯分布的一个大型子类(称为非正态分布)中因果效应的一般函数形式。我们还推导了一种方便的近似方法,可有效地用于估计。我们表明,在某些条件下该估计是一致的,并将该方法应用于拟南芥生物钟系统的一个观测基因表达数据集。