Department of Statistics, University of Michigan, Ann Arbor, Michigan 48109, USA.
Bioinformatics. 2010 Sep 15;26(18):i517-23. doi: 10.1093/bioinformatics/btq377.
Components of biological systems interact with each other in order to carry out vital cell functions. Such information can be used to improve estimation and inference, and to obtain better insights into the underlying cellular mechanisms. Discovering regulatory interactions among genes is therefore an important problem in systems biology. Whole-genome expression data over time provides an opportunity to determine how the expression levels of genes are affected by changes in transcription levels of other genes, and can therefore be used to discover regulatory interactions among genes.
In this article, we propose a novel penalization method, called truncating lasso, for estimation of causal relationships from time-course gene expression data. The proposed penalty can correctly determine the order of the underlying time series, and improves the performance of the lasso-type estimators. Moreover, the resulting estimate provides information on the time lag between activation of transcription factors and their effects on regulated genes. We provide an efficient algorithm for estimation of model parameters, and show that the proposed method can consistently discover causal relationships in the large p, small n setting. The performance of the proposed model is evaluated favorably in simulated, as well as real, data examples.
The proposed truncating lasso method is implemented in the R-package 'grangerTlasso' and is freely available at http://www.stat.lsa.umich.edu/~shojaie/.
生物系统的组成部分相互作用,以执行重要的细胞功能。这些信息可用于改进估计和推断,并更好地了解潜在的细胞机制。因此,发现基因之间的调控相互作用是系统生物学中的一个重要问题。随着时间的推移,全基因组表达数据提供了一个机会,可以确定基因的表达水平如何受到其他基因转录水平变化的影响,因此可以用于发现基因之间的调控相互作用。
在本文中,我们提出了一种新的惩罚方法,称为截断lasso,用于从时间序列基因表达数据中估计因果关系。所提出的惩罚可以正确确定潜在时间序列的顺序,并提高lasso 型估计量的性能。此外,所得估计值提供了关于转录因子激活与其对调节基因的影响之间时间滞后的信息。我们提供了一种用于估计模型参数的有效算法,并表明所提出的方法可以在大 p、小 n 的设置中一致地发现因果关系。所提出的模型在模拟和真实数据示例中的性能都得到了很好的评估。
所提出的截断lasso 方法在 R 包“grangerTlasso”中实现,并可在 http://www.stat.lsa.umich.edu/~shojaie/ 上免费获得。