Emad Amin, Milenkovic Olgica
Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America.
PLoS One. 2014 Mar 12;9(3):e90781. doi: 10.1371/journal.pone.0090781. eCollection 2014.
We introduce a novel algorithm for inference of causal gene interactions, termed CaSPIAN (Causal Subspace Pursuit for Inference and Analysis of Networks), which is based on coupling compressive sensing and Granger causality techniques. The core of the approach is to discover sparse linear dependencies between shifted time series of gene expressions using a sequential list-version of the subspace pursuit reconstruction algorithm and to estimate the direction of gene interactions via Granger-type elimination. The method is conceptually simple and computationally efficient, and it allows for dealing with noisy measurements. Its performance as a stand-alone platform without biological side-information was tested on simulated networks, on the synthetic IRMA network in Saccharomyces cerevisiae, and on data pertaining to the human HeLa cell network and the SOS network in E. coli. The results produced by CaSPIAN are compared to the results of several related algorithms, demonstrating significant improvements in inference accuracy of documented interactions. These findings highlight the importance of Granger causality techniques for reducing the number of false-positives, as well as the influence of noise and sampling period on the accuracy of the estimates. In addition, the performance of the method was tested in conjunction with biological side information of the form of sparse "scaffold networks", to which new edges were added using available RNA-seq or microarray data. These biological priors aid in increasing the sensitivity and precision of the algorithm in the small sample regime.
我们介绍了一种用于推断因果基因相互作用的新算法,称为CaSPIAN(用于网络推断和分析的因果子空间追踪),它基于压缩感知和格兰杰因果技术的结合。该方法的核心是使用子空间追踪重建算法的顺序列表版本来发现基因表达的移位时间序列之间的稀疏线性依赖性,并通过格兰杰型消除来估计基因相互作用的方向。该方法概念简单且计算效率高,并且能够处理有噪声的测量。在模拟网络、酿酒酵母中的合成IRMA网络以及与人类HeLa细胞网络和大肠杆菌中的SOS网络相关的数据上测试了其作为无生物学辅助信息的独立平台的性能。将CaSPIAN产生的结果与几种相关算法的结果进行了比较,结果表明在已记录相互作用的推断准确性方面有显著提高。这些发现突出了格兰杰因果技术对于减少假阳性数量的重要性,以及噪声和采样周期对估计准确性的影响。此外,还结合稀疏“支架网络”形式的生物学辅助信息测试了该方法的性能,利用可用的RNA测序或微阵列数据向其中添加新的边。这些生物学先验有助于在小样本情况下提高算法的灵敏度和精度。