Department of Computer Science, University of New Mexico, Albuquerque, NM 87131, USA.
Bioinformatics. 2011 Jul 1;27(13):1832-8. doi: 10.1093/bioinformatics/btr270. Epub 2011 May 5.
Condition-specific networks capture system-wide behavior under varying conditions such as environmental stresses, cell types or tissues. These networks frequently comprise parts that are unique to each condition, and parts that are shared among related conditions. Existing approaches for learning condition-specific networks typically identify either only differences or only similarities across conditions. Most of these approaches first learn networks per condition independently, and then identify similarities and differences in a post-learning step. Such approaches do not exploit the shared information across conditions during network learning.
We describe an approach for learning condition-specific networks that identifies the shared and unique subgraphs during network learning simultaneously, rather than as a post-processing step. Our approach learns networks across condition sets, shares data from different conditions and produces high-quality networks that capture biologically meaningful information. On simulated data, our approach outperformed an existing approach that learns networks independently for each condition, especially for small training datasets. On microarray data of hundreds of deletion mutants in two, yeast stationary-phase cell populations, the inferred network structure identified several common and population-specific effects of these deletion mutants and several high-confidence cases of double-deletion pairs, which can be experimentally tested. Our results are consistent with and extend the existing knowledge base of differentiated cell populations in yeast stationary phase.
C++ code can be accessed from http://www.broadinstitute.org/~sroy/condspec/ .
条件特定网络在不同条件下(如环境压力、细胞类型或组织)捕获系统范围的行为。这些网络通常包含特定于每个条件的部分,以及在相关条件之间共享的部分。学习条件特定网络的现有方法通常仅识别条件之间的差异或相似性。这些方法中的大多数首先独立地学习每个条件的网络,然后在学习后的步骤中识别相似性和差异。这些方法在网络学习过程中没有利用条件之间的共享信息。
我们描述了一种学习条件特定网络的方法,该方法在网络学习过程中同时识别共享和独特的子图,而不是作为后处理步骤。我们的方法跨条件集学习网络,共享来自不同条件的数据,并生成捕获生物学有意义信息的高质量网络。在模拟数据上,我们的方法优于为每个条件独立学习网络的现有方法,尤其是对于小的训练数据集。在酵母静止期两个细胞群体的数百个缺失突变体的微阵列数据上,推断出的网络结构确定了这些缺失突变体的几个共同和群体特异性效应,以及几个可以进行实验测试的高可信度双缺失对情况。我们的结果与酵母静止期分化细胞群体的现有知识库一致,并进行了扩展。
C++ 代码可从 http://www.broadinstitute.org/~sroy/condspec/ 访问。