Computational Molecular Biology, MPI for Molecular Genetics, Berlin, Germany.
PLoS Comput Biol. 2013;9(9):e1003168. doi: 10.1371/journal.pcbi.1003168. Epub 2013 Sep 5.
Histone modifications are known to play an important role in the regulation of transcription. While individual modifications have received much attention in genome-wide analyses, little is known about their relationships. Some authors have built Bayesian networks of modifications, however most often they have used discretized data, and relied on unrealistic assumptions such as the absence of feedback mechanisms or hidden confounding factors. Here, we propose to infer undirected networks based on partial correlations between histone modifications. Within the partial correlation framework, correlations among two variables are controlled for associations induced by the other variables. Partial correlation networks thus focus on direct associations of histone modifications. We apply this methodology to data in CD4+ cells. The resulting network is well supported by common knowledge. When pairs of modifications show a large difference between their correlation and their partial correlation, a potential confounding factor is identified and provided as explanation. Data from different cell types (IMR90, H1) is also exploited in the analysis to assess the stability of the networks. The results are remarkably similar across cell types. Based on this observation, the networks from the three cell types are integrated into a consensus network to increase robustness. The data and the results discussed in the manuscript can be found, together with code, on http://spcn.molgen.mpg.de/index.html.
组蛋白修饰被认为在转录调控中起着重要作用。虽然个别修饰在全基因组分析中受到了广泛关注,但它们之间的关系却知之甚少。一些作者已经构建了修饰的贝叶斯网络,但大多数情况下,他们使用了离散化的数据,并依赖于不切实际的假设,如不存在反馈机制或隐藏的混杂因素。在这里,我们建议基于组蛋白修饰之间的偏相关来推断无向网络。在偏相关框架内,两个变量之间的相关性受到其他变量引起的关联的控制。因此,偏相关网络专注于组蛋白修饰的直接关联。我们将这种方法应用于 CD4+细胞中的数据。所得到的网络得到了广泛的共识的支持。当两个修饰之间的相关性与其偏相关性之间存在较大差异时,就会确定潜在的混杂因素,并提供解释。还利用来自不同细胞类型(IMR90、H1)的数据来评估网络的稳定性。结果在细胞类型之间非常相似。基于这一观察结果,将来自三种细胞类型的网络整合到一个共识网络中,以提高稳健性。本文讨论的数据和结果可以在 http://spcn.molgen.mpg.de/index.html 上找到,同时还有代码。