Budden David M, Hurley Daniel G, Cursons Joseph, Markham John F, Davis Melissa J, Crampin Edmund J
Systems Biology Laboratory, Melbourne School of Engineering, The University of Melbourne, 3010 Parkville, Australia ; NICTA Victoria Research Laboratory, The University of Melbourne, 3010 Parkville, Australia.
Systems Biology Laboratory, Melbourne School of Engineering, The University of Melbourne, 3010 Parkville, Australia.
Epigenetics Chromatin. 2014 Nov 24;7(1):36. doi: 10.1186/1756-8935-7-36. eCollection 2014.
Transcription factors (TFs) and histone modifications (HMs) play critical roles in gene expression by regulating mRNA transcription. Modelling frameworks have been developed to integrate high-throughput omics data, with the aim of elucidating the regulatory logic that results from the interactions of DNA, TFs and HMs. These models have yielded an unexpected and poorly understood result: that TFs and HMs are statistically redundant in explaining mRNA transcript abundance at a genome-wide level.
We constructed predictive models of gene expression by integrating RNA-sequencing, TF and HM chromatin immunoprecipitation sequencing and DNase I hypersensitivity data for two mammalian cell types. All models identified genome-wide statistical redundancy both within and between TFs and HMs, as previously reported. To investigate potential explanations, groups of genes were constructed for ontology-classified biological processes. Predictive models were constructed for each process to explore the distribution of statistical redundancy. We found significant variation in the predictive capacity of TFs and HMs across these processes and demonstrated the predictive power of HMs to be inversely proportional to process enrichment for housekeeping genes.
It is well established that the roles played by TFs and HMs are not functionally redundant. Instead, we attribute the statistical redundancy reported in this and previous genome-wide modelling studies to the heterogeneous distribution of HMs across chromatin domains. Furthermore, we conclude that statistical redundancy between individual TFs can be readily explained by nucleosome-mediated cooperative binding. This could possibly help the cell confer regulatory robustness by rejecting signalling noise and allowing control via multiple pathways.
转录因子(TFs)和组蛋白修饰(HMs)通过调节mRNA转录在基因表达中发挥关键作用。已经开发了建模框架来整合高通量组学数据,目的是阐明由DNA、TFs和HMs相互作用产生的调控逻辑。这些模型产生了一个意想不到且理解不足的结果:在全基因组水平上,TFs和HMs在解释mRNA转录本丰度方面在统计学上是冗余的。
我们通过整合两种哺乳动物细胞类型的RNA测序、TF和HM染色质免疫沉淀测序以及DNase I超敏数据构建了基因表达预测模型。如先前报道的那样,所有模型都在TFs和HMs内部以及它们之间识别出了全基因组范围内的统计学冗余。为了探究潜在的解释,针对本体分类的生物学过程构建了基因组。为每个过程构建了预测模型,以探索统计学冗余的分布。我们发现TFs和HMs在这些过程中的预测能力存在显著差异,并证明HMs的预测能力与管家基因的过程富集呈反比。
TFs和HMs所起的作用在功能上并非冗余,这一点已得到充分证实。相反,我们将本研究及之前全基因组建模研究中报道的统计学冗余归因于HMs在染色质结构域中的异质分布。此外,我们得出结论,单个TF之间的统计学冗余可以很容易地通过核小体介导的协同结合来解释。这可能有助于细胞通过拒绝信号噪声并允许通过多种途径进行控制来赋予调控稳健性。