Zhu Zhou, Shendure Jay, Church George M
Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA.
Genome Res. 2005 Jun;15(6):848-55. doi: 10.1101/gr.3394405.
With the completion of full genome sequences and advancement in high-throughput technologies, in silico methods have been successfully used to integrate diverse data sources toward unraveling the combinatorial nature of transcriptional regulation. So far, almost all of these studies are restricted to lower eukaryotes such as budding yeast. We describe here a computational search for functional transcription-factor (TF) combinations using phylogenetically conserved sequences and microarray-based expression data. Taking into account both orientational and positional constraints, we investigated the overrepresentation of binding sites in the vicinity of one another and whether these combinations result in more coherent expression profiles. Without any prior biological knowledge, the search led to the discovery of several experimentally established TF associations, as well as some novel ones. In particular, we identified a regulatory module controlling cell cycle-dependent transcription of G2-M genes and expanded its functional generality. We also detected many homotypic combinations, supporting the importance of binding-site density in transcriptional regulation of higher eukaryotes.
随着全基因组序列的完成以及高通量技术的进步,计算机方法已成功用于整合各种数据源,以揭示转录调控的组合性质。到目前为止,几乎所有这些研究都局限于诸如芽殖酵母等低等真核生物。我们在此描述一种使用系统发育保守序列和基于微阵列的表达数据对功能性转录因子(TF)组合进行的计算搜索。考虑到方向和位置限制,我们研究了彼此附近结合位点的过度富集情况以及这些组合是否导致更一致的表达谱。在没有任何先验生物学知识的情况下,该搜索导致发现了一些实验确定的TF关联以及一些新的关联。特别是,我们鉴定出一个控制G2 - M期基因细胞周期依赖性转录的调控模块,并扩展了其功能普遍性。我们还检测到许多同型组合,支持了结合位点密度在高等真核生物转录调控中的重要性。