Chang Li-Wei, Nagarajan Rakesh, Magee Jeffrey A, Milbrandt Jeffrey, Stormo Gary D
Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA.
Genome Res. 2006 Mar;16(3):405-13. doi: 10.1101/gr.4303406. Epub 2006 Jan 31.
An important aspect of understanding a biological pathway is to delineate the transcriptional regulatory mechanisms of the genes involved. Two important tasks are often encountered when studying transcription regulation, i.e., (1) the identification of common transcriptional regulators of a set of coexpressed genes; (2) the identification of genes that are regulated by one or several transcription factors. In this study, a systematic and statistical approach was taken to accomplish these tasks by establishing an integrated model considering all of the promoters and characterized transcription factors (TFs) in the genome. A promoter analysis pipeline (PAP) was developed to implement this approach. PAP was tested using coregulated gene clusters collected from the literature. In most test cases, PAP identified the transcription regulators of the input genes accurately. When compared with chromatin immunoprecipitation experiment data, PAP's predictions are consistent with the experimental observations. When PAP was used to analyze one published expression-profiling data set and two novel coregulated gene sets, PAP was able to generate biologically meaningful hypotheses. Therefore, by taking a systematic approach of considering all promoters and characterized TFs in our model, we were able to make more reliable predictions about the regulation of gene expression in mammalian organisms.
理解生物途径的一个重要方面是描绘所涉及基因的转录调控机制。在研究转录调控时,通常会遇到两个重要任务,即:(1)识别一组共表达基因的共同转录调节因子;(2)识别受一个或几个转录因子调控的基因。在本研究中,我们采用了一种系统的统计方法来完成这些任务,即通过建立一个综合模型,该模型考虑了基因组中的所有启动子和已表征的转录因子(TFs)。我们开发了一个启动子分析流程(PAP)来实施这种方法。使用从文献中收集的共调控基因簇对PAP进行了测试。在大多数测试案例中,PAP准确地识别了输入基因的转录调节因子。与染色质免疫沉淀实验数据相比,PAP的预测与实验观察结果一致。当使用PAP分析一个已发表的表达谱数据集和两个新的共调控基因集时,PAP能够产生具有生物学意义的假设。因此,通过在我们的模型中采用考虑所有启动子和已表征TFs的系统方法,我们能够对哺乳动物生物体中的基因表达调控做出更可靠的预测。