Hvidsten Torgeir R, Wilczyński Bartosz, Kryshtafovych Andriy, Tiuryn Jerzy, Komorowski Jan, Fidelis Krzysztof
The Linnaeus Centre for Bioinformatics, Uppsala University, 751 24 Uppsala, Sweden.
Genome Res. 2005 Jun;15(6):856-66. doi: 10.1101/gr.3760605.
Transcription factors regulate expression by binding selectively to sequence sites in cis-regulatory regions of genes. It is therefore reasonable to assume that genes regulated by the same transcription factors should all contain the corresponding binding sites in their regulatory regions and exhibit similar expression profiles as measured by, for example, microarray technology. We have used this assumption to analyze genome-wide yeast binding-site and microarray expression data to reveal the combinatorial nature of gene regulation. We obtained IF-THEN rules linking binding-site combinations (binding-site modules) to genes with particular expression profiles, and thereby provided testable hypotheses on the combinatorial coregulation of gene expression. We showed that genes associated with such rules have a significantly higher probability of being bound by the same transcription factors, as indicated by a genome-wide location analysis, than genes associated with only common binding sites or similar expression. Furthermore, we also found that such genes were significantly more often biologically related in terms of Gene Ontology annotations than genes only associated with common binding sites or similar expression. We analyzed expression data collected under different sets of stress conditions and found many binding-site modules that are conserved over several of these condition sets, as well as modules that are specific to particular biological responses. Our results on the reoccurrence of binding sites in different modules provide specific data on how binding sites may be combined to allow a large number of expression outcomes using relatively few transcription factors.
转录因子通过选择性地结合基因顺式调控区域中的序列位点来调节基因表达。因此可以合理推测,受相同转录因子调控的基因在其调控区域应都含有相应的结合位点,并表现出相似的表达谱,例如通过微阵列技术测量得到的表达谱。我们利用这一推测分析了全基因组酵母结合位点和微阵列表达数据,以揭示基因调控的组合性质。我们获得了将结合位点组合(结合位点模块)与具有特定表达谱的基因联系起来的“如果-那么”规则,从而为基因表达的组合协同调控提供了可检验的假设。我们表明,如全基因组定位分析所示,与这些规则相关的基因被相同转录因子结合的概率显著高于仅与常见结合位点或相似表达相关的基因。此外,我们还发现,就基因本体注释而言,这类基因在生物学上的相关性也显著高于仅与常见结合位点或相似表达相关的基因。我们分析了在不同应激条件下收集的表达数据,发现了许多在多个条件组中都保守的结合位点模块,以及特定于特定生物学反应的模块。我们关于不同模块中结合位点重现性的结果提供了具体数据,说明了结合位点如何组合以利用相对较少的转录因子实现大量的表达结果。