Walther Dirk, Brunnemann Roman, Selbig Joachim
Max Planck Institute for Molecular Plant Physiology, Potsdam, Germany.
PLoS Genet. 2007 Feb 9;3(2):e11. doi: 10.1371/journal.pgen.0030011. Epub 2006 Dec 6.
Regulation of gene expression via specific cis-regulatory promoter elements has evolved in cellular organisms as a major adaptive mechanism to respond to environmental change. Assuming a simple model of transcriptional regulation, genes that are differentially expressed in response to a large number of different external stimuli should harbor more distinct regulatory elements in their upstream regions than do genes that only respond to few environmental challenges. We tested this hypothesis in Arabidopsis thaliana using the compendium of gene expression profiling data available in AtGenExpress and known cis-element motifs mapped to upstream gene promoter regions and studied the relation of the observed breadth of differential gene expression response with several fundamental genome architectural properties. We observed highly significant positive correlations between the density of cis-elements in upstream regions and the number of conditions in which a gene was differentially regulated. The correlation was most pronounced in regions immediately upstream of the transcription start sites. Multistimuli response genes were observed to be associated with significantly longer upstream intergenic regions, retain more paralogs in the Arabidopsis genome, are shorter, have fewer introns, and are more likely to contain TATA-box motifs in their promoters. In abiotic stress time series data, multistimuli response genes were found to be overrepresented among early-responding genes. Genes involved in the regulation of transcription, stress response, and signaling processes were observed to possess the greatest regulatory capacity. Our results suggest that greater gene expression regulatory complexity appears to be encoded by an increased density of cis-regulatory elements and provide further evidence for an evolutionary adaptation of the regulatory code at the genomic layout level. Larger intergenic spaces preceding multistimuli response genes may have evolved to allow greater regulatory gene expression potential.
通过特定的顺式调控启动子元件对基因表达进行调控,已在细胞生物体中演变为一种应对环境变化的主要适应性机制。假设存在一个简单的转录调控模型,那么响应大量不同外部刺激而差异表达的基因,其上游区域应比仅响应少数环境挑战的基因含有更多不同的调控元件。我们利用AtGenExpress中可用的基因表达谱数据汇编以及映射到上游基因启动子区域的已知顺式元件基序,在拟南芥中验证了这一假设,并研究了观察到的差异基因表达响应广度与几个基本基因组结构特性之间的关系。我们观察到上游区域顺式元件的密度与基因差异调控的条件数量之间存在高度显著的正相关。这种相关性在转录起始位点上游紧邻区域最为明显。观察到多刺激响应基因与明显更长的上游基因间区域相关,在拟南芥基因组中保留更多旁系同源基因,基因更短,内含子更少,并且其启动子中更有可能包含TATA盒基序。在非生物胁迫时间序列数据中,发现多刺激响应基因在早期响应基因中占比过高。参与转录调控、应激反应和信号传导过程的基因被观察到具有最大的调控能力。我们的结果表明,更高的基因表达调控复杂性似乎由顺式调控元件密度的增加所编码,并为基因组布局水平上调控密码的进化适应性提供了进一步证据。多刺激响应基因之前更大的基因间空间可能已经进化,以允许更大的调控基因表达潜力。