Geisler Matt, Kleczkowski Leszek A, Karpinski Stanislaw
Department of Plant Physiology, Umeå Plant Science Centre, Umeå University, 901 87 Umeå, Sweden.
Plant J. 2006 Feb;45(3):384-98. doi: 10.1111/j.1365-313X.2005.02634.x.
Short motifs of many cis-regulatory elements (CREs) can be found in the promoters of most Arabidopsis genes, and this raises the question of how their presence can confer specific regulation. We developed a universal algorithm to test the biological significance of CREs by first identifying every Arabidopsis gene with a CRE and then statistically correlating the presence or absence of the element with the gene expression profile on multiple DNA microarrays. This algorithm was successfully verified for previously characterized abscisic acid, ethylene, sucrose and drought responsive CREs in Arabidopsis, showing that the presence of these elements indeed correlates with treatment-specific gene induction. Later, we used standard motif sampling methods to identify 128 putative motifs induced by excess light, reactive oxygen species and sucrose. Our algorithm was able to filter 20 out of 128 novel CREs which significantly correlated with gene induction by either heat, reactive oxygen species and/or sucrose. The position, orientation and sequence specificity of CREs was tested in silicio by analyzing the expression of genes with naturally occurring sequence variations. In three novel CREs the forward orientation correlated with sucrose induction and the reverse orientation with sucrose suppression. The functionality of the predicted novel CREs was experimentally confirmed using Arabidopsis cell-suspension cultures transformed with short promoter fragments or artificial promoters fused with the GUS reporter gene. Our genome-wide analysis opens up new possibilities for in silicio verification of the biological significance of newly discovered CREs, and allows for subsequent selection of such CREs for experimental studies.
在大多数拟南芥基因的启动子中都能发现许多顺式作用元件(CRE)的短基序,这就引发了一个问题,即它们的存在如何赋予特定的调控作用。我们开发了一种通用算法来测试CRE的生物学意义,首先识别每个含有CRE的拟南芥基因,然后在多个DNA微阵列上统计该元件的存在与否与基因表达谱之间的相关性。该算法已成功验证了拟南芥中先前已鉴定的脱落酸、乙烯、蔗糖和干旱响应性CRE,表明这些元件的存在确实与特定处理诱导的基因相关。后来,我们使用标准的基序采样方法来识别由过量光照、活性氧和蔗糖诱导的128个推定基序。我们的算法能够从128个新的CRE中筛选出20个,这些CRE与热、活性氧和/或蔗糖诱导的基因显著相关。通过分析具有自然发生序列变异的基因的表达,在计算机上测试了CRE的位置、方向和序列特异性。在三个新的CRE中,正向与蔗糖诱导相关,反向与蔗糖抑制相关。使用用短启动子片段或与GUS报告基因融合的人工启动子转化的拟南芥细胞悬浮培养物,通过实验证实了预测的新CRE的功能。我们的全基因组分析为在计算机上验证新发现的CRE的生物学意义开辟了新的可能性,并允许随后选择此类CRE进行实验研究。