Trindade L M, van Berloo R, Fiers M, Visser R G F
Graduate School of Experimental Plant Sciences, Laboratory of Plant Breeding, Department of Plant Sciences, Wageningen University, P.O. Box 386, 6700 AJ Wageningen, The Netherlands.
J Hered. 2005 Sep-Oct;96(5):618-22. doi: 10.1093/jhered/esi094. Epub 2005 Aug 31.
The regulation of gene expression at the transcription initiation level is highly complex and requires the presence of multiple transcription factors. These transcription factors are often proteins or peptides that bind to the so-called cis-acting elements, which are present in the promoter regions and conserved among different species. In order to predict these cis-acting elements, a computer program called PRECISE (Prediction of REgulatory CIS-acting Elements) was developed. The power of the tool lies in its user-friendly interface and in the possibility of using empirical motif frequency tables to filter through the many discovered motifs. The tools to create the empirical motif frequency table (e.g., from a whole genome sequence) are included in the package. In the first case study, the upstream regions of all the genes in the Arabidopsis genome were used to create an empirical motif frequency table and a set of 64 upstream sequences of genes known to be involved in starch metabolism was subjected to analysis by PRECISE. The 20 motifs with the highest specificity in the selected set were analyzed in more detail. Of these 20 motifs, 15 showed a very high or complete homology to the sequences of known cis-acting elements. These cis-acting elements are regulated by light, auxin, and abscisic acid, and confer specific expression in sink organs such as leaves and seeds. All these factors have been shown to play an important role in starch biosynthesis. In the second case study, the upstream regions of 16 genes whose transcription is induced by gibberellins (GA) in Arabidopsis were analyzed with PRECISE and compared to the motifs present in the PLACE database. Among the most promising motifs found by PRECISE were 6 of the 17 known GA motifs. These results indicate the power of the PRECISE software package in the prediction of regulatory elements.
在转录起始水平上的基因表达调控非常复杂,需要多种转录因子的存在。这些转录因子通常是蛋白质或肽,它们与所谓的顺式作用元件结合,这些元件存在于启动子区域且在不同物种间保守。为了预测这些顺式作用元件,开发了一个名为PRECISE(调控顺式作用元件预测)的计算机程序。该工具的强大之处在于其用户友好的界面以及使用经验基序频率表来筛选众多已发现基序的可能性。创建经验基序频率表的工具(例如从全基因组序列创建)包含在该软件包中。在第一个案例研究中,拟南芥基因组中所有基因的上游区域被用于创建一个经验基序频率表,并且一组已知参与淀粉代谢的64个基因的上游序列通过PRECISE进行分析。对所选集合中特异性最高的20个基序进行了更详细的分析。在这20个基序中,有15个与已知顺式作用元件的序列显示出非常高或完全的同源性。这些顺式作用元件受光、生长素和脱落酸调控,并在叶和种子等库器官中赋予特异性表达。所有这些因素都已被证明在淀粉生物合成中起重要作用。在第二个案例研究中,用PRECISE分析了拟南芥中16个其转录由赤霉素(GA)诱导的基因的上游区域,并与PLACE数据库中存在的基序进行比较。PRECISE发现的最有前景的基序中有17个已知GA基序中的6个。这些结果表明了PRECISE软件包在调控元件预测方面的能力。