van Helden J, André B, Collado-Vides J
Centro de Investigación sobre Fijación de Nitrógeno, Universidad Nacional Autónoma de México, AP565A Cuernavaca, Morelos, 62100, México.
J Mol Biol. 1998 Sep 4;281(5):827-42. doi: 10.1006/jmbi.1998.1947.
We present here a simple and fast method for isolating DNA binding sites for transcription factors from families of coregulated genes, with results illustrated in Saccharomyces cerevisiae. Although conceptually simple, the algorithm proved efficient for extracting, from most of the yeast regulatory families analyzed, the upstream regulatory sequences that had previously been found by experimental analysis. Furthermore, putative new regulatory sites are predicted within upstream regions of several regulons. The method is based on the detection of over-represented oligonucleotides. A distinctive feature of this approach is that it defines the statistical significance of a site from tables of oligonucleotide frequencies observed in all non-coding sequences of the yeast genome. In contrast with heuristic methods, this oligonucleotide analysis is rigorous and exhaustive. Its range of detection is, however, limited to relatively simple patterns: short motifs with a highly conserved core. These features seem to be shared by a good number of regulatory sites in yeast. This and similar methods should be increasingly needed to identify unknown regulatory elements within the numerous new coregulated families resulting from measurements of gene expression levels at the genomic scale. All tools described here are available on the web at http://copan.cifn.unam.mx/Computational_Biology/yeast-tools
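The core idea, detecting oligonucleotides that occur more often in a family of upstream sequences than a background frequency table predicts, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the binomial model, the single-strand counting, the function names (`count_oligos`, `frequency_table`, `binomial_upper_tail`, `over_represented`), and the `alpha` threshold are all assumptions made for illustration, since the abstract does not specify these details.

```python
from collections import Counter
from math import exp, lgamma, log


def count_oligos(sequences, k):
    """Count every overlapping k-mer across all sequences (one strand only)."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


def frequency_table(background_seqs, k):
    """Relative k-mer frequencies in a background set.

    In the abstract the background is all yeast non-coding sequence;
    here it is whatever sequences the caller supplies.
    """
    counts = count_oligos(background_seqs, k)
    total = sum(counts.values())
    return {oligo: c / total for oligo, c in counts.items()}


def binomial_upper_tail(n, p, x):
    """P(X >= x) for X ~ Binomial(n, p), with terms computed in log space."""
    if x <= 0:
        return 1.0
    if p <= 0.0:
        return 0.0
    if p >= 1.0:
        return 1.0
    total = 0.0
    for j in range(x, n + 1):
        log_term = (lgamma(n + 1) - lgamma(j + 1) - lgamma(n - j + 1)
                    + j * log(p) + (n - j) * log(1.0 - p))
        total += exp(log_term)
    return min(total, 1.0)


def over_represented(family_seqs, background_freq, k, alpha=0.01):
    """Return (oligo, observed, expected, p_value) tuples with p_value < alpha.

    Assumption: occurrences follow a binomial law with success probability
    taken from the background table. Oligos absent from the table are skipped.
    """
    counts = count_oligos(family_seqs, k)
    n = sum(counts.values())  # number of k-mer positions scanned
    hits = []
    for oligo, observed in counts.items():
        p = background_freq.get(oligo)
        if p is None:
            continue
        p_value = binomial_upper_tail(n, p, observed)
        if p_value < alpha:
            hits.append((oligo, observed, n * p, p_value))
    return sorted(hits, key=lambda h: h[3])


if __name__ == "__main__":
    # Toy, invented sequences sharing one hexanucleotide; not real yeast data.
    family = ["TTCACGTGTT", "GGCACGTGAA", "ACCACGTGCC"]
    # Toy background: a uniform frequency for every observed hexanucleotide.
    background = {oligo: 1 / 4096 for oligo in count_oligos(family, 6)}
    for oligo, obs, exp_occ, p_value in over_represented(family, background, 6, alpha=1e-4):
        print(oligo, obs, f"{exp_occ:.4f}", f"{p_value:.2e}")
```

Because every possible oligonucleotide of length k is tested, a real analysis would need a threshold that accounts for the number of tests; the fixed `alpha` here is only a placeholder for that choice.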