Biozentrum, University of Basel, and Swiss Institute of Bioinformatics, Klingelbergstrasse 50/70, CH-4056 Basel, Switzerland.
Nucleic Acids Res. 2013 Jan;41(Database issue):D214-20. doi: 10.1093/nar/gks1145. Epub 2012 Nov 24.
Identification of genomic regulatory elements is essential for understanding the dynamics of cellular processes. This task has been substantially facilitated by the availability of genome sequences for many species and high-throughput data of transcripts and transcription factor (TF) binding. However, rigorous computational methods are necessary to derive accurate genome-wide annotations of regulatory sites from such data. SwissRegulon (http://swissregulon.unibas.ch) is a database containing genome-wide annotations of regulatory motifs, promoters and TF binding sites (TFBSs) in promoter regions across model organisms. Its binding site predictions were obtained with rigorous Bayesian probabilistic methods that operate on orthologous regions from related genomes, and use explicit evolutionary models to assess the evidence of purifying selection on each site. New in the current version of SwissRegulon is a curated collection of 190 mammalian regulatory motifs associated with ∼340 TFs, and TFBS annotations across a curated set of ∼35 000 promoters in both human and mouse. Predictions of TFBSs for Saccharomyces cerevisiae have also been significantly extended and now cover 158 of yeast's ∼180 TFs. All data are accessible through both an easily navigable genome browser with search functions, and as flat files that can be downloaded for further analysis.
鉴定基因组调控元件对于理解细胞过程的动态至关重要。随着许多物种的基因组序列和转录本及转录因子(TF)结合的高通量数据的可用性,这项任务得到了极大的促进。然而,需要严格的计算方法才能从这些数据中得出准确的全基因组调控位点注释。SwissRegulon(http://swissregulon.unibas.ch)是一个数据库,包含了模型生物中启动子区域的全基因组调控基序、启动子和转录因子结合位点(TFBS)的注释。其结合位点预测是使用严格的贝叶斯概率方法获得的,该方法基于相关基因组的同源区域进行操作,并使用明确的进化模型来评估每个位点上纯化选择的证据。SwissRegulon 当前版本的新增内容是一个经过精心整理的与约 340 个 TF 相关的 190 个哺乳动物调控基序的集合,以及在人类和小鼠中精心挑选的约 35000 个启动子的 TFBS 注释。酿酒酵母的 TFBS 预测也得到了显著扩展,现在涵盖了酵母约 180 个 TF 中的 158 个。所有数据都可以通过具有搜索功能的易于导航的基因组浏览器以及可下载进行进一步分析的平面文件进行访问。