Woolfe Adam, Goode Debbie K, Cooke Julie, Callaway Heather, Smith Sarah, Snell Phil, McEwen Gayle K, Elgar Greg
School of Biological Sciences, Queen Mary, University of London, Mile End Road, London E1 4NS, UK.
BMC Dev Biol. 2007 Aug 30;7:100. doi: 10.1186/1471-213X-7-100.
Comparative genomics is currently one of the most popular approaches to study the regulatory architecture of vertebrate genomes. Fish-mammal genomic comparisons have proved powerful in identifying conserved non-coding elements likely to be distal cis-regulatory modules such as enhancers, silencers or insulators that control the expression of genes involved in the regulation of early development. The scientific community is showing increasing interest in characterizing the function, evolution and language of these sequences. Despite this, there remains little in the way of user-friendly access to a large dataset of such elements in conjunction with the analysis and the visualization tools needed to study them.
Here we present CONDOR (COnserved Non-coDing Orthologous Regions) available at: http://condor.fugu.biology.qmul.ac.uk. In an interactive and intuitive way the website displays data on > 6800 non-coding elements associated with over 120 early developmental genes and conserved across vertebrates. The database regularly incorporates results of ongoing in vivo zebrafish enhancer assays of the CNEs carried out in-house, which currently number approximately 100. Included and highlighted within this set are elements derived from duplication events both at the origin of vertebrates and more recently in the teleost lineage, thus providing valuable data for studying the divergence of regulatory roles between paralogs. CONDOR therefore provides a number of tools and facilities to allow scientists to progress in their own studies on the function and evolution of developmental cis-regulation.
By providing access to data with an approachable graphics interface, the CONDOR database presents a rich resource for further studies into the regulation and evolution of genes involved in early development.
比较基因组学是目前研究脊椎动物基因组调控结构最流行的方法之一。鱼类与哺乳动物的基因组比较已证明在识别可能是远端顺式调控模块的保守非编码元件方面很有成效,这些模块如增强子、沉默子或绝缘子,可控制参与早期发育调控的基因的表达。科学界对表征这些序列的功能、进化和特性的兴趣日益浓厚。尽管如此,对于这样一个包含此类元件的大型数据集,以及研究它们所需的分析和可视化工具,仍缺乏用户友好的访问方式。
我们在此展示了CONDOR(保守非编码直系同源区域),网址为:http://condor.fugu.biology.qmul.ac.uk。该网站以交互式且直观的方式展示了与120多个早期发育基因相关且在脊椎动物中保守的6800多个非编码元件的数据。该数据库定期纳入内部正在进行的斑马鱼体内CNE增强子检测结果,目前约有100个。在这组数据中包含并突出显示了源自脊椎动物起源以及最近硬骨鱼谱系中的复制事件的元件,从而为研究旁系同源物之间调控作用的差异提供了有价值的数据。因此,CONDOR提供了许多工具和设施,使科学家能够在自己关于发育顺式调控的功能和进化研究中取得进展。
通过提供具有易懂图形界面的数据访问方式,CONDOR数据库为进一步研究参与早期发育的基因的调控和进化提供了丰富的资源。