Waterhouse Robert M, Zdobnov Evgeny M, Tegenfeldt Fredrik, Li Jia, Kriventseva Evgenia V
Department of Genetic Medicine and Development, University of Geneva Medical School, Swiss Institute of Bioinformatics, rue Michel-Servet 1, 1211 Geneva, Switzerland.
Nucleic Acids Res. 2011 Jan;39(Database issue):D283-8. doi: 10.1093/nar/gkq930. Epub 2010 Oct 23.
The concept of homology drives speculation on a gene's function in any given species when its biological roles in other species are characterized. With reference to a specific species radiation homologous relations define orthologs, i.e. descendants from a single gene of the ancestor. The large-scale delineation of gene genealogies is a challenging task, and the numerous approaches to the problem reflect the importance of the concept of orthology as a cornerstone for comparative studies. Here, we present the updated OrthoDB catalog of eukaryotic orthologs delineated at each radiation of the species phylogeny in an explicitly hierarchical manner of over 100 species of vertebrates, arthropods and fungi (including the metazoa level). New database features include functional annotations, and quantification of evolutionary divergence and relations among orthologous groups. The interface features extended phyletic profile querying and enhanced text-based searches. The ever-increasing sampling of sequenced eukaryotic genomes brings a clearer account of the majority of gene genealogies that will facilitate informed hypotheses of gene function in newly sequenced genomes. Furthermore, uniform analysis across lineages as different as vertebrates, arthropods and fungi with divergence levels varying from several to hundreds of millions of years will provide essential data for uncovering and quantifying long-term trends of gene evolution. OrthoDB is freely accessible from http://cegg.unige.ch/orthodb.
当一个基因在其他物种中的生物学作用得以明确时,同源性概念会促使人们推测该基因在任何特定物种中的功能。参照特定物种的进化分支,同源关系定义了直系同源基因,即来自祖先单一基因的后代。大规模描绘基因谱系是一项具有挑战性的任务,针对该问题的众多方法反映了直系同源概念作为比较研究基石的重要性。在此,我们展示了经过更新的真核生物直系同源基因目录(OrthoDB),该目录以明确的层次结构方式描绘了超过100种脊椎动物、节肢动物和真菌(包括后生动物级别)在物种系统发育的每个进化分支中的直系同源基因。新的数据库功能包括功能注释,以及对直系同源基因群之间进化差异和关系的量化。其界面具有扩展的系统发育谱查询和增强的基于文本的搜索功能。测序真核基因组样本的不断增加,使得对大多数基因谱系有了更清晰的认识,这将有助于对新测序基因组中的基因功能提出有根据的假设。此外,对脊椎动物、节肢动物和真菌等不同谱系进行统一分析,这些谱系的分化水平从几百万年到几亿年不等,将为揭示和量化基因进化的长期趋势提供重要数据。可从http://cegg.unige.ch/orthodb免费访问OrthoDB。