Huerta-Cepas Jaime, Bueno Anibal, Dopazo Joaquín, Gabaldón Toni
Bioinformatics Department, Centro de Investigación Príncipe Felipe, Avda. Autopista del Saler, 13 Valencia 46013, Spain.
Nucleic Acids Res. 2008 Jan;36(Database issue):D491-6. doi: 10.1093/nar/gkm899. Epub 2007 Oct 25.
The complete collection of evolutionary histories of all genes in a genome, also known as phylome, constitutes a valuable source of information. The reconstruction of phylomes has been previously prevented by large demands of time and computer power, but is now feasible thanks to recent developments in computers and algorithms. To provide a publicly available repository of complete phylomes that allows researchers to access and store large-scale phylogenomic analyses, we have developed PhylomeDB. PhylomeDB is a database of complete phylomes derived for different genomes within a specific taxonomic range. All phylomes in the database are built using a high-quality phylogenetic pipeline that includes evolutionary model testing and alignment trimming phases. For each genome, PhylomeDB provides the alignments, phylogentic trees and tree-based orthology predictions for every single encoded protein. The current version of PhylomeDB includes the phylomes of Human, the yeast Saccharomyces cerevisiae and the bacterium Escherichia coli, comprising a total of 32 289 seed sequences with their corresponding alignments and 172 324 phylogenetic trees. PhylomeDB can be publicly accessed at http://phylomedb.bioinfo.cipf.es.
基因组中所有基因进化历史的完整集合,也称为系统发育基因组,构成了一个有价值的信息来源。以前,由于对时间和计算机能力的巨大需求,系统发育基因组的重建受到阻碍,但由于计算机和算法的最新发展,现在是可行的。为了提供一个可公开访问的完整系统发育基因组库,使研究人员能够访问和存储大规模系统发育基因组分析,我们开发了系统发育基因组数据库(PhylomeDB)。PhylomeDB是一个针对特定分类范围内不同基因组推导的完整系统发育基因组数据库。数据库中的所有系统发育基因组都是使用高质量的系统发育流程构建的,该流程包括进化模型测试和比对修剪阶段。对于每个基因组,PhylomeDB为每个编码蛋白质提供比对、系统发育树和基于树的直系同源预测。PhylomeDB的当前版本包括人类、酿酒酵母和大肠杆菌的系统发育基因组,共有32289个种子序列及其相应的比对和172324个系统发育树。可通过http://phylomedb.bioinfo.cipf.es公开访问PhylomeDB。