QB3 Institute, University of California, Berkeley, Berkeley, CA 94720-1762, USA.
Nucleic Acids Res. 2013 Jul;41(Web Server issue):W242-8. doi: 10.1093/nar/gkt399. Epub 2013 May 18.
The PhyloFacts 'Fast Approximate Tree Classification' (FAT-CAT) web server provides a novel approach to ortholog identification using subtree hidden Markov model-based placement of protein sequences to phylogenomic orthology groups in the PhyloFacts database. Results on a data set of microbial, plant and animal proteins demonstrate FAT-CAT's high precision at separating orthologs and paralogs and robustness to promiscuous domains. We also present results documenting the precision of ortholog identification based on subtree hidden Markov model scoring. The FAT-CAT phylogenetic placement is used to derive a functional annotation for the query, including confidence scores and drill-down capabilities. PhyloFacts' broad taxonomic and functional coverage, with >7.3 M proteins from across the Tree of Life, enables FAT-CAT to predict orthologs and assign function for most sequence inputs. Four pipeline parameter presets are provided to handle different sequence types, including partial sequences and proteins containing promiscuous domains; users can also modify individual parameters. PhyloFacts trees matching the query can be viewed interactively online using the PhyloScope Javascript tree viewer and are hyperlinked to various external databases. The FAT-CAT web server is available at http://phylogenomics.berkeley.edu/phylofacts/fatcat/.
PhyloFacts 的“Fast Approximate Tree Classification”(FAT-CAT)网络服务器提供了一种新颖的方法,用于使用基于子树隐马尔可夫模型的蛋白质序列在 PhyloFacts 数据库中的系统发育同源物组中进行同源物识别。对微生物、植物和动物蛋白质数据集的结果表明,FAT-CAT 在分离同源物和同系物方面具有很高的精度,并且对混杂结构域具有鲁棒性。我们还提供了基于子树隐马尔可夫模型评分的同源物识别精度的结果。FAT-CAT 的系统发育定位用于为查询提供功能注释,包括置信度得分和向下钻取功能。PhyloFacts 的广泛的分类学和功能覆盖范围,涵盖了来自生命之树的超过 730 万种蛋白质,使 FAT-CAT 能够预测同源物并为大多数序列输入分配功能。提供了四个管道参数预设来处理不同的序列类型,包括部分序列和包含混杂结构域的蛋白质;用户还可以修改单个参数。与查询匹配的 PhyloFacts 树可以使用 PhyloScope JavaScript 树查看器在线交互式查看,并链接到各种外部数据库。FAT-CAT 网络服务器可在 http://phylogenomics.berkeley.edu/phylofacts/fatcat/ 获得。