Institute of Biotechnology, HiLife, University of Helsinki, 00014 Helsinki, Finland.
School of Informatics, University of Edinburgh, UK.
Nucleic Acids Res. 2018 Jul 2;46(W1):W84-W88. doi: 10.1093/nar/gky350.
The unprecedented growth of high-throughput sequencing has led to an ever-widening annotation gap in protein databases. While computational prediction methods are available to make up the shortfall, a majority of public web servers are hindered by practical limitations and poor performance. Here, we introduce PANNZER2 (Protein ANNotation with Z-scoRE), a fast functional annotation web server that provides both Gene Ontology (GO) annotations and free text description predictions. PANNZER2 uses SANSparallel to perform high-performance homology searches, making bulk annotation based on sequence similarity practical. PANNZER2 can output GO annotations from multiple scoring functions, enabling users to see which predictions are robust across predictors. Finally, PANNZER2 predictions scored within the top 10 methods for molecular function and biological process in the CAFA2 NK-full benchmark. The PANNZER2 web server is updated on a monthly schedule and is accessible at http://ekhidna2.biocenter.helsinki.fi/sanspanz/. The source code is available under the GNU Public Licence v3.
高通量测序的空前增长导致蛋白质数据库中的注释缺口不断扩大。虽然有计算预测方法可以弥补这一不足,但大多数公共网络服务器都受到实际限制和性能不佳的阻碍。在这里,我们介绍 PANNZER2(使用 Z 分数进行蛋白质注释),这是一个快速的功能注释网络服务器,提供基因本体论 (GO) 注释和自由文本描述预测。PANNZER2 使用 SANSparallel 执行高性能同源搜索,使得基于序列相似性的批量注释成为可能。PANNZER2 可以从多个评分函数输出 GO 注释,使用户能够看到哪些预测在预测器之间是稳健的。最后,在 CAFA2 NK-full 基准测试中,PANNZER2 的预测在分子功能和生物学过程方面的前 10 种方法中得分最高。PANNZER2 网络服务器每月更新一次,可在 http://ekhidna2.biocenter.helsinki.fi/sanspanz/ 上访问。源代码根据 GNU 公共许可证 v3 提供。