Center for Information Biology, National Institute of Genetics, Research Organization of Information and Systems, 1111 Yata, Mishima 411-8540, Japan.
Bioinformatics. 2018 Mar 15;34(6):1037-1039. doi: 10.1093/bioinformatics/btx713.
We developed a prokaryotic genome annotation pipeline, DFAST, that also supports genome submission to public sequence databases. DFAST was originally started as an on-line annotation server, and to date, over 7000 jobs have been processed since its first launch in 2016. Here, we present a newly implemented background annotation engine for DFAST, which is also available as a standalone command-line program. The new engine can annotate a typical-sized bacterial genome within 10 min, with rich information such as pseudogenes, translation exceptions and orthologous gene assignment between given reference genomes. In addition, the modular framework of DFAST allows users to customize the annotation workflow easily and will also facilitate extensions for new functions and incorporation of new tools in the future.
The software is implemented in Python 3 and runs in both Python 2.7 and 3.4-on Macintosh and Linux systems. It is freely available at https://github.com/nigyta/dfast_core/under the GPLv3 license with external binaries bundled in the software distribution. An on-line version is also available at https://dfast.nig.ac.jp/.
Supplementary data are available at Bioinformatics online.
我们开发了一个原核基因组注释流水线 DFAST,它还支持将基因组提交到公共序列数据库。DFAST 最初是作为一个在线注释服务器启动的,自 2016 年首次推出以来,已经处理了超过 7000 个作业。在这里,我们展示了一个新实现的 DFAST 背景注释引擎,它也可以作为一个独立的命令行程序使用。新引擎可以在 10 分钟内注释一个典型大小的细菌基因组,提供丰富的信息,如假基因、翻译异常和给定参考基因组之间的同源基因分配。此外,DFAST 的模块化框架允许用户轻松定制注释工作流程,并将有助于未来为新功能扩展和整合新工具。
该软件是用 Python 3 实现的,在 Macintosh 和 Linux 系统中都可以在 Python 2.7 和 3.4 上运行。它在 GPLv3 许可证下可在 https://github.com/nigyta/dfast_core/ 上免费获得,外部二进制文件捆绑在软件分发中。在线版本也可在 https://dfast.nig.ac.jp/ 获得。
补充数据可在生物信息学在线获得。