Suppr超能文献

BpWrapper:基于 BioPerl 的序列和树实用程序,用于快速原型化生物信息学管道。

BpWrapper: BioPerl-based sequence and tree utilities for rapid prototyping of bioinformatics pipelines.

机构信息

Department of Biological Sciences, Hunter College, City University of New York, New York, 10065, USA.

Graduate Program in Bioinformatics, Boston University, Boston, MA, 02215, USA.

出版信息

BMC Bioinformatics. 2018 Mar 2;19(1):76. doi: 10.1186/s12859-018-2074-9.

Abstract

BACKGROUND

Automated bioinformatics workflows are more robust, easier to maintain, and results more reproducible when built with command-line utilities than with custom-coded scripts. Command-line utilities further benefit by relieving bioinformatics developers to learn the use of, or to interact directly with, biological software libraries. There is however a lack of command-line utilities that leverage popular Open Source biological software toolkits such as BioPerl ( http://bioperl.org ) to make many of the well-designed, robust, and routinely used biological classes available for a wider base of end users.

RESULTS

Designed as standard utilities for UNIX-family operating systems, BpWrapper makes functionality of some of the most popular BioPerl modules readily accessible on the command line to novice as well as to experienced bioinformatics practitioners. The initial release of BpWrapper includes four utilities with concise command-line user interfaces, bioseq, bioaln, biotree, and biopop, specialized for manipulation of molecular sequences, sequence alignments, phylogenetic trees, and DNA polymorphisms, respectively. Over a hundred methods are currently available as command-line options and new methods are easily incorporated. Performance of BpWrapper utilities lags that of precompiled utilities while equivalent to that of other utilities based on BioPerl. BpWrapper has been tested on BioPerl Release 1.6, Perl versions 5.10.1 to 5.25.10, and operating systems including Apple macOS, Microsoft Windows, and GNU/Linux. Release code is available from the Comprehensive Perl Archive Network (CPAN) at https://metacpan.org/pod/Bio::BPWrapper . Source code is available on GitHub at https://github.com/bioperl/p5-bpwrapper .

CONCLUSIONS

BpWrapper improves on existing sequence utilities by following the design principles of Unix text utilities such including a concise user interface, extensive command-line options, and standard input/output for serialized operations. Further, dozens of novel methods for manipulation of sequences, alignments, and phylogenetic trees, unavailable in existing utilities (e.g., EMBOSS, Newick Utilities, and FAST), are provided. Bioinformaticians should find BpWrapper useful for rapid prototyping of workflows on the command-line without creating custom scripts for comparative genomics and other bioinformatics applications.

摘要

背景

与自定义编码脚本相比,使用命令行实用程序构建自动化生物信息学工作流更稳健、更易于维护且结果更具可重复性。命令行实用程序还通过使生物信息学开发人员无需学习使用或直接与生物软件库交互,从而受益。但是,缺乏利用流行的开源生物软件工具包(例如 BioPerl(http://bioperl.org))的命令行实用程序,无法使许多设计良好、稳健且经常使用的生物类可用于更广泛的终端用户基础。

结果

作为 UNIX 系列操作系统的标准实用程序设计,BpWrapper 使一些最受欢迎的 BioPerl 模块的功能在命令行上对新手和有经验的生物信息学从业者都易于访问。BpWrapper 的初始版本包括四个实用程序,它们具有简洁的命令行用户界面,分别专门用于处理分子序列、序列比对、系统发育树和 DNA 多态性,bioseq、bioaln、biotree 和 biopop。目前有一百多种方法可作为命令行选项使用,并且可以轻松合并新方法。BpWrapper 实用程序的性能落后于预编译实用程序,但其性能与基于 BioPerl 的其他实用程序相当。BpWrapper 已在 BioPerl 版本 1.6、Perl 版本 5.10.1 到 5.25.10 以及包括 Apple macOS、Microsoft Windows 和 GNU/Linux 在内的操作系统上进行了测试。发布代码可从 Comprehensive Perl Archive Network(CPAN)(https://metacpan.org/pod/Bio::BPWrapper)获得。源代码可在 GitHub(https://github.com/bioperl/p5-bpwrapper)上获得。

结论

BpWrapper 通过遵循 Unix 文本实用程序的设计原则(包括简洁的用户界面、广泛的命令行选项以及序列化操作的标准输入/输出),改进了现有的序列实用程序。此外,还提供了数十种用于处理序列、比对和系统发育树的新方法,这些方法在现有的实用程序中不可用(例如,EMBOSS、Newick Utilities 和 FAST)。生物信息学家应该发现 BpWrapper 在命令行上快速原型设计工作流时非常有用,而无需为比较基因组学和其他生物信息学应用程序创建自定义脚本。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b30a/5833151/1b01121574aa/12859_2018_2074_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验