Department of Biology, Boston College, Chestnut Hill, MA 02467, USA.
Bioinformatics. 2011 Jun 15;27(12):1691-2. doi: 10.1093/bioinformatics/btr174. Epub 2011 Apr 14.
Analysis of genomic sequencing data requires efficient, easy-to-use access to alignment results and flexible data management tools (e.g. filtering, merging, sorting, etc.). However, the enormous amount of data produced by current sequencing technologies is typically stored in compressed, binary formats that are not easily handled by the text-based parsers commonly used in bioinformatics research.
We introduce a software suite for programmers and end users that facilitates research analysis and data management using BAM files. BamTools provides both the first C++ API publicly available for BAM file support as well as a command-line toolkit.
BamTools was written in C++, and is supported on Linux, Mac OSX and MS Windows. Source code and documentation are freely available at http://github.org/pezmaster31/bamtools.
分析基因组测序数据需要高效、易用的对齐结果访问和灵活的数据管理工具(例如过滤、合并、排序等)。然而,当前测序技术产生的大量数据通常以压缩的二进制格式存储,这使得在生物信息学研究中常用的基于文本的解析器难以处理。
我们为程序员和最终用户引入了一个软件套件,该套件使用 BAM 文件方便研究分析和数据管理。BamTools 提供了第一个公开的用于支持 BAM 文件的 C++ API,以及一个命令行工具包。
BamTools 是用 C++编写的,支持 Linux、Mac OSX 和 MS Windows。源代码和文档可在 http://github.org/pezmaster31/bamtools 上免费获取。