Wala Jeremiah, Beroukhim Rameen
The Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA.
Bioinformatics and Integrative Genomics, Harvard University, Cambridge, MA 02138, USA.
Bioinformatics. 2017 Mar 1;33(5):751-753. doi: 10.1093/bioinformatics/btw741.
We present SeqLib, a C ++ API and command line tool that provides a rapid and user-friendly interface to BAM/SAM/CRAM files, global sequence alignment operations and sequence assembly. Four C libraries perform core operations in SeqLib: HTSlib for BAM access, BWA-MEM and BLAT for sequence alignment and Fermi for error correction and sequence assembly. Benchmarking indicates that SeqLib has lower CPU and memory requirements than leading C ++ sequence analysis APIs. We demonstrate an example of how minimal SeqLib code can extract, error-correct and assemble reads from a CRAM file and then align with BWA-MEM. SeqLib also provides additional capabilities, including chromosome-aware interval queries and read plotting. Command line tools are available for performing integrated error correction, micro-assemblies and alignment.
SeqLib is available on Linux and OSX for the C ++98 standard and later at github.com/walaj/SeqLib. SeqLib is released under the Apache2 license. Additional capabilities for BLAT alignment are available under the BLAT license.
我们展示了SeqLib,它是一个C++ API和命令行工具,为BAM/SAM/CRAM文件、全局序列比对操作和序列组装提供了一个快速且用户友好的界面。四个C库在SeqLib中执行核心操作:用于访问BAM的HTSlib、用于序列比对的BWA-MEM和BLAT,以及用于纠错和序列组装的Fermi。基准测试表明,SeqLib比领先的C++序列分析API具有更低的CPU和内存需求。我们展示了一个示例,说明最少的SeqLib代码如何从CRAM文件中提取、纠错和组装读段,然后与BWA-MEM进行比对。SeqLib还提供了其他功能,包括染色体感知区间查询和读段绘图。有命令行工具可用于执行集成纠错、微组装和比对。
SeqLib可在Linux和OSX上用于C++98标准及更高版本,位于github.com/walaj/SeqLib。SeqLib根据Apache2许可发布。BLAT比对的其他功能根据BLAT许可提供。