SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors

Bankevich Anton, Nurk Sergey, Antipov Dmitry, Gurevich Alexey A, Dvorkin Mikhail, Kulikov Alexander S, Lesin Valery M, Nikolenko Sergey I, Pham Son, Prjibelski Andrey D, Pyshkin Alexey V, Sirotkin Alexander V, Vyahhi Nikolay, Tesler Glenn, Alekseyev Max A, Pevzner Pavel A

Affiliation

Algorithmic Biology Laboratory, St. Petersburg Academic University, Russian Academy of Sciences, St. Petersburg, Russia.

Publication information

J Comput Biol. 2012 May;19(5):455-77. doi: 10.1089/cmb.2012.0021. Epub 2012 Apr 16.

Abstract

The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.
