Suppr超能文献

SPAdes:一种新的基因组组装算法及其在单细胞测序中的应用

SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

作者信息

Bankevich Anton, Nurk Sergey, Antipov Dmitry, Gurevich Alexey A, Dvorkin Mikhail, Kulikov Alexander S, Lesin Valery M, Nikolenko Sergey I, Pham Son, Prjibelski Andrey D, Pyshkin Alexey V, Sirotkin Alexander V, Vyahhi Nikolay, Tesler Glenn, Alekseyev Max A, Pevzner Pavel A

机构信息

Algorithmic Biology Laboratory, St. Petersburg Academic University, Russian Academy of Sciences, St. Petersburg, Russia.

出版信息

J Comput Biol. 2012 May;19(5):455-77. doi: 10.1089/cmb.2012.0021. Epub 2012 Apr 16.

Abstract

The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.

摘要

在各种环境中,大部分细菌无法在实验室中克隆,因此无法使用现有技术进行测序。单细胞基因组学的一个主要目标是通过未培养生物的全基因组组装来补充以基因为中心的宏基因组数据。由于读取覆盖度高度不均匀以及测序错误和嵌合读取水平升高,单细胞数据的组装具有挑战性。我们描述了SPAdes,一种用于单细胞和标准(多细胞)组装的新型组装器,并证明它在最近发布的E+V-SC组装器(专门用于单细胞数据)以及流行的组装器Velvet和SoapDeNovo(用于多细胞数据)的基础上有所改进。SPAdes生成单细胞组装体,提供有关不可培养细菌基因组的信息,远远超过通过传统宏基因组学研究所获得的信息。SPAdes可在线获取(http://bioinf.spbau.ru/spades)。它作为开源软件分发。

相似文献

1
SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.
J Comput Biol. 2012 May;19(5):455-77. doi: 10.1089/cmb.2012.0021. Epub 2012 Apr 16.
2
Assembling single-cell genomes and mini-metagenomes from chimeric MDA products.
J Comput Biol. 2013 Oct;20(10):714-37. doi: 10.1089/cmb.2013.0084.
3
IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth.
Bioinformatics. 2012 Jun 1;28(11):1420-8. doi: 10.1093/bioinformatics/bts174. Epub 2012 Apr 11.
4
Assembling short reads from jumping libraries with large insert sizes.
Bioinformatics. 2015 Oct 15;31(20):3262-8. doi: 10.1093/bioinformatics/btv337. Epub 2015 Jun 3.
5
Efficient de novo assembly of single-cell bacterial genomes from short-read data sets.
Nat Biotechnol. 2011 Sep 18;29(10):915-21. doi: 10.1038/nbt.1966.
6
hybridSPAdes: an algorithm for hybrid assembly of short and long reads.
Bioinformatics. 2016 Apr 1;32(7):1009-15. doi: 10.1093/bioinformatics/btv688. Epub 2015 Nov 20.
7
EPGA-SC : A Framework for de novo Assembly of Single-Cell Sequencing Reads.
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jul-Aug;18(4):1492-1503. doi: 10.1109/TCBB.2019.2945761. Epub 2021 Aug 6.
8
Fragmentation and Coverage Variation in Viral Metagenome Assemblies, and Their Effect in Diversity Calculations.
Front Bioeng Biotechnol. 2015 Sep 17;3:141. doi: 10.3389/fbioe.2015.00141. eCollection 2015.
9
ExSPAnder: a universal repeat resolver for DNA fragment assembly.
Bioinformatics. 2014 Jun 15;30(12):i293-301. doi: 10.1093/bioinformatics/btu266.
10
TruSPAdes: barcode assembly of TruSeq synthetic long reads.
Nat Methods. 2016 Mar;13(3):248-50. doi: 10.1038/nmeth.3737. Epub 2016 Feb 1.

引用本文的文献

2
Gabija restricts phages that antagonize a conserved host DNA repair complex.
bioRxiv. 2025 Aug 30:2025.08.30.673261. doi: 10.1101/2025.08.30.673261.
3
Complete mitochondrial genomes of biological control stains in the Rifai complex (strains DL1-3, KC1-1, and PAR10) isolated from Californian grapevines.
Mitochondrial DNA B Resour. 2025 Sep 2;10(10):893-898. doi: 10.1080/23802359.2025.2552822. eCollection 2025.
4
Next-generation sequencing applications in food science: fundamentals and recent advances.
Front Bioeng Biotechnol. 2025 Aug 20;13:1638957. doi: 10.3389/fbioe.2025.1638957. eCollection 2025.
7
The global genomic landscape of hypervirulent from 1932 to 2021.
mLife. 2025 Aug 24;4(4):378-396. doi: 10.1002/mlf2.70029. eCollection 2025 Aug.
8
Evaluation of shotgun metagenomics as a diagnostic tool for infectious gastroenteritis.
PLoS One. 2025 Sep 2;20(9):e0331288. doi: 10.1371/journal.pone.0331288. eCollection 2025.
9
10
Application of a comprehensive approach to pathogen screening in a stowaway rat on an airplane.
Sci Rep. 2025 Aug 30;15(1):31963. doi: 10.1038/s41598-025-13199-6.

本文引用的文献

1
Single-cell dissection of transcriptional heterogeneity in human colon tumors.
Nat Biotechnol. 2011 Nov 13;29(12):1120-7. doi: 10.1038/nbt.2038.
2
Paired de bruijn graphs: a novel approach for incorporating mate pair information into genome assemblers.
J Comput Biol. 2011 Nov;18(11):1625-34. doi: 10.1089/cmb.2011.0151. Epub 2011 Oct 14.
3
Efficient de novo assembly of single-cell bacterial genomes from short-read data sets.
Nat Biotechnol. 2011 Sep 18;29(10):915-21. doi: 10.1038/nbt.1966.
4
Partial genome assembly for a candidate division OP11 single cell from an anoxic spring (Zodletone Spring, Oklahoma).
Appl Environ Microbiol. 2011 Nov;77(21):7804-14. doi: 10.1128/AEM.06059-11. Epub 2011 Sep 9.
5
Error correction of high-throughput sequencing datasets with non-uniform coverage.
Bioinformatics. 2011 Jul 1;27(13):i137-41. doi: 10.1093/bioinformatics/btr208.
6
Characterization of the single-cell transcriptional landscape by highly multiplex RNA-seq.
Genome Res. 2011 Jul;21(7):1160-7. doi: 10.1101/gr.110882.110. Epub 2011 May 4.
8
Tumour evolution inferred by single-cell sequencing.
Nature. 2011 Apr 7;472(7341):90-4. doi: 10.1038/nature09807. Epub 2011 Mar 13.
9
Genome of a low-salinity ammonia-oxidizing archaeon determined by single-cell and metagenomic analysis.
PLoS One. 2011 Feb 22;6(2):e16626. doi: 10.1371/journal.pone.0016626.
10
High-quality draft assemblies of mammalian genomes from massively parallel sequence data.
Proc Natl Acad Sci U S A. 2011 Jan 25;108(4):1513-8. doi: 10.1073/pnas.1017351108. Epub 2010 Dec 27.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验