Suppr超能文献

下一代测序数据的组装算法。

Assembly algorithms for next-generation sequencing data.

机构信息

J. Craig Venter Institute, Rockville, MD 20850-3343, USA.

出版信息

Genomics. 2010 Jun;95(6):315-27. doi: 10.1016/j.ygeno.2010.03.001. Epub 2010 Mar 6.

Abstract

The emergence of next-generation sequencing platforms led to resurgence of research in whole-genome shotgun assembly algorithms and software. DNA sequencing data from the Roche 454, Illumina/Solexa, and ABI SOLiD platforms typically present shorter read lengths, higher coverage, and different error profiles compared with Sanger sequencing data. Since 2005, several assembly software packages have been created or revised specifically for de novo assembly of next-generation sequencing data. This review summarizes and compares the published descriptions of packages named SSAKE, SHARCGS, VCAKE, Newbler, Celera Assembler, Euler, Velvet, ABySS, AllPaths, and SOAPdenovo. More generally, it compares the two standard methods known as the de Bruijn graph approach and the overlap/layout/consensus approach to assembly.

摘要

下一代测序平台的出现促使全基因组鸟枪法组装算法和软件的研究再次兴起。与 Sanger 测序数据相比,罗氏 454、Illumina/Solexa 和 ABI SOLiD 平台的 DNA 测序数据通常具有较短的读长、更高的覆盖率和不同的错误分布。自 2005 年以来,已经创建或专门修订了几个组装软件包,用于从头组装下一代测序数据。本综述总结并比较了名为 SSAKE、SHARCGS、VCAKE、Newbler、Celera Assembler、Euler、Velvet、ABySS、AllPaths 和 SOAPdenovo 的软件包的已发表描述。更一般地说,它比较了两种称为 de Bruijn 图方法和重叠/布局/共识方法的两种标准组装方法。

相似文献

1
Assembly algorithms for next-generation sequencing data.
Genomics. 2010 Jun;95(6):315-27. doi: 10.1016/j.ygeno.2010.03.001. Epub 2010 Mar 6.
2
Benchmarking of de novo assembly algorithms for Nanopore data reveals optimal performance of OLC approaches.
BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):507. doi: 10.1186/s12864-016-2895-8.
3
Clover: a clustering-oriented de novo assembler for Illumina sequences.
BMC Bioinformatics. 2020 Nov 17;21(1):528. doi: 10.1186/s12859-020-03788-9.
4
Comparative studies of de novo assembly tools for next-generation sequencing technologies.
Bioinformatics. 2011 Aug 1;27(15):2031-7. doi: 10.1093/bioinformatics/btr319. Epub 2011 Jun 2.
5
The MaSuRCA genome assembler.
Bioinformatics. 2013 Nov 1;29(21):2669-77. doi: 10.1093/bioinformatics/btt476. Epub 2013 Aug 29.
6
NeatFreq: reference-free data reduction and coverage normalization for De Novo sequence assembly.
BMC Bioinformatics. 2014 Nov 19;15(1):357. doi: 10.1186/s12859-014-0357-3.
7
Evaluating de Bruijn graph assemblers on 454 transcriptomic data.
PLoS One. 2012;7(12):e51188. doi: 10.1371/journal.pone.0051188. Epub 2012 Dec 7.
10
Comparison of the two major classes of assembly algorithms: overlap-layout-consensus and de-bruijn-graph.
Brief Funct Genomics. 2012 Jan;11(1):25-37. doi: 10.1093/bfgp/elr035. Epub 2011 Dec 19.

引用本文的文献

2
Genome Analysis and Characterization of Enterococcus sp. SB12 Isolated from Carpathian Artisanal Cheese.
Curr Microbiol. 2025 Jun 28;82(8):349. doi: 10.1007/s00284-025-04337-4.
4
Genome Sequence of the Cluster EF Bacteriophage TinyMiny, Isolated using .
MicroPubl Biol. 2025 May 29;2025. doi: 10.17912/micropub.biology.001488. eCollection 2025.
5
A PCR primer design method for identifying spider mite species using k-mer counting.
PLoS One. 2025 Jun 9;20(6):e0321199. doi: 10.1371/journal.pone.0321199. eCollection 2025.
6
Complete genome sequences of phages HazuAndZazu and Tubberson.
Microbiol Resour Announc. 2025 Jul 10;14(7):e0036125. doi: 10.1128/mra.00361-25. Epub 2025 Jun 9.
9
Advanced Strategies in Enhancing the Hepatoprotective Efficacy of Natural Products: Integrating Nanotechnology, Genomics, and Mechanistic Insights.
ACS Biomater Sci Eng. 2025 May 12;11(5):2528-2549. doi: 10.1021/acsbiomaterials.5c00004. Epub 2025 Apr 11.
10
Complete genome sequences of phages CoffeeBean, Dorin, Francesca, Madraxi, and Tonitrus.
MicroPubl Biol. 2025 Mar 16;2025. doi: 10.17912/micropub.biology.001491. eCollection 2025.

本文引用的文献

2
De novo assembly of human genomes with massively parallel short read sequencing.
Genome Res. 2010 Feb;20(2):265-72. doi: 10.1101/gr.097261.109. Epub 2009 Dec 17.
3
The sequence and de novo assembly of the giant panda genome.
Nature. 2010 Jan 21;463(7279):311-7. doi: 10.1038/nature08696. Epub 2009 Dec 13.
4
Building the sequence map of the human pan-genome.
Nat Biotechnol. 2010 Jan;28(1):57-63. doi: 10.1038/nbt.1596. Epub 2009 Dec 7.
5
BOAT: Basic Oligonucleotide Alignment Tool.
BMC Genomics. 2009 Dec 3;10 Suppl 3(Suppl 3):S2. doi: 10.1186/1471-2164-10-S3-S2.
6
ALLPATHS 2: small genomes assembled accurately and with high continuity from short paired reads.
Genome Biol. 2009;10(10):R103. doi: 10.1186/gb-2009-10-10-r103. Epub 2009 Oct 1.
7
Simultaneous alignment of short reads against multiple genomes.
Genome Biol. 2009;10(9):R98. doi: 10.1186/gb-2009-10-9-r98. Epub 2009 Sep 17.
8
Fast mapping of short sequences with mismatches, insertions and deletions using index structures.
PLoS Comput Biol. 2009 Sep;5(9):e1000502. doi: 10.1371/journal.pcbi.1000502. Epub 2009 Sep 11.
9
De novo genome sequence assembly of a filamentous fungus using Sanger, 454 and Illumina sequence data.
Genome Biol. 2009;10(9):R94. doi: 10.1186/gb-2009-10-9-r94. Epub 2009 Sep 11.
10
PerM: efficient mapping of short sequencing reads with periodic full sensitive spaced seeds.
Bioinformatics. 2009 Oct 1;25(19):2514-21. doi: 10.1093/bioinformatics/btp486. Epub 2009 Aug 12.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验