Fast and accurate de novo genome assembly from long uncorrected reads.

Author information

Vaser Robert, Sović Ivan, Nagarajan Niranjan, Šikić Mile

Affiliations

Department of Electronic Systems and Information Processing, University of Zagreb, Faculty of Electrical Engineering and Computing, 10000 Zagreb, Croatia.

Centre for Informatics and Computing, Ruđer Bošković Institute, 10000 Zagreb, Croatia.

Publication information

Genome Res. 2017 May;27(5):737-746. doi: 10.1101/gr.214270.116. Epub 2017 Jan 18.

DOI:10.1101/gr.214270.116

PMID:28100585

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5411768/

Abstract

The assembly of long reads from Pacific Biosciences and Oxford Nanopore Technologies typically requires resource-intensive error-correction and consensus-generation steps to obtain high-quality assemblies. We show that the error-correction step can be omitted and that high-quality consensus sequences can be generated efficiently with a SIMD-accelerated, partial-order alignment-based, stand-alone consensus module called Racon. Based on tests with PacBio and Oxford Nanopore data sets, we show that Racon coupled with miniasm enables consensus genomes with similar or better quality than state-of-the-art methods while being an order of magnitude faster.

Similar articles

Fast and accurate de novo genome assembly from long uncorrected reads.

Genome Res. 2017 May;27(5):737-746. doi: 10.1101/gr.214270.116. Epub 2017 Jan 18.

Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.

Genome Res. 2017 May;27(5):722-736. doi: 10.1101/gr.215087.116. Epub 2017 Mar 15.

Benchmarking Long-Read Assemblers for Genomic Analyses of Bacterial Pathogens Using Oxford Nanopore Sequencing.

Int J Mol Sci. 2020 Dec 1;21(23):9161. doi: 10.3390/ijms21239161.

Improved assembly of noisy long reads by k-mer validation.

Genome Res. 2016 Dec;26(12):1710-1720. doi: 10.1101/gr.209247.116. Epub 2016 Oct 7.

Hybrid assembly of the large and highly repetitive genome of Aegilops tauschii, a progenitor of bread wheat, with the MaSuRCA mega-reads algorithm.

Genome Res. 2017 May;27(5):787-792. doi: 10.1101/gr.213405.116. Epub 2017 Jan 27.

Highly accurate long reads are crucial for realizing the potential of biodiversity genomics.

BMC Genomics. 2023 Mar 16;24(1):117. doi: 10.1186/s12864-023-09193-9.

Benchmarking of de novo assembly algorithms for Nanopore data reveals optimal performance of OLC approaches.

BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):507. doi: 10.1186/s12864-016-2895-8.

Direct comparison of performance of single nucleotide variant calling in human genome with alignment-based and assembly-based approaches.

Sci Rep. 2017 Sep 8;7(1):10963. doi: 10.1038/s41598-017-10826-9.

HINGE: long-read assembly achieves optimal repeat resolution.

Genome Res. 2017 May;27(5):747-756. doi: 10.1101/gr.216465.116. Epub 2017 Mar 20.

Benchmarking hybrid assembly approaches for genomic analyses of bacterial pathogens using Illumina and Oxford Nanopore sequencing.

BMC Genomics. 2020 Sep 14;21(1):631. doi: 10.1186/s12864-020-07041-8.

Cited by

One mother for two species via obligate cross-species cloning in ants.

Nature. 2025 Sep 3. doi: 10.1038/s41586-025-09425-w.

Chromosome-level assembly of Cucumis sativus cv. 'Tokiwa' as a reference genome of Japanese cucumber.

Breed Sci. 2025 Apr;75(2):85-92. doi: 10.1270/jsbbs.24066. Epub 2025 Mar 27.

Genomic and Phenotypic Evaluation of the Gliadin-Degrading Probiotic Bacillus amyloliquefaciens EG025 from Cheonggukjang for Celiac Disease Treatment.

Probiotics Antimicrob Proteins. 2025 Sep 2. doi: 10.1007/s12602-025-10728-7.

A telomere-to-telomere genome assembly of koi carp (Cyprinus carpio) using long reads and Hi-C technology.

Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf087.

Accurately assembling nanopore sequencing data of highly pathogenic bacteria.

BMC Genomics. 2025 Aug 28;26(1):783. doi: 10.1186/s12864-025-11793-6.

Highly contiguous genome of the medicinal plant Sarcandra glabra (Thunb.) Nakai.

Sci Data. 2025 Aug 28;12(1):1508. doi: 10.1038/s41597-025-05796-x.

Re-annotation improved large-scale assembly of the reef-building coral Acropora intermedia.

Sci Data. 2025 Aug 28;12(1):1504. doi: 10.1038/s41597-025-05849-1.

Genomic Insights into Emerging Multidrug-Resistant Strains: First Report from Thailand.

Antibiotics (Basel). 2025 Jul 24;14(8):746. doi: 10.3390/antibiotics14080746.

Complex Sex Determination in the Grey Mullet Suggested by Individual Whole Genome Sequence Data.

Animals (Basel). 2025 Aug 20;15(16):2445. doi: 10.3390/ani15162445.

The bacterial community of the freshwater bryozoan Cristatella mucedo and its secondary metabolites production potential.

Sci Rep. 2025 Aug 26;15(1):31456. doi: 10.1038/s41598-025-17084-0.

References

Edlib: a C/C++ library for fast, exact sequence alignment using edit distance.

Bioinformatics. 2017 May 1;33(9):1394-1395. doi: 10.1093/bioinformatics/btw753.

de novo assembly and population genomic survey of natural yeast isolates with the Oxford Nanopore MinION sequencer.

Gigascience. 2017 Feb 1;6(2):1-13. doi: 10.1093/gigascience/giw018.

Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.

Genome Res. 2017 May;27(5):722-736. doi: 10.1101/gr.215087.116. Epub 2017 Mar 15.

Phased diploid genome assembly with single-molecule real-time sequencing.

Nat Methods. 2016 Dec;13(12):1050-1054. doi: 10.1038/nmeth.4035. Epub 2016 Oct 17.

Real-time selective sequencing using nanopore technology.

Nat Methods. 2016 Sep;13(9):751-4. doi: 10.1038/nmeth.3930. Epub 2016 Jul 25.

Sparc: a sparsity-based consensus algorithm for long erroneous sequencing reads.

PeerJ. 2016 Jun 8;4:e2016. doi: 10.7717/peerj.2016. eCollection 2016.

Evaluation of hybrid and non-hybrid methods for de novo assembly of nanopore reads.

Bioinformatics. 2016 Sep 1;32(17):2582-9. doi: 10.1093/bioinformatics/btw237. Epub 2016 May 9.

Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences.

Bioinformatics. 2016 Jul 15;32(14):2103-10. doi: 10.1093/bioinformatics/btw152. Epub 2016 Mar 19.

Fast and sensitive mapping of nanopore sequencing reads with GraphMap.

Nat Commun. 2016 Apr 15;7:11307. doi: 10.1038/ncomms11307.

A complete bacterial genome assembled de novo using only nanopore sequencing data.

Nat Methods. 2015 Aug;12(8):733-5. doi: 10.1038/nmeth.3444. Epub 2015 Jun 15.
