Comparative studies of de novo assembly tools for next-generation sequencing technologies.

Affiliation

Center of System Biomedical Sciences, University of Shanghai for Science and Technology, Shanghai 200093, P. R. China.

Publication information

Bioinformatics. 2011 Aug 1;27(15):2031-7. doi: 10.1093/bioinformatics/btr319. Epub 2011 Jun 2.

DOI: 10.1093/bioinformatics/btr319
PMID: 21636596
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3137213/
Abstract

MOTIVATION

Several new de novo assembly tools have been developed recently to assemble short sequencing reads generated by next-generation sequencing platforms. However, the performance of these tools under various conditions has not been fully investigated, and sufficient information is not currently available for informed decisions to be made regarding the tool that would be most likely to produce the best performance under a specific set of conditions.

RESULTS

We studied and compared the performance of commonly used de novo assembly tools specifically designed for next-generation sequencing data, including SSAKE, VCAKE, Euler-sr, Edena, Velvet, ABySS and SOAPdenovo. Tools were compared using several performance criteria, including N50 length, sequence coverage and assembly accuracy. Various properties of read data, including single-end/paired-end, sequence GC content, depth of coverage and base calling error rates, were investigated for their effects on the performance of different assembly tools. We also compared the computation time and memory usage of these seven tools. Based on the results of our comparison, the relative performance of individual tools is summarized, and tentative guidelines for selecting an assembly tool under different conditions are provided.
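
For readers unfamiliar with the metrics named above, the following is a minimal sketch of two of them: N50 length and depth of coverage. It uses the commonly cited definitions (N50 is the length of the shortest contig in the smallest set of longest contigs that together cover at least half of the total assembled length; depth is total sequenced bases divided by genome size). The abstract does not state which exact variants the authors used, so the function names `n50` and `depth_of_coverage` and the sample values are illustrative assumptions, not the paper's implementation.

```python
from typing import Iterable


def n50(contig_lengths: Iterable[int]) -> int:
    """Return the N50 of a set of contig lengths (common definition).

    Contigs are sorted from longest to shortest and accumulated until
    the running sum reaches at least half of the total assembled
    length; the length of the contig that crosses that threshold is
    the N50. Returns 0 for an empty assembly.
    """
    lengths = sorted(contig_lengths, reverse=True)
    if not lengths:
        return 0
    half_total = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        if running >= half_total:
            return length
    return 0  # unreachable for non-empty input, kept for completeness


def depth_of_coverage(num_reads: int, read_length: int, genome_size: int) -> float:
    """Average depth: total sequenced bases divided by genome size."""
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return num_reads * read_length / genome_size


if __name__ == "__main__":
    # Hypothetical contig lengths, for illustration only.
    contigs = [80, 70, 50, 40, 30, 20]
    print("N50:", n50(contigs))
    print("Depth:", depth_of_coverage(100, 50, 1000))
```

Because N50 depends on the total assembled length, it can look favorable for an assembly that is incomplete, which is one reason it is usually reported alongside sequence coverage and accuracy, as in the comparison described above.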

Similar articles

1. Comparative studies of de novo assembly tools for next-generation sequencing technologies.
   Bioinformatics. 2011 Aug 1;27(15):2031-7. doi: 10.1093/bioinformatics/btr319. Epub 2011 Jun 2.
2. Benchmarking of de novo assembly algorithms for Nanopore data reveals optimal performance of OLC approaches.
   BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):507. doi: 10.1186/s12864-016-2895-8.
3. Assembly algorithms for next-generation sequencing data.
   Genomics. 2010 Jun;95(6):315-27. doi: 10.1016/j.ygeno.2010.03.001. Epub 2010 Mar 6.
4. A pilot study for channel catfish whole genome sequencing and de novo assembly.
   BMC Genomics. 2011 Dec 22;12:629. doi: 10.1186/1471-2164-12-629.
5. De novo assembly of the Pseudomonas syringae pv. syringae B728a genome using Illumina/Solexa short sequence reads.
   FEMS Microbiol Lett. 2009 Feb;291(1):103-11. doi: 10.1111/j.1574-6968.2008.01441.x. Epub 2008 Dec 9.
6. Subset selection of high-depth next generation sequencing reads for de novo genome assembly using MapReduce framework.
   BMC Genomics. 2015;16 Suppl 12(Suppl 12):S9. doi: 10.1186/1471-2164-16-S12-S9. Epub 2015 Dec 9.
7. Identification of optimum sequencing depth especially for de novo genome assembly of small genomes using next generation sequencing data.
   PLoS One. 2013 Apr 12;8(4):e60204. doi: 10.1371/journal.pone.0060204. Print 2013.
8. A practical comparison of de novo genome assembly software tools for next-generation sequencing technologies.
   PLoS One. 2011 Mar 14;6(3):e17915. doi: 10.1371/journal.pone.0017915.
9. QSRA: a quality-value guided de novo short read assembler.
   BMC Bioinformatics. 2009 Feb 24;10:69. doi: 10.1186/1471-2105-10-69.
10. NeatFreq: reference-free data reduction and coverage normalization for De Novo sequence assembly.
    BMC Bioinformatics. 2014 Nov 19;15(1):357. doi: 10.1186/s12859-014-0357-3.

Cited by

1. An overlooked phenomenon: complex interactions of potential error sources on the quality of bacterial de novo genome assemblies.
   BMC Genomics. 2024 Jan 9;25(1):45. doi: 10.1186/s12864-023-09910-4.
2. Using a combination of short- and long-read sequencing to investigate the diversity in plasmid- and chromosomally encoded extended-spectrum beta-lactamases (ESBLs) in clinical and isolates in Belgium.
   Microb Genom. 2023 Jan;9(1). doi: 10.1099/mgen.0.000925.
3. Transcriptome repository of North-Western Himalayan endangered medicinal herbs: a paramount approach illuminating molecular perspective of phytoactive molecules and secondary metabolism.
   Mol Genet Genomics. 2021 Nov;296(6):1177-1202. doi: 10.1007/s00438-021-01821-x. Epub 2021 Sep 24.
4. PAN2HGENE-tool for comparative analysis and identifying new gene products.
   PLoS One. 2021 May 28;16(5):e0252414. doi: 10.1371/journal.pone.0252414. eCollection 2021.
5. Construction of Whole Genomes from Scaffolds Using Single Cell Strand-Seq Data.
   Int J Mol Sci. 2021 Mar 31;22(7):3617. doi: 10.3390/ijms22073617.
6. Draft genome of Meyerozyma guilliermondii strain vka1: a yeast strain with composting potential.
   J Genet Eng Biotechnol. 2020 Sep 29;18(1):54. doi: 10.1186/s43141-020-00074-2.
7. Chromosome Level Genome Assembly of .
   Front Genet. 2020 Jun 30;11:701. doi: 10.3389/fgene.2020.00701. eCollection 2020.
8. ConFindr: rapid detection of intraspecies and cross-species contamination in bacterial whole-genome sequence data.
   PeerJ. 2019 May 31;7:e6995. doi: 10.7717/peerj.6995. eCollection 2019.
9. Genomic and Metagenomic Approaches for Predictive Surveillance of Emerging Pathogens and Antibiotic Resistance.
   Clin Pharmacol Ther. 2019 Sep;106(3):512-524. doi: 10.1002/cpt.1535. Epub 2019 Jul 22.
10. GMASS: a novel measure for genome assembly structural similarity.
    BMC Bioinformatics. 2019 Mar 18;20(1):147. doi: 10.1186/s12859-019-2710-z.
