ABySS: a parallel assembler for short read sequence data.

Author information

Simpson Jared T, Wong Kim, Jackman Shaun D, Schein Jacqueline E, Jones Steven J M, Birol Inanç

Affiliation

Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia V5Z 4E6, Canada.

Publication information

Genome Res. 2009 Jun;19(6):1117-23. doi: 10.1101/gr.089532.108. Epub 2009 Feb 27.

Abstract

Widespread adoption of massively parallel deoxyribonucleic acid (DNA) sequencing instruments has prompted the recent development of de novo short read assembly algorithms. A common shortcoming of the available tools is their inability to efficiently assemble vast amounts of data generated from large-scale sequencing projects, such as the sequencing of individual human genomes to catalog natural genetic variation. To address this limitation, we developed ABySS (Assembly By Short Sequences), a parallelized sequence assembler. As a demonstration of the capability of our software, we assembled 3.5 billion paired-end reads from the genome of an African male publicly released by Illumina, Inc. Approximately 2.76 million contigs ≥100 base pairs (bp) in length were created with an N50 size of 1499 bp, representing 68% of the reference human genome. Analysis of these contigs identified polymorphic and novel sequences not present in the human reference assembly, which were validated by alignment to alternate human assemblies and to other primate genomes.
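
The abstract reports an N50 over contigs of at least 100 bp. As a rough illustration of how such a statistic is usually computed, the sketch below uses one common definition: the length of the contig at which the running total, taken from longest to shortest, first reaches half of the summed contig length. This is an assumption for illustration; the abstract does not state which definition the authors used, and the function name `n50`, the `min_length` parameter, and the toy lengths are all hypothetical.

```python
def n50(lengths, min_length=100):
    """Return the N50 of the contigs that are at least min_length long.

    Assumed definition: walk the contigs from longest to shortest and
    return the length at which the running total first reaches half of
    the summed length of all retained contigs.
    """
    # Apply the length cutoff, then sort longest first.
    kept = sorted((n for n in lengths if n >= min_length), reverse=True)
    if not kept:
        raise ValueError("no contigs at or above the length cutoff")

    half_total = sum(kept) / 2
    running = 0
    for length in kept:
        running += length
        if running >= half_total:
            return length


# Toy contig lengths (hypothetical, not data from the paper).
toy_lengths = [50, 100, 200, 300, 400]
toy_n50 = n50(toy_lengths, min_length=100)
```

With the toy input, the 50 bp contig is discarded by the cutoff, so the statistic is computed over the remaining four contigs only. Changing the cutoff changes the result, which is why the abstract states the 100 bp threshold alongside the N50 value.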
