A new strategy for genome assembly using short sequence reads and reduced representation libraries.

Affiliation

Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA.

Publication information

Genome Res. 2010 Feb;20(2):249-56. doi: 10.1101/gr.097956.109.

Abstract

We have developed a novel approach for using massively parallel short-read sequencing to generate fast and inexpensive de novo genomic assemblies comparable to those generated by capillary-based methods. The ultrashort (<100 base) sequences generated by this technology pose specific biological and computational challenges for de novo assembly of large genomes. To account for this, we devised a method for experimentally partitioning the genome using reduced representation (RR) libraries prior to assembly. We use two restriction enzymes independently to create a series of overlapping fragment libraries, each containing a tractable subset of the genome. Together, these libraries allow us to reassemble the entire genome without the need for a reference sequence. As proof of concept, we applied this approach to sequence and assemble the majority of the 125-Mb Drosophila melanogaster genome. We subsequently demonstrate the accuracy of our assembly method with meaningful comparisons against the currently available D. melanogaster reference genome (dm3). The ease of assembly and accuracy for comparative genomics suggest that our approach will scale to future mammalian genome-sequencing efforts, saving both time and money without sacrificing quality.
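
The partitioning idea in the abstract can be illustrated with a small in-silico digest. This is only a sketch under stated assumptions, not the authors' pipeline: the abstract does not name the two enzymes, so the recognition sites below (GAATTC and AAGCTT) are placeholders, the toy sequence is invented, cutting at the start of the site is a simplification, and the size window in `size_select` is hypothetical. It shows why digesting independently with two enzymes yields overlapping libraries: every cut made by one enzyme lies inside a fragment made by the other.

```python
import re

def digest(genome, site):
    """Return (start, end) fragments from cutting at every occurrence of `site`.

    Simplification: the cut is placed at the first base of the recognition site.
    """
    cuts = [m.start() for m in re.finditer(f"(?={re.escape(site)})", genome)]
    bounds = [0, *cuts, len(genome)]
    return [(a, b) for a, b in zip(bounds, bounds[1:]) if b > a]

def size_select(fragments, min_len, max_len):
    """Keep fragments with min_len <= length < max_len (hypothetical window)."""
    return [(a, b) for a, b in fragments if min_len <= b - a < max_len]

def cut_is_bridged(cut, fragments):
    """True if some fragment strictly spans the cut position."""
    return any(a < cut < b for a, b in fragments)

# Invented toy sequence containing two copies of each placeholder site.
genome = (
    "ACGT" * 5 + "GAATTC" + "TTGCA" * 4 + "AAGCTT"
    + "CCGTA" * 6 + "GAATTC" + "GGATC" * 3 + "AAGCTT" + "TACG" * 5
)
SITE_A = "GAATTC"  # assumption: placeholder for the first enzyme's site
SITE_B = "AAGCTT"  # assumption: placeholder for the second enzyme's site

frags_a = digest(genome, SITE_A)
frags_b = digest(genome, SITE_B)

# Internal cut positions are the starts of all fragments except the first.
cuts_a = [a for a, _ in frags_a if a > 0]
cuts_b = [a for a, _ in frags_b if a > 0]

# Each enzyme's cuts should fall inside the other enzyme's fragments.
all_bridged = (
    all(cut_is_bridged(c, frags_b) for c in cuts_a)
    and all(cut_is_bridged(c, frags_a) for c in cuts_b)
)

# One size fraction of the first digest, standing in for a single RR library.
library_a = size_select(frags_a, 40, 50)

print("Enzyme A fragments:", frags_a)
print("Enzyme B fragments:", frags_b)
print("All cuts bridged by the other digest:", all_bridged)
print("Size-selected library from digest A:", library_a)
```

In this toy case, contigs assembled from one digest end at that enzyme's cut sites, and fragments from the other digest span those positions, which is what would let separately assembled libraries be joined.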

Cited by

3
Reconstructing ancient genomes and epigenomes.
Nat Rev Genet. 2015 Jul;16(7):395-408. doi: 10.1038/nrg3935. Epub 2015 Jun 9.

References

1
ABySS: a parallel assembler for short read sequence data.
Genome Res. 2009 Jun;19(6):1117-23. doi: 10.1101/gr.089532.108. Epub 2009 Feb 27.
3
The UCSC Genome Browser Database: update 2009.
Nucleic Acids Res. 2009 Jan;37(Database issue):D755-61. doi: 10.1093/nar/gkn875. Epub 2008 Nov 7.
