• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用“可靠”重叠区域改进基于Phrap的大鼠基因组组装。

Improving Phrap-based assembly of the rat using "reliable" overlaps.

作者信息

Roberts Michael, Zimin Aleksey V, Hayes Wayne, Hunt Brian R, Ustun Cevat, White James R, Havlak Paul, Yorke James

机构信息

Institute for Physical Science and Technology, University of Maryland, College Park, Maryland, United States of America.

出版信息

PLoS One. 2008 Mar 19;3(3):e1836. doi: 10.1371/journal.pone.0001836.

DOI:10.1371/journal.pone.0001836
PMID:18350171
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2266800/
Abstract

The assembly methods used for whole-genome shotgun (WGS) data have a major impact on the quality of resulting draft genomes. We present a novel algorithm to generate a set of "reliable" overlaps based on identifying repeat k-mers. To demonstrate the benefits of using reliable overlaps, we have created a version of the Phrap assembly program that uses only overlaps from a specific list. We call this version PhrapUMD. Integrating PhrapUMD and our "reliable-overlap" algorithm with the Baylor College of Medicine assembler, Atlas, we assemble the BACs from the Rattus norvegicus genome project. Starting with the same data as the Nov. 2002 Atlas assembly, we compare our results and the Atlas assembly to the 4.3 Mb of rat sequence in the 21 BACs that have been finished. Our version of the draft assembly of the 21 BACs increases the coverage of finished sequence from 93.4% to 96.3%, while simultaneously reducing the base error rate from 4.5 to 1.1 errors per 10,000 bases. There are a number of ways of assessing the relative merits of assemblies when the finished sequence is available. If one views the overall quality of an assembly as proportional to the inverse of the product of the error rate and sequence missed, then the assembly presented here is seven times better. The UMD Overlapper with options for reliable overlaps is available from the authors at http://www.genome.umd.edu. We also provide the changes to the Phrap source code enabling it to use only the reliable overlaps.

摘要

用于全基因组鸟枪法(WGS)数据的组装方法对所得草图基因组的质量有重大影响。我们提出了一种新算法,通过识别重复k-mer来生成一组“可靠的”重叠区域。为了证明使用可靠重叠区域的好处,我们创建了一个Phrap组装程序版本,该版本仅使用特定列表中的重叠区域。我们将此版本称为PhrapUMD。将PhrapUMD和我们的“可靠重叠”算法与贝勒医学院的组装程序Atlas相结合,我们对褐家鼠基因组计划中的BAC进行了组装。从与2002年11月Atlas组装相同的数据开始,我们将我们的结果和Atlas组装与已完成的21个BAC中4.3 Mb的大鼠序列进行比较。我们版本的21个BAC草图组装将已完成序列的覆盖率从93.4%提高到了96.3%,同时将碱基错误率从每10000个碱基4.5个错误降低到了1.1个错误。当有完成的序列时,有多种评估组装相对优点的方法。如果将组装的整体质量视为与错误率和遗漏序列的乘积的倒数成正比,那么这里展示的组装质量提高了七倍。作者提供了带有可靠重叠选项的UMD Overlapper,可从http://www.genome.umd.edu获取。我们还提供了对Phrap源代码的修改,使其能够仅使用可靠的重叠区域。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0eaf/2266800/05d2b6dbd998/pone.0001836.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0eaf/2266800/e1d8fab9b034/pone.0001836.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0eaf/2266800/cbaf292015c3/pone.0001836.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0eaf/2266800/05d2b6dbd998/pone.0001836.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0eaf/2266800/e1d8fab9b034/pone.0001836.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0eaf/2266800/cbaf292015c3/pone.0001836.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0eaf/2266800/05d2b6dbd998/pone.0001836.g003.jpg

相似文献

1
Improving Phrap-based assembly of the rat using "reliable" overlaps.利用“可靠”重叠区域改进基于Phrap的大鼠基因组组装。
PLoS One. 2008 Mar 19;3(3):e1836. doi: 10.1371/journal.pone.0001836.
2
The Atlas genome assembly system.阿特拉斯基因组组装系统。
Genome Res. 2004 Apr;14(4):721-32. doi: 10.1101/gr.2264004.
3
A preprocessor for shotgun assembly of large genomes.用于大型基因组鸟枪法组装的预处理器。
J Comput Biol. 2004;11(4):734-52. doi: 10.1089/cmb.2004.11.734.
4
Bos taurus genome assembly.牛基因组组装
BMC Genomics. 2009 Apr 24;10:180. doi: 10.1186/1471-2164-10-180.
5
QuorUM: An Error Corrector for Illumina Reads.QuorUM:Illumina测序读数的纠错工具
PLoS One. 2015 Jun 17;10(6):e0130821. doi: 10.1371/journal.pone.0130821. eCollection 2015.
6
Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster euchromatic genome sequence.完成全基因组鸟枪法测序:黑腹果蝇常染色质基因组序列第3版发布。
Genome Biol. 2002;3(12):RESEARCH0079. doi: 10.1186/gb-2002-3-12-research0079. Epub 2002 Dec 23.
7
Assembly reconciliation.装配核对
Bioinformatics. 2008 Jan 1;24(1):42-5. doi: 10.1093/bioinformatics/btm542. Epub 2007 Dec 5.
8
CAP3: A DNA sequence assembly program.CAP3:一个DNA序列组装程序。
Genome Res. 1999 Sep;9(9):868-77. doi: 10.1101/gr.9.9.868.
9
HISEA: HIerarchical SEed Aligner for PacBio data.HISEA:用于PacBio数据的分层种子比对器。
BMC Bioinformatics. 2017 Dec 19;18(1):564. doi: 10.1186/s12859-017-1953-9.
10
Assembling genomic DNA sequences with PHRAP.使用PHRAP组装基因组DNA序列。
Curr Protoc Bioinformatics. 2007 Mar;Chapter 11:Unit11.4. doi: 10.1002/0471250953.bi1104s17.

引用本文的文献

1
Improving de novo sequence assembly using machine learning and comparative genomics for overlap correction.利用机器学习和比较基因组学进行重叠校正,提高从头序列组装质量。
BMC Bioinformatics. 2010 Jan 15;11:33. doi: 10.1186/1471-2105-11-33.
2
A whole-genome assembly of the domestic cow, Bos taurus.家牛(Bos taurus)的全基因组组装。
Genome Biol. 2009;10(4):R42. doi: 10.1186/gb-2009-10-4-r42. Epub 2009 Apr 24.

本文引用的文献

1
Beware of mis-assembled genomes.谨防基因组组装错误。
Bioinformatics. 2005 Dec 15;21(24):4320-1. doi: 10.1093/bioinformatics/bti769.
2
A preprocessor for shotgun assembly of large genomes.用于大型基因组鸟枪法组装的预处理器。
J Comput Biol. 2004;11(4):734-52. doi: 10.1089/cmb.2004.11.734.
3
The Atlas genome assembly system.阿特拉斯基因组组装系统。
Genome Res. 2004 Apr;14(4):721-32. doi: 10.1101/gr.2264004.
4
Genome sequence of the Brown Norway rat yields insights into mammalian evolution.褐家鼠的基因组序列为哺乳动物进化研究提供了新见解。
Nature. 2004 Apr 1;428(6982):493-521. doi: 10.1038/nature02426.
5
PCAP: a whole-genome assembly program.PCAP:一个全基因组组装程序。
Genome Res. 2003 Sep;13(9):2164-70. doi: 10.1101/gr.1390403.
6
Human-mouse alignments with BLASTZ.使用BLASTZ进行人-小鼠序列比对。
Genome Res. 2003 Jan;13(1):103-7. doi: 10.1101/gr.809403.
7
The phusion assembler.Phusion 组装器。
Genome Res. 2003 Jan;13(1):81-90. doi: 10.1101/gr.731003.
8
Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes.红鳍东方鲀全基因组鸟枪法测序组装与基因组分析
Science. 2002 Aug 23;297(5585):1301-10. doi: 10.1126/science.1072104. Epub 2002 Jul 25.
9
ARACHNE: a whole-genome shotgun assembler.ARACHNE:一种全基因组鸟枪法测序序列拼接程序。
Genome Res. 2002 Jan;12(1):177-89. doi: 10.1101/gr.208902.
10
A whole-genome assembly of Drosophila.果蝇的全基因组组装
Science. 2000 Mar 24;287(5461):2196-204. doi: 10.1126/science.287.5461.2196.