• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

带噪reads 的鸟枪法测序的近最优组装。

Near-optimal assembly for shotgun sequencing with noisy reads.

出版信息

BMC Bioinformatics. 2014;15 Suppl 9(Suppl 9):S4. doi: 10.1186/1471-2105-15-S9-S4. Epub 2014 Sep 10.

DOI:10.1186/1471-2105-15-S9-S4
PMID:25252708
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4168708/
Abstract

Recent work identified the fundamental limits on the information requirements in terms of read length and coverage depth required for successful de novo genome reconstruction from shotgun sequencing data, based on the idealistic assumption of no errors in the reads (noiseless reads). In this work, we show that even when there is noise in the reads, one can successfully reconstruct with information requirements close to the noiseless fundamental limit. A new assembly algorithm, X-phased Multibridging, is designed based on a probabilistic model of the genome. It is shown through analysis to perform well on the model, and through simulations to perform well on real genomes.

摘要

最近的工作确定了从鸟枪法测序数据成功从头重建基因组所需的读长和覆盖深度的信息要求的基本限制,这是基于读取中没有错误(无噪声读取)的理想化假设。在这项工作中,我们表明,即使读取中有噪声,也可以接近无噪声基本限制成功地进行重建。基于基因组的概率模型设计了一种新的组装算法 X 相多桥接。通过对模型的分析表明该算法性能良好,通过对真实基因组的模拟也表明该算法性能良好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/ae3eb70ca8e4/1471-2105-15-S9-S4-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/e9eba0c5449b/1471-2105-15-S9-S4-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/51ce24b0aa9f/1471-2105-15-S9-S4-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/7c905bcab596/1471-2105-15-S9-S4-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/9fdf04aaa6a9/1471-2105-15-S9-S4-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/1713651f5bcd/1471-2105-15-S9-S4-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/614ec5c9313b/1471-2105-15-S9-S4-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/5efe2133d84b/1471-2105-15-S9-S4-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/e0a160177d91/1471-2105-15-S9-S4-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/ae3eb70ca8e4/1471-2105-15-S9-S4-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/e9eba0c5449b/1471-2105-15-S9-S4-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/51ce24b0aa9f/1471-2105-15-S9-S4-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/7c905bcab596/1471-2105-15-S9-S4-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/9fdf04aaa6a9/1471-2105-15-S9-S4-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/1713651f5bcd/1471-2105-15-S9-S4-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/614ec5c9313b/1471-2105-15-S9-S4-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/5efe2133d84b/1471-2105-15-S9-S4-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/e0a160177d91/1471-2105-15-S9-S4-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6395/4168708/ae3eb70ca8e4/1471-2105-15-S9-S4-9.jpg

相似文献

1
Near-optimal assembly for shotgun sequencing with noisy reads.带噪reads 的鸟枪法测序的近最优组装。
BMC Bioinformatics. 2014;15 Suppl 9(Suppl 9):S4. doi: 10.1186/1471-2105-15-S9-S4. Epub 2014 Sep 10.
2
Identification of optimum sequencing depth especially for de novo genome assembly of small genomes using next generation sequencing data.利用下一代测序数据鉴定最佳测序深度,特别是对于从头组装小基因组的应用。
PLoS One. 2013 Apr 12;8(4):e60204. doi: 10.1371/journal.pone.0060204. Print 2013.
3
Improving de novo Assembly Based on Read Classification.基于读段分类的从头组装改进。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Jan-Feb;17(1):177-188. doi: 10.1109/TCBB.2018.2861380. Epub 2018 Jul 30.
4
Benchmarking of de novo assembly algorithms for Nanopore data reveals optimal performance of OLC approaches.用于纳米孔数据的从头组装算法基准测试揭示了重叠布局一致(OLC)方法的最佳性能。
BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):507. doi: 10.1186/s12864-016-2895-8.
5
Illumina error correction near highly repetitive DNA regions improves de novo genome assembly.Illumina 纠错技术在高度重复 DNA 区域的应用提高了从头基因组组装的质量。
BMC Bioinformatics. 2019 Jun 3;20(1):298. doi: 10.1186/s12859-019-2906-2.
6
ALLPATHS: de novo assembly of whole-genome shotgun microreads.ALLPATHS:全基因组鸟枪法测序短读段的从头组装。
Genome Res. 2008 May;18(5):810-20. doi: 10.1101/gr.7337908. Epub 2008 Mar 13.
7
GAPPadder: a sensitive approach for closing gaps on draft genomes with short sequence reads.GAPPadder:一种使用短序列读长来闭合草图基因组缺口的灵敏方法。
BMC Genomics. 2019 Jun 6;20(Suppl 5):426. doi: 10.1186/s12864-019-5703-4.
8
RepLong: de novo repeat identification using long read sequencing data.RepLong:利用长读测序数据进行从头重复识别。
Bioinformatics. 2018 Apr 1;34(7):1099-1107. doi: 10.1093/bioinformatics/btx717.
9
Improving de novo sequence assembly using machine learning and comparative genomics for overlap correction.利用机器学习和比较基因组学进行重叠校正,提高从头序列组装质量。
BMC Bioinformatics. 2010 Jan 15;11:33. doi: 10.1186/1471-2105-11-33.
10
Optimal assembly for high throughput shotgun sequencing.高通量鸟枪法测序的最佳组装。
BMC Bioinformatics. 2013;14 Suppl 5(Suppl 5):S18. doi: 10.1186/1471-2105-14-S5-S18. Epub 2013 Jul 9.

引用本文的文献

1
Draft genome sequence of the Tremellomycetes yeast 5307AH, isolated from aircraft.从飞机上分离出的银耳纲酵母5307AH的基因组序列草图
Microbiol Resour Announc. 2024 Aug 13;13(8):e0014124. doi: 10.1128/mra.00141-24. Epub 2024 Jul 5.
2
Improving RNA Assembly via Safety and Completeness in Flow Decompositions.通过流分解中的安全性和完整性提高 RNA 组装
J Comput Biol. 2022 Dec;29(12):1270-1287. doi: 10.1089/cmb.2022.0261. Epub 2022 Oct 25.
3
Modeling of shotgun sequencing of DNA plasmids using experimental and theoretical approaches.

本文引用的文献

1
Optimal assembly for high throughput shotgun sequencing.高通量鸟枪法测序的最佳组装。
BMC Bioinformatics. 2013;14 Suppl 5(Suppl 5):S18. doi: 10.1186/1471-2105-14-S5-S18. Epub 2013 Jul 9.
2
Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.非杂交、基于长读长 SMRT 测序数据的完成微生物基因组组装。
Nat Methods. 2013 Jun;10(6):563-9. doi: 10.1038/nmeth.2474. Epub 2013 May 5.
3
Hybrid error correction and de novo assembly of single-molecule sequencing reads.单分子测序reads 的混合纠错与从头组装。
使用实验和理论方法对 DNA 质粒的鸟枪法测序进行建模。
BMC Bioinformatics. 2020 Apr 3;21(1):132. doi: 10.1186/s12859-020-3461-6.
4
Multiplexed Non-barcoded Long-Read Sequencing and Assembling Genomes of Bacillus Strains in Error-Free Simulations.多通道无条码长读测序和组装芽孢杆菌菌株的基因组,在无错误模拟中。
Curr Microbiol. 2020 Jan;77(1):79-84. doi: 10.1007/s00284-019-01808-3. Epub 2019 Nov 13.
5
A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set.一个染色体水平的序列组装揭示了拟南芥 Nd-1 基因组及其基因集的结构。
PLoS One. 2019 May 21;14(5):e0216233. doi: 10.1371/journal.pone.0216233. eCollection 2019.
6
Rapid Low-Cost Assembly of the Reference Genome Using Low-Coverage, Long-Read Sequencing.利用低覆盖度长读长测序技术快速低成本组装参考基因组
G3 (Bethesda). 2018 Oct 3;8(10):3143-3154. doi: 10.1534/g3.118.200162.
7
Versatile genome assembly evaluation with QUAST-LG.QUAST-LG 进行多功能基因组组装评估。
Bioinformatics. 2018 Jul 1;34(13):i142-i150. doi: 10.1093/bioinformatics/bty266.
8
HISEA: HIerarchical SEed Aligner for PacBio data.HISEA:用于PacBio数据的分层种子比对器。
BMC Bioinformatics. 2017 Dec 19;18(1):564. doi: 10.1186/s12859-017-1953-9.
9
Scaffolding of long read assemblies using long range contact information.利用长程接触信息对长读长组装进行支架构建。
BMC Genomics. 2017 Jul 12;18(1):527. doi: 10.1186/s12864-017-3879-z.
10
Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads.单轮循环器:从短读长和长读长测序数据中解析细菌基因组组装结果
PLoS Comput Biol. 2017 Jun 8;13(6):e1005595. doi: 10.1371/journal.pcbi.1005595. eCollection 2017 Jun.
Nat Biotechnol. 2012 Jul 1;30(7):693-700. doi: 10.1038/nbt.2280.
4
Efficient de novo assembly of large genomes using compressed data structures.利用压缩数据结构进行高效的从头基因组组装。
Genome Res. 2012 Mar;22(3):549-56. doi: 10.1101/gr.126953.111. Epub 2011 Dec 7.
5
Comparing de novo genome assembly: the long and short of it.从头开始比较基因组组装:长与短。
PLoS One. 2011 Apr 29;6(4):e19175. doi: 10.1371/journal.pone.0019175.
6
High-quality draft assemblies of mammalian genomes from massively parallel sequence data.利用大规模平行测序数据生成高质量的哺乳动物基因组草图组装。
Proc Natl Acad Sci U S A. 2011 Jan 25;108(4):1513-8. doi: 10.1073/pnas.1017351108. Epub 2010 Dec 27.
7
Genome assembly reborn: recent computational challenges.基因组组装重生:近期的计算挑战
Brief Bioinform. 2009 Jul;10(4):354-66. doi: 10.1093/bib/bbp026. Epub 2009 May 29.
8
Microbiology in the post-genomic era.后基因组时代的微生物学。
Nat Rev Microbiol. 2008 Jun;6(6):419-30. doi: 10.1038/nrmicro1901. Epub 2008 May 13.
9
Velvet: algorithms for de novo short read assembly using de Bruijn graphs.《天鹅绒:使用德布鲁因图进行从头短读长拼接的算法》
Genome Res. 2008 May;18(5):821-9. doi: 10.1101/gr.074492.107. Epub 2008 Mar 18.
10
DNA sequencing. A plan to capture human diversity in 1000 genomes.DNA测序。一项在千人基因组计划中捕捉人类多样性的计划。
Science. 2008 Jan 25;319(5862):395. doi: 10.1126/science.319.5862.395.