• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

细菌基因组的短读片段组装

Short read fragment assembly of bacterial genomes.

作者信息

Chaisson Mark J, Pevzner Pavel A

机构信息

Bioinformatics Program, University of California San Diego, La Jolla, California 92093, USA.

出版信息

Genome Res. 2008 Feb;18(2):324-30. doi: 10.1101/gr.7088808. Epub 2007 Dec 14.

DOI:10.1101/gr.7088808
PMID:18083777
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2203630/
Abstract

In the last year, high-throughput sequencing technologies have progressed from proof-of-concept to production quality. While these methods produce high-quality reads, they have yet to produce reads comparable in length to Sanger-based sequencing. Current fragment assembly algorithms have been implemented and optimized for mate-paired Sanger-based reads, and thus do not perform well on short reads produced by short read technologies. We present a new Eulerian assembler that generates nearly optimal short read assemblies of bacterial genomes and describe an approach to assemble reads in the case of the popular hybrid protocol when short and long Sanger-based reads are combined.

摘要

在过去的一年里,高通量测序技术已从概念验证发展到生产质量阶段。虽然这些方法能产生高质量的读数,但它们产生的读数长度仍无法与基于桑格测序法的读数相媲美。当前的片段组装算法是针对基于配对末端桑格测序法的读数而实现和优化的,因此在短读技术产生的短读数上表现不佳。我们提出了一种新的欧拉组装器,它能生成接近最优的细菌基因组短读组装结果,并描述了一种在流行的混合协议中,当基于桑格测序法的短读数和长读数相结合时组装读数的方法。

相似文献

1
Short read fragment assembly of bacterial genomes.细菌基因组的短读片段组装
Genome Res. 2008 Feb;18(2):324-30. doi: 10.1101/gr.7088808. Epub 2007 Dec 14.
2
Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads.单轮循环器:从短读长和长读长测序数据中解析细菌基因组组装结果
PLoS Comput Biol. 2017 Jun 8;13(6):e1005595. doi: 10.1371/journal.pcbi.1005595. eCollection 2017 Jun.
3
Benchmarking of de novo assembly algorithms for Nanopore data reveals optimal performance of OLC approaches.用于纳米孔数据的从头组装算法基准测试揭示了重叠布局一致(OLC)方法的最佳性能。
BMC Genomics. 2016 Aug 22;17 Suppl 7(Suppl 7):507. doi: 10.1186/s12864-016-2895-8.
4
Improved assembly of noisy long reads by k-mer validation.通过k-mer验证改进嘈杂长读段的组装。
Genome Res. 2016 Dec;26(12):1710-1720. doi: 10.1101/gr.209247.116. Epub 2016 Oct 7.
5
GAPPadder: a sensitive approach for closing gaps on draft genomes with short sequence reads.GAPPadder:一种使用短序列读长来闭合草图基因组缺口的灵敏方法。
BMC Genomics. 2019 Jun 6;20(Suppl 5):426. doi: 10.1186/s12864-019-5703-4.
6
Fragment assembly with short reads.使用短读段进行片段组装。
Bioinformatics. 2004 Sep 1;20(13):2067-74. doi: 10.1093/bioinformatics/bth205. Epub 2004 Apr 1.
7
Completion of draft bacterial genomes by long-read sequencing of synthetic genomic pools.通过合成基因组文库的长读长测序完成细菌基因组草图
BMC Genomics. 2020 Jul 29;21(1):519. doi: 10.1186/s12864-020-06910-6.
8
SOPRA: Scaffolding algorithm for paired reads via statistical optimization.SOPRA:基于统计优化的配对读取支架算法。
BMC Bioinformatics. 2010 Jun 24;11:345. doi: 10.1186/1471-2105-11-345.
9
One chromosome, one contig: complete microbial genomes from long-read sequencing and assembly.一条染色体,一个连续体:长读测序和组装的完整微生物基因组。
Curr Opin Microbiol. 2015 Feb;23:110-20. doi: 10.1016/j.mib.2014.11.014. Epub 2014 Dec 1.
10
Assembling short reads from jumping libraries with large insert sizes.利用带有较大插入片段的跳跃文库进行短读序列组装。
Bioinformatics. 2015 Oct 15;31(20):3262-8. doi: 10.1093/bioinformatics/btv337. Epub 2015 Jun 3.

引用本文的文献

1
Applications of de Bruijn graphs in microbiome research.德布鲁因图在微生物组研究中的应用。
Imeta. 2022 Mar 1;1(1):e4. doi: 10.1002/imt2.4. eCollection 2022 Mar.
2
Matchtigs: minimum plain text representation of k-mer sets.匹配项:k-mer 集的最小纯文本表示。
Genome Biol. 2023 Jun 9;24(1):136. doi: 10.1186/s13059-023-02968-z.
3
Investigating Possible Interspecies Communication of Plasmids Associated with Transfer of Third-Generation Cephalosporin, Quinolone, and Colistin Resistance Between Simultaneously Isolated Escherichia Coli and Klebsiella Pneumoniae.调查与第三代头孢菌素、喹诺酮类和多粘菌素耐药性转移相关的质粒在同时分离的大肠杆菌和肺炎克雷伯菌之间的种间可能的通讯。
Microbiol Spectr. 2023 Jun 15;11(3):e0355422. doi: 10.1128/spectrum.03554-22. Epub 2023 May 1.
4
Genome Characterization of Phage Amore2.噬菌体Amore2的基因组特征分析
Microbiol Resour Announc. 2022 Oct 20;11(10):e0053722. doi: 10.1128/mra.00537-22. Epub 2022 Sep 28.
5
Draft Genome Sequence of Lentilactobacillus kosonis NBRC 111893, Isolated from a Japanese Sugar-Vegetable Fermented Beverage called Kôso.从日本一种名为“kôso”的糖蔬菜发酵饮料中分离出的科氏扁豆乳杆菌NBRC 111893的基因组序列草图
Microbiol Resour Announc. 2022 Oct 20;11(10):e0077522. doi: 10.1128/mra.00775-22. Epub 2022 Sep 22.
6
Introduction to the principles and methods underlying the recovery of metagenome-assembled genomes from metagenomic data.从宏基因组数据中恢复宏基因组组装基因组的原理和方法简介。
Microbiologyopen. 2022 Jun;11(3):e1298. doi: 10.1002/mbo3.1298.
7
Breeding for Economically and Environmentally Sustainable Wheat Varieties: An Integrated Approach from Genomics to Selection.培育经济和环境可持续型小麦品种:从基因组学到选择的综合方法
Biology (Basel). 2022 Jan 17;11(1):149. doi: 10.3390/biology11010149.
8
Discovery of an Antarctic Ascidian-Associated Uncultivated with Antimelanoma Palmerolide Biosynthetic Potential.发现一种南极内肛动物相关的未培养 ,具有抗黑色素瘤 Palmerolide 生物合成潜力。
mSphere. 2021 Dec 22;6(6):e0075921. doi: 10.1128/mSphere.00759-21. Epub 2021 Dec 1.
9
De novo assembly, transcriptome characterization and marker discovery in Indian major carp, Labeo rohita through pyrosequencing.通过焦磷酸测序对印度鲤鱼罗非鱼进行从头组装、转录组特征分析和标记发现。
Genetica. 2022 Feb;150(1):59-66. doi: 10.1007/s10709-021-00141-7. Epub 2021 Nov 26.
10
Cuttlefish: fast, parallel and low-memory compaction of de Bruijn graphs from large-scale genome collections.乌贼算法:从大规模基因组集合中快速、并行且低内存消耗的 de Bruijn 图压缩。
Bioinformatics. 2021 Jul 12;37(Suppl_1):i177-i186. doi: 10.1093/bioinformatics/btab309.

本文引用的文献

1
Ancestral reconstruction of segmental duplications reveals punctuated cores of human genome evolution.节段性重复的祖先重建揭示了人类基因组进化的间断核心。
Nat Genet. 2007 Nov;39(11):1361-8. doi: 10.1038/ng.2007.9. Epub 2007 Oct 7.
2
Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project.ENCODE试点项目对人类基因组1%的功能元件进行鉴定与分析。
Nature. 2007 Jun 14;447(7146):799-816. doi: 10.1038/nature05874.
3
Polony multiplex analysis of gene expression (PMAGE) in mouse hypertrophic cardiomyopathy.小鼠肥厚型心肌病中基因表达的聚合酶链式反应多重分析(PMAGE)
Science. 2007 Jun 8;316(5830):1481-4. doi: 10.1126/science.1137325.
4
Whole-genome sequencing and assembly with high-throughput, short-read technologies.高通量、短读长技术的全基因组测序和组装。
PLoS One. 2007 May 30;2(5):e484. doi: 10.1371/journal.pone.0000484.
5
High-resolution profiling of histone methylations in the human genome.人类基因组中组蛋白甲基化的高分辨率分析。
Cell. 2007 May 18;129(4):823-37. doi: 10.1016/j.cell.2007.05.009.
6
Protein identification by spectral networks analysis.通过光谱网络分析进行蛋白质鉴定。
Proc Natl Acad Sci U S A. 2007 Apr 10;104(15):6140-5. doi: 10.1073/pnas.0701130104. Epub 2007 Apr 2.
7
Assembling millions of short DNA sequences using SSAKE.使用SSAKE组装数百万条短DNA序列。
Bioinformatics. 2007 Feb 15;23(4):500-1. doi: 10.1093/bioinformatics/btl629. Epub 2006 Dec 8.
8
A Sanger/pyrosequencing hybrid approach for the generation of high-quality draft assemblies of marine microbial genomes.一种用于生成海洋微生物基因组高质量草图组装的桑格/焦磷酸测序混合方法。
Proc Natl Acad Sci U S A. 2006 Jul 25;103(30):11240-5. doi: 10.1073/pnas.0604351103. Epub 2006 Jul 13.
9
Identifying repeat domains in large genomes.识别大型基因组中的重复结构域。
Genome Biol. 2006;7(1):R7. doi: 10.1186/gb-2006-7-1-r7. Epub 2006 Jan 31.
10
Structural variation in the human genome.人类基因组中的结构变异。
Nat Rev Genet. 2006 Feb;7(2):85-97. doi: 10.1038/nrg1767.