高效准确的大肠杆菌全基因组组装和甲基组分析。

Efficient and accurate whole genome assembly and methylome profiling of E. coli.

机构信息

Expression Analysis, A Quintiles Company, Durham NC 27713, USA.

出版信息

BMC Genomics. 2013 Oct 3;14(1):675. doi: 10.1186/1471-2164-14-675.

DOI:10.1186/1471-2164-14-675

PMID:24090403

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4046830/

Abstract

BACKGROUND

With the price of next generation sequencing steadily decreasing, bacterial genome assembly is now accessible to a wide range of researchers. It is therefore necessary to understand the best methods for generating a genome assembly, specifically, which combination of sequencing and bioinformatics strategies result in the most accurate assemblies. Here, we sequence three E. coli strains on the Illumina MiSeq, Life Technologies Ion Torrent PGM, and Pacific Biosciences RS. We then perform genome assemblies on all three datasets alone or in combination to determine the best methods for the assembly of bacterial genomes.

RESULTS

Three E. coli strains - BL21(DE3), Bal225, and DH5α - were sequenced to a depth of 100× on the MiSeq and Ion Torrent machines and to at least 125× on the PacBio RS. Four assembly methods were examined and compared. The previously published BL21(DE3) genome [GenBank:AM946981.2], allowed us to evaluate the accuracy of each of the BL21(DE3) assemblies. BL21(DE3) PacBio-only assemblies resulted in a 90% reduction in contigs versus short read only assemblies, while N50 numbers increased by over 7-fold. Strikingly, the number of SNPs in PacBio-only assemblies were less than half that seen with short read assemblies (~~20 SNPs vs. ~50 SNPs) and indels also saw dramatic reductions (~~2 indel >5 bp in PacBio-only assemblies vs. ~12 for short-read only assemblies). Assemblies that used a mixture of PacBio and short read data generally fell in between these two extremes. Use of PacBio sequencing reads also allowed us to call covalent base modifications for the three strains. Each of the strains used here had a known covalent base modification genotype, which was confirmed by PacBio sequencing.

CONCLUSION

Using data generated solely from the Pacific Biosciences RS, we were able to generate the most complete and accurate de novo assemblies of E. coli strains. We found that the addition of other sequencing technology data offered no improvements over use of PacBio data alone. In addition, the sequencing data from the PacBio RS allowed for sensitive and specific calling of covalent base modifications.

摘要

背景

随着下一代测序技术价格的稳步下降，细菌基因组组装现在已经可以为广泛的研究人员所接受。因此，有必要了解生成基因组组装的最佳方法，具体来说，哪种测序和生物信息学策略的组合可以产生最准确的组装。在这里，我们在 Illumina MiSeq、Life Technologies Ion Torrent PGM 和 Pacific Biosciences RS 上对三种大肠杆菌菌株进行测序。然后，我们单独或组合使用所有三个数据集进行基因组组装，以确定细菌基因组组装的最佳方法。

结果

我们对三种大肠杆菌菌株 - BL21(DE3)、Bal225 和 DH5α - 在 MiSeq 和 Ion Torrent 机器上进行了深度为 100×的测序，在 PacBio RS 上至少进行了 125×的测序。我们检查和比较了四种组装方法。之前发表的 BL21(DE3)基因组[GenBank:AM946981.2]使我们能够评估每种 BL21(DE3)组装的准确性。与仅使用短读测序组装相比，BL21(DE3)仅使用 PacBio 测序组装的结果使 contigs 减少了 90%，而 N50 数量增加了 7 倍以上。引人注目的是，PacBio 仅使用测序组装的 SNP 数量不到仅使用短读测序组装的 SNP 数量的一半（~~20 个 SNP 与~~50 个 SNP 相比），插入缺失也显著减少（~~2 个大于 5 bp 的插入缺失在 PacBio 仅使用测序组装中，而短读仅使用测序组装中有~~12 个）。使用 PacBio 测序reads 和短读数据的混合物的组装通常介于这两个极端之间。使用 PacBio 测序reads 还使我们能够为三种菌株调用共价碱基修饰。这里使用的每种菌株都有一个已知的共价碱基修饰基因型，这是通过 PacBio 测序确认的。

结论

仅使用 Pacific Biosciences RS 生成的数据，我们能够生成最完整和最准确的大肠杆菌菌株从头组装。我们发现，添加其他测序技术数据并没有比仅使用 PacBio 数据带来改进。此外，PacBio RS 的测序数据允许敏感和特异性地调用共价碱基修饰。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cedd/4046830/44e2b3aec433/12864_2013_5438_Fig1_HTML.jpg

相似文献

Efficient and accurate whole genome assembly and methylome profiling of E. coli.高效准确的大肠杆菌全基因组组装和甲基组分析。

BMC Genomics. 2013 Oct 3;14(1):675. doi: 10.1186/1471-2164-14-675.

Performance comparison of second- and third-generation sequencers using a bacterial genome with two chromosomes.使用具有两条染色体的细菌基因组对第二代和第三代测序仪进行性能比较。

BMC Genomics. 2014 Aug 21;15(1):699. doi: 10.1186/1471-2164-15-699.

PacBio But Not Illumina Technology Can Achieve Fast, Accurate and Complete Closure of the High GC, Complex Two-Chromosome Genome.PacBio技术而非Illumina技术能够实现对高GC含量、复杂的双染色体基因组的快速、准确且完整的封闭。

Front Microbiol. 2017 Aug 2;8:1448. doi: 10.3389/fmicb.2017.01448. eCollection 2017.

An evaluation of the PacBio RS platform for sequencing and de novo assembly of a chloroplast genome.评价 PacBio RS 平台在叶绿体基因组测序和从头组装方面的应用。

BMC Genomics. 2013 Oct 1;14:670. doi: 10.1186/1471-2164-14-670.

Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes.比较长读测序技术在细菌和果蝇基因组分析中的应用。

G3 (Bethesda). 2021 Jun 17;11(6). doi: 10.1093/g3journal/jkab083.

Comparisons of genome assembly tools for characterization of genomes using hybrid sequencing technologies.利用混合测序技术对基因组进行特征分析的基因组组装工具比较。

PeerJ. 2024 Aug 29;12:e17964. doi: 10.7717/peerj.17964. eCollection 2024.

The long and short of it: benchmarking viromics using Illumina, Nanopore and PacBio sequencing technologies.简而言之：使用Illumina、Nanopore和PacBio测序技术对病毒组进行基准测试。

Microb Genom. 2024 Feb;10(2). doi: 10.1099/mgen.0.001198.

Completion of draft bacterial genomes by long-read sequencing of synthetic genomic pools.通过合成基因组文库的长读长测序完成细菌基因组草图

BMC Genomics. 2020 Jul 29;21(1):519. doi: 10.1186/s12864-020-06910-6.

Advantages of genome sequencing by long-read sequencer using SMRT technology in medical area.使用单分子实时（SMRT）技术的长读长测序仪进行基因组测序在医学领域的优势。

Hum Cell. 2017 Jul;30(3):149-161. doi: 10.1007/s13577-017-0168-8. Epub 2017 Mar 31.

Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes.比较长读长测序技术在复杂细菌基因组混合组装中的应用。

Microb Genom. 2019 Sep;5(9). doi: 10.1099/mgen.0.000294. Epub 2019 Aug 30.

引用本文的文献

Genomic Analysis of an Excellent Wine-Making Strain SD-2a.酿酒优良菌株 SD-2a 的基因组分析

Pol J Microbiol. 2022 Jun 19;71(2):279-292. doi: 10.33073/pjm-2022-026.

Testing assembly strategies of Francisella tularensis genomes to infer an evolutionary conservation analysis of genomic structures.测试弗朗西斯菌基因组的组装策略，以推断基因组结构的进化保守性分析。

BMC Genomics. 2021 Nov 14;22(1):822. doi: 10.1186/s12864-021-08115-x.

SMRT sequencing reveals differential patterns of methylation in two O111:H- STEC isolates from a hemolytic uremic syndrome outbreak in Australia.SMRT 测序揭示了澳大利亚溶血尿毒综合征暴发中两株 O111:H-肠出血性大肠杆菌分离株甲基化模式的差异。

Sci Rep. 2019 Jul 1;9(1):9436. doi: 10.1038/s41598-019-45760-5.

Identification of genetic relationships and subspecies signatures in Xylella fastidiosa.鉴定韧皮部坏死病菌属（Xylella fastidiosa）的遗传关系和亚种特征。

BMC Genomics. 2019 Mar 25;20(1):239. doi: 10.1186/s12864-019-5565-9.

The complete methylome of an entomopathogenic bacterium reveals the existence of loci with unmethylated Adenines.昆虫病原细菌的全甲基组揭示了存在未甲基化腺嘌呤的基因座。

Sci Rep. 2018 Aug 14;8(1):12091. doi: 10.1038/s41598-018-30620-5.

A Whole Genome Assembly of the Horn Fly, , and Prediction of Genes with Roles in Metabolism and Sex Determination.角蝇的全基因组组装以及对参与代谢和性别决定的基因的预测

G3 (Bethesda). 2018 May 4;8(5):1675-1686. doi: 10.1534/g3.118.200154.

Single molecule real-time (SMRT) sequencing comes of age: applications and utilities for medical diagnostics.单分子实时 (SMRT) 测序崭露头角：在医学诊断中的应用和用途。

Nucleic Acids Res. 2018 Mar 16;46(5):2159-2168. doi: 10.1093/nar/gky066.

Next-generation sequencing technologies and their application to the study and control of bacterial infections.下一代测序技术及其在细菌感染研究和控制中的应用。

Clin Microbiol Infect. 2018 Apr;24(4):335-341. doi: 10.1016/j.cmi.2017.10.013. Epub 2017 Oct 23.

Whole-Genome Sequencing of Bacterial Pathogens: the Future of Nosocomial Outbreak Analysis.细菌病原体的全基因组测序：医院感染暴发分析的未来

Clin Microbiol Rev. 2017 Oct;30(4):1015-1063. doi: 10.1128/CMR.00016-17.

Genomic sequencing of Neisseria gonorrhoeae to respond to the urgent threat of antimicrobial-resistant gonorrhea.对淋病奈瑟菌进行基因组测序，以应对耐抗菌药物淋病的紧迫威胁。

Pathog Dis. 2017 Jun 1;75(4). doi: 10.1093/femspd/ftx041.

本文引用的文献

Reducing assembly complexity of microbial genomes with single-molecule sequencing.利用单分子测序降低微生物基因组的组装复杂性

Genome Biol. 2013;14(9):R101. doi: 10.1186/gb-2013-14-9-r101.

Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species.Assemblathon2：在三个脊椎动物物种中评估从头组装基因组方法。

Gigascience. 2013 Jul 22;2(1):10. doi: 10.1186/2047-217X-2-10.

Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.非杂交、基于长读长 SMRT 测序数据的完成微生物基因组组装。

Nat Methods. 2013 Jun;10(6):563-9. doi: 10.1038/nmeth.2474. Epub 2013 May 5.

Analysis of RNA base modification and structural rearrangement by single-molecule real-time detection of reverse transcription.通过反转录的单分子实时检测分析 RNA 碱基修饰和结构重排。

J Nanobiotechnology. 2013 Apr 3;11:8. doi: 10.1186/1477-3155-11-8.

Sequence assembly demystified.序列组装揭秘。

Nat Rev Genet. 2013 Mar;14(3):157-67. doi: 10.1038/nrg3367. Epub 2013 Jan 29.

Enhanced 5-methylcytosine detection in single-molecule, real-time sequencing via Tet1 oxidation.通过 Tet1 氧化作用增强单分子实时测序中 5-甲基胞嘧啶的检测。

BMC Biol. 2013 Jan 22;11:4. doi: 10.1186/1741-7007-11-4.

The fast changing landscape of sequencing technologies and their impact on microbial genome assemblies and annotation.测序技术的快速变化及其对微生物基因组组装和注释的影响。

PLoS One. 2012;7(12):e48837. doi: 10.1371/journal.pone.0048837. Epub 2012 Dec 12.

Mind the gap: upgrading genomes with Pacific Biosciences RS long-read sequencing technology.注意差距：使用 Pacific Biosciences RS 长读测序技术升级基因组。

PLoS One. 2012;7(11):e47768. doi: 10.1371/journal.pone.0047768. Epub 2012 Nov 21.

Genome-wide mapping of methylated adenine residues in pathogenic Escherichia coli using single-molecule real-time sequencing.利用单分子实时测序技术对致病性大肠杆菌中甲基化腺嘌呤残基进行全基因组图谱绘制。

Nat Biotechnol. 2012 Dec;30(12):1232-9. doi: 10.1038/nbt.2432. Epub 2012 Nov 8.

The birth of the Epitranscriptome: deciphering the function of RNA modifications.表观转录组的诞生：解读RNA修饰的功能。

Genome Biol. 2012 Oct 31;13(10):175. doi: 10.1186/gb-2012-13-10-175.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

高效准确的大肠杆菌全基因组组装和甲基组分析。

Efficient and accurate whole genome assembly and methylome profiling of E. coli.

机构信息

Expression Analysis, A Quintiles Company, Durham NC 27713, USA.

出版信息

BMC Genomics. 2013 Oct 3;14(1):675. doi: 10.1186/1471-2164-14-675.

DOI:10.1186/1471-2164-14-675

PMID:24090403

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4046830/

Abstract

BACKGROUND

RESULTS

CONCLUSION

摘要

高效准确的大肠杆菌全基因组组装和甲基组分析。

Efficient and accurate whole genome assembly and methylome profiling of E. coli.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

高效准确的大肠杆菌全基因组组装和甲基组分析。

Efficient and accurate whole genome assembly and methylome profiling of E. coli.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献