• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

针对纳米孔组装的短读和长读抛光工具进行基准测试:实现暴发分离株的近乎完美基因组。

Benchmarking short and long read polishing tools for nanopore assemblies: achieving near-perfect genomes for outbreak isolates.

机构信息

Department of Computer Science, University of Maryland, College Park, MD, 20742, USA.

Center for Food Safety and Applied Nutrition, Food and Drug Administration, Laurel, MD, 20708, USA.

出版信息

BMC Genomics. 2024 Jul 8;25(1):679. doi: 10.1186/s12864-024-10582-x.

DOI:10.1186/s12864-024-10582-x
PMID:38978005
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11232133/
Abstract

BACKGROUND

Oxford Nanopore provides high throughput sequencing platforms able to reconstruct complete bacterial genomes with 99.95% accuracy. However, even small levels of error can obscure the phylogenetic relationships between closely related isolates. Polishing tools have been developed to correct these errors, but it is uncertain if they obtain the accuracy needed for the high-resolution source tracking of foodborne illness outbreaks.

RESULTS

We tested 132 combinations of assembly and short- and long-read polishing tools to assess their accuracy for reconstructing the genome sequences of 15 highly similar Salmonella enterica serovar Newport isolates from a 2020 onion outbreak. While long-read polishing alone improved accuracy, near perfect accuracy (99.9999% accuracy or ~ 5 nucleotide errors across the 4.8 Mbp genome, excluding low confidence regions) was only obtained by pipelines that combined both long- and short-read polishing tools. Notably, medaka was a more accurate and efficient long-read polisher than Racon. Among short-read polishers, NextPolish showed the highest accuracy, but Pilon, Polypolish, and POLCA performed similarly. Among the 5 best performing pipelines, polishing with medaka followed by NextPolish was the most common combination. Importantly, the order of polishing tools mattered i.e., using less accurate tools after more accurate ones introduced errors. Indels in homopolymers and repetitive regions, where the short reads could not be uniquely mapped, remained the most challenging errors to correct.

CONCLUSIONS

Short reads are still needed to correct errors in nanopore sequenced assemblies to obtain the accuracy required for source tracking investigations. Our granular assessment of the performance of the polishing pipelines allowed us to suggest best practices for tool users and areas for improvement for tool developers.

摘要

背景

牛津纳米孔提供高通量测序平台,能够以 99.95%的准确率重建完整的细菌基因组。然而,即使是很小的错误水平也会掩盖密切相关分离株之间的系统发育关系。已经开发了抛光工具来纠正这些错误,但不确定它们是否能获得用于高分辨率食物源追踪的爆发所需的准确性。

结果

我们测试了 132 种组合的组装和短读和长读抛光工具,以评估它们用于重建 2020 年洋葱爆发中 15 个高度相似的肠炎沙门氏菌纽波特血清型分离株基因组序列的准确性。虽然单独使用长读抛光可以提高准确性,但只有结合使用长读和短读抛光工具的管道才能获得近乎完美的准确性(在 480 万碱基对基因组中,准确率为 99.9999%,或~5 个核苷酸错误,不包括置信度低的区域)。值得注意的是,medaka 是一种比 Racon 更准确和高效的长读抛光机。在短读抛光机中,NextPolish 显示出最高的准确性,但 Pilon、Polypolish 和 POLCA 表现相似。在 5 个表现最好的管道中,用 medaka 进行抛光,然后用 NextPolish 进行抛光是最常见的组合。重要的是,抛光工具的顺序很重要,即用不太准确的工具在后会引入错误。在短读无法唯一映射的同聚物和重复区域中的插入缺失,仍然是最难纠正的错误。

结论

为了获得用于源追踪调查的准确性,仍然需要使用短读来纠正纳米孔测序组装中的错误。我们对抛光管道性能的详细评估使我们能够为工具使用者提供最佳实践建议,并为工具开发者提供改进的领域。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c4c/11232133/264d5c052b8a/12864_2024_10582_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c4c/11232133/222ce647aca2/12864_2024_10582_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c4c/11232133/22448fc4622b/12864_2024_10582_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c4c/11232133/556921a5a71b/12864_2024_10582_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c4c/11232133/264d5c052b8a/12864_2024_10582_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c4c/11232133/222ce647aca2/12864_2024_10582_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c4c/11232133/22448fc4622b/12864_2024_10582_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c4c/11232133/556921a5a71b/12864_2024_10582_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c4c/11232133/264d5c052b8a/12864_2024_10582_Fig4_HTML.jpg

相似文献

1
Benchmarking short and long read polishing tools for nanopore assemblies: achieving near-perfect genomes for outbreak isolates.针对纳米孔组装的短读和长读抛光工具进行基准测试:实现暴发分离株的近乎完美基因组。
BMC Genomics. 2024 Jul 8;25(1):679. doi: 10.1186/s12864-024-10582-x.
2
Polishing the Oxford Nanopore long-read assemblies of bacterial pathogens with Illumina short reads to improve genomic analyses.用 Illumina 短读序列对牛津纳米孔长读序列组装的细菌病原体进行打磨,以改进基因组分析。
Genomics. 2021 May;113(3):1366-1377. doi: 10.1016/j.ygeno.2021.03.018. Epub 2021 Mar 11.
3
How low can you go? Short-read polishing of Oxford Nanopore bacterial genome assemblies.能降到多低?牛津纳米孔细菌基因组组装的短读补洞。
Microb Genom. 2024 Jun;10(6). doi: 10.1099/mgen.0.001254.
4
Benchmarking reveals superiority of deep learning variant callers on bacterial nanopore sequence data.基准测试显示深度学习变异调用程序在细菌纳米孔测序数据上的优越性。
Elife. 2024 Oct 10;13:RP98300. doi: 10.7554/eLife.98300.
5
Polypolish: Short-read polishing of long-read bacterial genome assemblies.多聚波兰:长读细菌基因组组装的短读抛光。
PLoS Comput Biol. 2022 Jan 24;18(1):e1009802. doi: 10.1371/journal.pcbi.1009802. eCollection 2022 Jan.
6
Assembling the perfect bacterial genome using Oxford Nanopore and Illumina sequencing.利用牛津纳米孔测序和Illumina测序组装完美的细菌基因组。
PLoS Comput Biol. 2023 Mar 2;19(3):e1010905. doi: 10.1371/journal.pcbi.1010905. eCollection 2023 Mar.
7
Benchmarking Long-Read Assemblers for Genomic Analyses of Bacterial Pathogens Using Oxford Nanopore Sequencing.基于 Oxford Nanopore 测序的细菌病原体基因组分析的长读长组装器基准测试
Int J Mol Sci. 2020 Dec 1;21(23):9161. doi: 10.3390/ijms21239161.
8
Are we there yet? Benchmarking low-coverage nanopore long-read sequencing for the assembling of mitochondrial genomes using the vulnerable silky shark Carcharhinus falciformis.我们到了吗?使用脆弱的灰鲭鲨(Carcharhinus falciformis)对低覆盖度纳米孔长读测序进行线粒体基因组组装的基准测试。
BMC Genomics. 2022 Apr 22;23(1):320. doi: 10.1186/s12864-022-08482-z.
9
Evaluation of strategies for the assembly of diverse bacterial genomes using MinION long-read sequencing.利用 MinION 长读测序技术评估组装多种细菌基因组的策略。
BMC Genomics. 2019 Jan 9;20(1):23. doi: 10.1186/s12864-018-5381-7.
10
Assembly methods for nanopore-based metagenomic sequencing: a comparative study.基于纳米孔的宏基因组测序的组装方法:一项比较研究。
Sci Rep. 2020 Aug 12;10(1):13588. doi: 10.1038/s41598-020-70491-3.

引用本文的文献

1
Bioactive molecules unearthed by terabase-scale long-read sequencing of a soil metagenome.通过对土壤宏基因组进行太字节规模的长读长测序发掘出的生物活性分子。
Nat Biotechnol. 2025 Sep 12. doi: 10.1038/s41587-025-02810-w.
2
SyFi: generating and using sequence fingerprints to distinguish SynCom isolates.SyFi:生成并使用序列指纹来区分合成群落分离株。
Microb Genom. 2025 Sep;11(9). doi: 10.1099/mgen.0.001461.
3
Genomic evolution of Dublin in cattle and humans in the United States.美国牛和人类中都柏林的基因组进化。

本文引用的文献

1
Long read genome assemblers struggle with small plasmids.长读基因组组装器难以处理小型质粒。
Microb Genom. 2023 May;9(5). doi: 10.1099/mgen.0.001024.
2
Benchmarking of Nanopore R10.4 and R9.4.1 flow cells in single-cell whole-genome amplification and whole-genome shotgun sequencing.纳米孔R10.4和R9.4.1流动槽在单细胞全基因组扩增和全基因组鸟枪法测序中的基准测试
Comput Struct Biotechnol J. 2023 Mar 24;21:2352-2364. doi: 10.1016/j.csbj.2023.03.038. eCollection 2023.
3
Assessment of plasmids for relating the 2020 Salmonella enterica serovar Newport onion outbreak to farms implicated by the outbreak investigation.
Appl Environ Microbiol. 2025 Sep 17;91(9):e0068925. doi: 10.1128/aem.00689-25. Epub 2025 Aug 19.
4
Complete genomic sequences of nine Bacillota isolated from Alaskan permafrost.从阿拉斯加永久冻土中分离出的9种芽孢杆菌纲细菌的完整基因组序列。
Microbiol Resour Announc. 2025 Sep 11;14(9):e0026825. doi: 10.1128/mra.00268-25. Epub 2025 Aug 15.
5
A telomere-to-telomere genome of wild soybean with resistance to soybean cyst nematode X12.对大豆胞囊线虫X12具有抗性的野生大豆的端粒到端粒基因组。
Sci Data. 2025 Aug 13;12(1):1412. doi: 10.1038/s41597-025-05741-y.
6
Decoding bacterial methylomes in four public health-relevant microbial species: nanopore sequencing enables reproducible analysis of DNA modifications.解码四种与公共卫生相关的微生物物种的细菌甲基化组:纳米孔测序可实现对DNA修饰的可重复分析。
BMC Genomics. 2025 Apr 23;26(1):394. doi: 10.1186/s12864-025-11592-z.
7
GoldPolish-target: targeted long-read genome assembly polishing.GoldPolish目标:靶向长读长基因组组装优化
BMC Bioinformatics. 2025 Mar 7;26(1):78. doi: 10.1186/s12859-025-06091-7.
8
Easing genomic surveillance: A comprehensive performance evaluation of long-read assemblers across multi-strain mixture data of HIV-1 and Other pathogenic viruses for constructing a user-friendly bioinformatic pipeline.简化基因组监测:针对 HIV-1 和其他病原性病毒的多菌株混合数据,对长读长组装器进行全面性能评估,以构建用户友好的生物信息学管道。
F1000Res. 2024 May 31;13:556. doi: 10.12688/f1000research.149577.1. eCollection 2024.
评估质粒,以将 2020 年沙门氏菌纽波特洋葱血清暴发与暴发调查中涉及的农场联系起来。
BMC Genomics. 2023 Apr 4;24(1):165. doi: 10.1186/s12864-023-09245-0.
4
Assembling the perfect bacterial genome using Oxford Nanopore and Illumina sequencing.利用牛津纳米孔测序和Illumina测序组装完美的细菌基因组。
PLoS Comput Biol. 2023 Mar 2;19(3):e1010905. doi: 10.1371/journal.pcbi.1010905. eCollection 2023 Mar.
5
Long-read metagenomics paves the way toward a complete microbial tree of life.长读长宏基因组学为构建完整的微生物生命之树铺平了道路。
Nat Methods. 2023 Jan;20(1):30-31. doi: 10.1038/s41592-022-01726-6.
6
Regulator STM0347 Mediates Flagellar Phase Variation via Hin Invertase.调控子 STM0347 通过 Hin 反转酶介导鞭毛相变异。
Int J Mol Sci. 2022 Jul 30;23(15):8481. doi: 10.3390/ijms23158481.
7
New-Generation Sequencing Technology in Diagnosis of Fungal Plant Pathogens: A Dream Comes True?新一代测序技术在植物真菌病原体诊断中的应用:梦想成真?
J Fungi (Basel). 2022 Jul 16;8(7):737. doi: 10.3390/jof8070737.
8
Subtyping Evaluation of Enteritidis Using Single Nucleotide Polymorphism and Core Genome Multilocus Sequence Typing with Nanopore Reads.利用单核苷酸多态性和核心基因组多位点序列分型结合纳米孔读取对肠炎沙门氏菌进行亚型评估。
Appl Environ Microbiol. 2022 Aug 9;88(15):e0078522. doi: 10.1128/aem.00785-22. Epub 2022 Jul 12.
9
The complete sequence of a human genome.人类基因组的完整序列。
Science. 2022 Apr;376(6588):44-53. doi: 10.1126/science.abj6987. Epub 2022 Mar 31.
10
Relation between two evolutionary clocks reveal new insights in bacterial evolution.两个进化时钟之间的关系揭示了细菌进化的新见解。
Access Microbiol. 2022 Feb 16;4(2):000265. doi: 10.1099/acmi.0.000265. eCollection 2022.