Comparison of SNP-based subtyping workflows for bacterial isolates using WGS data, applied to Salmonella enterica serotype Typhimurium and serotype 1,4,[5],12:i:-

Authors

Saltykova Assia, Wuyts Véronique, Mattheus Wesley, Bertrand Sophie, Roosens Nancy H C, Marchal Kathleen, De Keersmaecker Sigrid C J

Affiliations

Platform Biotechnology and Molecular Biology, Scientific Institute of Public Health, Brussels, Belgium.

Department of Information Technology, IDLab, Ghent University, IMEC, Ghent, Belgium.

Publication information

PLoS One. 2018 Feb 6;13(2):e0192504. doi: 10.1371/journal.pone.0192504. eCollection 2018.

DOI:10.1371/journal.pone.0192504

PMID:29408896

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC5800660/

Abstract

Whole genome sequencing (WGS) is a promising technology for subtyping bacterial pathogens. Alongside the technological advances that have pushed the approach forward, recent years have seen considerable evolution in WGS data analysis methods. Before the technology can be applied as a routine epidemiological typing tool, however, reliable and efficient data analysis strategies need to be identified among the wide variety of methodologies that have emerged. In this work, we compared three existing SNP-based subtyping workflows using a benchmark dataset of 32 Salmonella enterica subsp. enterica serovar Typhimurium and serovar 1,4,[5],12:i:- isolates, including five isolates from a confirmed outbreak and three isolates obtained from the same patient at different time points. The analysis was carried out on the original (high-coverage) dataset and a down-sampled (low-coverage) dataset, each with two different reference genomes. All three tested workflows, namely the CSI Phylogeny-based, CFSAN-based and PHEnix-based workflows, correctly grouped the confirmed outbreak isolates and the isolates from the same patient with all combinations of reference genomes and datasets. However, the workflows differed strongly in the SNP distances between isolates and in their sensitivity to sequencing coverage, which could be linked to the specific data analysis strategies used in each. To demonstrate the effect of particular data analysis steps, several modifications of the existing workflows were also tested. This allowed us to propose the data analysis schemes most suitable for routine SNP-based subtyping of S. Typhimurium and S. 1,4,[5],12:i:-. The results presented in this study illustrate the importance of using correct data analysis strategies, and of benchmarking and fine-tuning the parameters applied within routine data analysis pipelines, to obtain optimal results.
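The workflows are compared above by the SNP distances they report between isolates. As a rough illustration of what such a distance is, the sketch below counts differing positions between aligned SNP sequences and skips ambiguous or gap sites. It is not the method of CSI Phylogeny, CFSAN or PHEnix; the function names, the isolate labels and the toy alignment are all hypothetical.

```python
from itertools import combinations
from typing import Dict, Tuple


def snp_distance(seq_a: str, seq_b: str, ignore: str = "N-") -> int:
    """Count aligned positions where two sequences differ.

    Positions where either sequence carries an ambiguous base or a gap
    (characters listed in `ignore`) are skipped. This is an assumption of
    this sketch, not a rule taken from the compared workflows.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(
        1
        for a, b in zip(seq_a.upper(), seq_b.upper())
        if a != b and a not in ignore and b not in ignore
    )


def pairwise_distances(alignment: Dict[str, str]) -> Dict[Tuple[str, str], int]:
    """Return the SNP distance for every unordered pair of isolates."""
    return {
        (x, y): snp_distance(alignment[x], alignment[y])
        for x, y in combinations(sorted(alignment), 2)
    }


# Hypothetical toy alignment: three closely related isolates and one unrelated.
alignment = {
    "outbreak_1": "ACGTACGTAC",
    "outbreak_2": "ACGTACGTAC",
    "outbreak_3": "ACGTACNTAC",  # one site not called, e.g. due to low coverage
    "sporadic_1": "ATGTACGTTC",
}

distances = pairwise_distances(alignment)
for pair, dist in distances.items():
    print(pair, dist)
```

In this toy case the uncalled site in `outbreak_3` is ignored rather than counted as a difference. How such sites are handled is one of the design choices that can make reported distances depend on sequencing coverage.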

