Performance and Accuracy of Four Open-Source Tools for In Silico Serotyping of Salmonella spp. Based on Whole-Genome Short-Read Sequencing Data.

Affiliations

German Federal Institute for Risk Assessment (BfR), Berlin, Germany.

Publication information

Appl Environ Microbiol. 2020 Feb 18;86(5). doi: 10.1128/AEM.02265-19.

PMID: 31862714

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7028957/

Abstract

We compared the performance of four open-source Salmonella in silico typing tools (SeqSero, SeqSero2, Salmonella In Silico Typing Resource [SISTR], and Metric Oriented Sequence Typer [MOST]) to assess their potential for replacing laboratory serological testing with serovar predictions from whole-genome sequencing data. We conducted a retrospective analysis of 1,624 Salmonella isolates of 72 serovars submitted to the German National Salmonella Reference Laboratory between 1999 and 2019. All isolates originated from animals or foodstuffs. We conducted Illumina short-read sequencing and compared the serovar prediction results with the results of routine laboratory serotyping. We found the best-performing serovar prediction tool to be SISTR, with 94% correctly typed isolates, followed by SeqSero2 (87%), SeqSero (81%), and MOST (79%). Furthermore, we found that mapping-based tools like SeqSero and SeqSero2 (allele mode) were more reliable for the prediction of monophasic variants, while sequence type and cluster-based methods like MOST and SISTR (core-genome multilocus sequence type [cgMLST]) showed greater resilience when confronted with GC-biased sequencing data. We showed that the choice of library preparation kit could substantially affect O antigen detection, due to the low GC content of the wzx and wzy genes. Although the accuracy of computational serovar predictions is still not quite on par with traditional serotyping by reference laboratories, the command-line tools investigated in this study perform a rapid, efficient, inexpensive, and reproducible analysis, which can be integrated into in-house characterization pipelines. Based on our results, we find SISTR most suitable for automated, routine serotyping for public health surveillance of Salmonella spp.

Importance

Salmonella spp. are important foodborne pathogens. To reduce the number of infected patients, it is essential to understand which subtypes of the bacteria cause disease outbreaks. Traditionally, characterization of Salmonella requires serological testing, a laboratory method by which isolates can be classified into over 2,600 distinct subtypes, called serovars. Due to recent advances in whole-genome sequencing, many tools have been developed to replace traditional testing methods with computational analysis of genome sequences. It is crucial to validate that these tools, many already in use for routine surveillance, deliver accurate and reliable serovar information. In this study, we set out to determine which of the currently available open-source command-line tools is most suitable to replace serological testing. A thorough evaluation of the differing computational approaches is highly important to ensure the backward compatibility of serotyping data and to maintain comparability between laboratories.
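The comparison summarized in the abstract, in which each tool's predicted serovar is scored against the laboratory serotype, can be illustrated with a minimal concordance calculation. This is only a sketch under stated assumptions: the function name `concordance_by_tool`, the record layout, the exact-match rule, and the toy records are all hypothetical and are not taken from the study, whose actual matching criteria may differ.

```python
from collections import defaultdict


def concordance_by_tool(records):
    """Return {tool: fraction of isolates whose prediction matches the reference}.

    `records` is an iterable of (tool, reference_serovar, predicted_serovar)
    tuples. Matching is a case-insensitive exact string comparison, which is
    a simplifying assumption of this sketch, not the study's scoring rule.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for tool, reference, predicted in records:
        total[tool] += 1
        if reference.strip().lower() == predicted.strip().lower():
            correct[tool] += 1
    return {tool: correct[tool] / total[tool] for tool in total}


# Made-up toy records for illustration only; not data from the study.
toy_records = [
    ("SISTR", "Enteritidis", "Enteritidis"),
    ("SISTR", "Typhimurium", "Typhimurium"),
    ("MOST", "Enteritidis", "Enteritidis"),
    ("MOST", "Typhimurium", "Derby"),
]

rates = concordance_by_tool(toy_records)
for tool, rate in sorted(rates.items()):
    print(f"{tool}: {rate:.0%} correctly typed")
```

In practice, a real evaluation would likely need a more careful matching rule than exact string equality, for example to handle monophasic variants or partial antigenic formulas.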


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/564a/7028957/4ef1732bda54/AEM.02265-19-f0001.jpg
