A Simple and Robust Statistical Method to Define Genetic Relatedness of Samples Related to Outbreaks at the Genomic Scale - Application to Retrospective Foodborne Outbreak Investigations.

Author information

Radomski Nicolas, Cadel-Six Sabrina, Cherchame Emeline, Felten Arnaud, Barbet Pauline, Palma Federica, Mallet Ludovic, Le Hello Simon, Weill François-Xavier, Guillier Laurent, Mistou Michel-Yves

Affiliations

ANSES, Laboratory for Food Safety, Université PARIS-EST, Maisons-Alfort, France.

Unité des Bactéries Pathogènes Entériques, Institut Pasteur, Centre National de Référence des Salmonella, Paris, France.

Publication information

Front Microbiol. 2019 Oct 24;10:2413. doi: 10.3389/fmicb.2019.02413. eCollection 2019.

DOI:10.3389/fmicb.2019.02413

PMID:31708892

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC6821717/

Abstract

The investigation of foodborne outbreaks (FBOs) from genomic data typically relies on inspecting the relatedness of samples through a phylogenomic tree computed on either SNPs, genes, kmers, or alleles (i.e., cgMLST and wgMLST). The phylogenomic reconstruction is often time-consuming and computation-intensive, and it depends on hidden assumptions, pipeline implementation and parameterization. In the context of FBO investigations, robust links between isolates are required in a timely manner to trigger appropriate management actions. Here, we propose a non-parametric statistical method to assert the relatedness of samples (i.e., outbreak cases) or to reject it (i.e., non-outbreak cases). With typical computation running within minutes on a desktop computer, we benchmarked the ability of three non-parametric statistical tests (i.e., Wilcoxon rank-sum, Kolmogorov-Smirnov and Kruskal-Wallis) on six different genomic features (i.e., SNPs, SNPs excluding recombination events, genes, kmers, cgMLST alleles, and wgMLST alleles) to discriminate outbreak cases (i.e., positive control: C+) from non-outbreak cases (i.e., negative control: C-). We leveraged four well-characterized and retrospectively investigated FBOs of Salmonella Typhimurium and its monophasic variant S. 1,4,[5],12:i:- from France, setting positive and negative controls in all the assays. We show that the approaches relying on pairwise SNP differences distinguished all four considered outbreaks, in contrast to the other tested genomic features (i.e., genes, kmers, cgMLST alleles, and wgMLST alleles). The freely available non-parametric method written in R has been designed to be independent of both the phylogenomic reconstruction and the detection methods of genomic features (i.e., SNPs, genes, kmers, or alleles), making it widely and easily usable by anybody working on genomic data from suspected samples.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b115/6821717/004fe627af4b/fmicb-10-02413-g001.jpg
