Suppr超能文献

定义常见源细菌暴发的基因组流行病学阈值:一项建模研究。

Defining genomic epidemiology thresholds for common-source bacterial outbreaks: a modelling study.

机构信息

Epidemiology and Modelling of Bacterial Escape to Antimicrobials Laboratory, Institut Pasteur, Université Paris Cité, Paris, France; Anti-infective Evasion and Pharmacoepidemiology Team, CESP, Université Paris-Saclay, UVSQ, INSERM U1018, Montigny-le-Bretonneux, France; Institut Pasteur, Université Paris Cité, Biodiversity and Epidemiology of Bacterial Pathogens, Paris, France.

Epidemiology and Modelling of Bacterial Escape to Antimicrobials Laboratory, Institut Pasteur, Université Paris Cité, Paris, France; Anti-infective Evasion and Pharmacoepidemiology Team, CESP, Université Paris-Saclay, UVSQ, INSERM U1018, Montigny-le-Bretonneux, France.

出版信息

Lancet Microbe. 2023 May;4(5):e349-e357. doi: 10.1016/S2666-5247(22)00380-9. Epub 2023 Mar 29.

Abstract

BACKGROUND

Epidemiological surveillance relies on microbial strain typing, which defines genomic relatedness among isolates to identify case clusters and their potential sources. Although predefined thresholds are often applied, known outbreak-specific features such as pathogen mutation rate and duration of source contamination are rarely considered. We aimed to develop a hypothesis-based model that estimates genetic distance thresholds and mutation rates for point-source single-strain food or environmental outbreaks.

METHODS

In this modelling study, we developed a forward model to simulate bacterial evolution at a specific mutation rate (μ) over a defined outbreak duration (D). From the distribution of genetic distances expected under the given outbreak parameters and sample isolation dates, we estimated a distance threshold beyond which isolates should not be considered as part of the outbreak. We embedded the model into a Markov Chain Monte Carlo inference framework to estimate the most probable mutation rate or time since source contamination, which are both often imprecisely documented. A simulation study validated the model over realistic durations and mutation rates. We then identified and analysed 16 published datasets of bacterial source-related outbreaks; datasets were included if they were from an identified foodborne outbreak and if whole-genome sequence data and collection dates for the described isolates were available.

FINDINGS

Analysis of simulated data validated the accuracy of our framework in both discriminating between outbreak and non-outbreak cases and estimating the parameters D and μ from outbreak data. Precision of estimation was much higher for high values of D and μ. Sensitivity of outbreak cases was always very high, and specificity in detecting non-outbreak cases was poor for low mutation rates. For 14 of the 16 outbreaks, the classification of isolates as being outbreak-related or sporadic is consistent with the original dataset. Four of these outbreaks included outliers, which were correctly classified as being beyond the threshold of exclusion estimated by our model, except for one isolate of outbreak 4. For two outbreaks, both foodborne Listeria monocytogenes, conclusions from our model were discordant with published results: in one outbreak two isolates were classified as outliers by our model and in another outbreak our algorithm separated food samples into one cluster and human samples into another, whereas the isolates were initially grouped together based on epidemiological and genetic evidence. Re-estimated values of the duration of outbreak or mutation rate were largely consistent with a priori defined values. However, in several cases the estimated values were higher and improved the fit with the observed genetic distance distribution, suggesting that early outbreak cases are sometimes missed.

INTERPRETATION

We propose here an evolutionary approach to the single-strain conundrum by estimating the genetic threshold and proposing the most probable cluster of cases for a given outbreak, as determined by its particular epidemiological and microbiological properties. This forward model, applicable to foodborne or environmental-source single point case clusters or outbreaks, is useful for epidemiological surveillance and may inform control measures.

FUNDING

European Union Horizon 2020 Research and Innovation Programme.

摘要

背景

流行病学监测依赖于微生物菌株分型,它定义了分离株之间的基因组相关性,以识别病例群及其潜在来源。尽管通常应用预设阈值,但很少考虑已知暴发特有的特征,如病原体突变率和源污染持续时间。我们旨在开发一种基于假设的模型,该模型估计单点源食源性或环境暴发的遗传距离阈值和突变率。

方法

在这项建模研究中,我们开发了一个正向模型,以在特定的突变率(μ)下模拟细菌在定义的暴发持续时间(D)内的进化。从给定暴发参数和样本分离日期下预期的遗传距离分布中,我们估计了一个距离阈值,超过该阈值的分离株不应被视为暴发的一部分。我们将模型嵌入到马尔可夫链蒙特卡罗推理框架中,以估计最可能的突变率或自源污染以来的时间,这两者通常都记录不精确。一项模拟研究验证了该模型在真实持续时间和突变率下的准确性。然后,我们确定并分析了 16 个已发表的细菌源相关暴发数据集;如果数据集来自已确定的食源性暴发,并且可获得描述分离株的全基因组序列数据和收集日期,则将数据集包括在内。

结果

模拟数据的分析验证了我们的框架在区分暴发和非暴发病例以及从暴发数据中估计参数 D 和 μ 方面的准确性。D 和 μ 值较高时,估计的精度要高得多。暴发病例的敏感性始终很高,而对于低突变率,检测非暴发病例的特异性较差。对于 16 个暴发中的 14 个,将分离株分类为暴发相关或散发的结果与原始数据集一致。其中四个暴发包括离群值,这些离群值被正确地归类为超出了我们模型估计的排除阈值,除了暴发 4 的一个分离株。对于两个暴发,都是食源性单核细胞增生李斯特菌,我们模型的分类结果与已发表的结果不一致:在一个暴发中,我们的模型将两个分离株归类为离群值,而在另一个暴发中,我们的算法将食物样本分为一组,人类样本分为另一组,而基于流行病学和遗传证据,这些分离株最初被归为一组。暴发持续时间或突变率的重新估计值与先验定义的值基本一致。然而,在某些情况下,估计值较高,并且改善了与观察到的遗传距离分布的拟合度,这表明暴发早期病例有时会被遗漏。

解释

我们在这里通过估计给定暴发的遗传阈值并提出最可能的病例群,提出了一种用于单株难题的进化方法,这是由其特定的流行病学和微生物学特性决定的。这种适用于食源性或环境源单点病例群或暴发的正向模型可用于流行病学监测,并可能为控制措施提供信息。

资助

欧盟地平线 2020 研究与创新计划。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5f06/10156608/df8c7449301f/gr1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验