Center for Food Safety and Nutrition, Food and Drug Administration, Laurel, MD, USA.
Center for Food Safety and Nutrition, Food and Drug Administration, College Park, MD, USA.
BMC Genomics. 2023 Apr 4;24(1):165. doi: 10.1186/s12864-023-09245-0.
The Salmonella enterica serovar Newport red onion outbreak of 2020 was the largest foodborne outbreak of Salmonella in over a decade. The epidemiological investigation suggested two farms as the likely source of contamination. However, single nucleotide polymorphism (SNP) analysis of the whole genome sequencing data showed that none of the Salmonella isolates collected from the farm regions were linked to the clinical isolates-preventing the use of phylogenetics in source identification. Here, we explored an alternative method for analyzing the whole genome sequencing data driven by the hypothesis that if the outbreak strain had come from the farm regions, then the clinical isolates would disproportionately contain plasmids found in isolates from the farm regions due to horizontal transfer.
SNP analysis confirmed that the clinical isolates formed a single, nearly-clonal clade with evidence for ancestry in California going back a decade. The clinical clade had a large core genome (4,399 genes) and a large and sparsely distributed accessory genome (2,577 genes, at least 64% on plasmids). At least 20 plasmid types occurred in the clinical clade, more than were found in the literature for Salmonella Newport. A small number of plasmids, 14 from 13 clinical isolates and 17 from 8 farm isolates, were found to be highly similar (> 95% identical)-indicating they might be related by horizontal transfer. Phylogenetic analysis was unable to determine the geographic origin, isolation source, or time of transfer of the plasmids, likely due to their promiscuous and transient nature. However, our resampling analysis suggested that observing a similar number and combination of highly similar plasmids in random samples of environmental Salmonella enterica within the NCBI Pathogen Detection database was unlikely, supporting a connection between the outbreak strain and the farms implicated by the epidemiological investigation.
Horizontally transferred plasmids provided evidence for a connection between clinical isolates and the farms implicated as the source of the outbreak. Our case study suggests that such analyses might add a new dimension to source tracking investigations, but highlights the need for detailed and accurate metadata, more extensive environmental sampling, and a better understanding of plasmid molecular evolution.
2020 年沙门氏菌纽波特洋葱暴发是十多年来最大的食源性沙门氏菌暴发。流行病学调查表明,有两个农场可能是污染的来源。然而,全基因组测序数据的单核苷酸多态性(SNP)分析表明,从农场地区采集的沙门氏菌分离株中没有一株与临床分离株有关联,从而阻止了使用系统发生学进行来源识别。在这里,我们探索了一种替代方法来分析全基因组测序数据,其假设是如果暴发菌株来自农场地区,那么由于水平转移,临床分离株中会不成比例地包含来自农场地区分离株的质粒。
SNP 分析证实,临床分离株形成了一个单一的、近乎克隆的分支,其在加利福尼亚州的起源可以追溯到十年前。临床分支有一个大的核心基因组(4399 个基因)和一个大的、稀疏分布的辅助基因组(2577 个基因,至少 64%在质粒上)。临床分支中至少有 20 种质粒类型,比文献中报道的沙门氏菌纽波特还要多。从 13 个临床分离株中发现了 14 个,从 8 个农场分离株中发现了 17 个,发现它们高度相似(>95%相同),表明它们可能通过水平转移有关联。系统发育分析无法确定质粒的地理起源、分离源或转移时间,这可能是由于它们的混杂性和短暂性。然而,我们的重采样分析表明,在 NCBI 病原体检测数据库中,随机抽取的环境沙门氏菌 enterica 中的相似数量和组合的高度相似质粒不太可能出现,这支持了暴发菌株与流行病学调查中涉及的农场之间的联系。
水平转移的质粒为临床分离株与被认为是暴发源的农场之间的联系提供了证据。我们的案例研究表明,这种分析可能为来源追踪调查增加一个新的维度,但也强调了需要详细和准确的元数据、更广泛的环境采样以及更好地了解质粒分子进化。