School of Computing Science, Simon Fraser University, Burnaby, British Columbia, Canada V5A 1S6.
Department of Infectious Disease Epidemiology, Imperial College, Praed Street, London W2 1NY, UK.
Philos Trans R Soc Lond B Biol Sci. 2022 Oct 10;377(1861):20210231. doi: 10.1098/rstb.2021.0231. Epub 2022 Aug 22.
The field of genomic epidemiology is rapidly growing as many jurisdictions begin to deploy whole-genome sequencing (WGS) in their national or regional pathogen surveillance programmes. WGS data offer a rich view of the shared ancestry of a set of taxa, typically visualized with phylogenetic trees illustrating the clusters or subtypes present in a group of taxa, their relatedness and the extent of diversification within and between them. When methicillin-resistant (MRSA) arose and disseminated widely, phylogenetic trees of MRSA-containing types of had a distinctive 'comet' shape, with a 'comet head' of recently adapted drug-resistant isolates in the context of a 'comet tail' that was predominantly drug-sensitive. Placing an isolate in the context of such a 'comet' helped public health laboratories interpret local data within the broader setting of evolution. In this work, we ask what other tree shapes, analogous to the MRSA comet, are present in bacterial WGS datasets. We extract trees from large bacterial genomic datasets, visualize them as images and cluster the images. We find nine major groups of tree images, including the 'comets', star-like phylogenies, 'barbell' phylogenies and other shapes, and comment on the evolutionary and epidemiological stories these shapes might illustrate. This article is part of a discussion meeting issue 'Genomic population structures of microbial pathogens'.
基因组流行病学领域正在迅速发展,因为许多司法管辖区开始在其国家或地区病原体监测计划中部署全基因组测序(WGS)。WGS 数据提供了一组分类单元的共同祖先的丰富视图,通常通过显示聚类或亚类存在的系统发育树来直观表示,这些聚类或亚类展示了群体中的分类单元、它们的亲缘关系以及它们内部和之间的多样化程度。当耐甲氧西林金黄色葡萄球菌(MRSA)出现并广泛传播时,包含 MRSA 的类型的系统发育树具有独特的“彗星”形状,在“彗星尾巴”中主要是药物敏感的情况下,有一个“彗星头”,其中是最近适应药物的耐药分离株。在这样的“彗星”背景下对 分离株进行定位有助于公共卫生实验室在更广泛的 进化背景下解释本地数据。在这项工作中,我们询问在细菌 WGS 数据集中是否存在其他类似于 MRSA 彗星的树形状。我们从大型细菌基因组数据集提取树,将它们可视化成图像并对图像进行聚类。我们发现了九组主要的树状图像,包括“彗星”、星形系统发育树、“哑铃”系统发育树和其他形状,并评论了这些形状可能说明的进化和流行病学故事。本文是“微生物病原体的基因组种群结构”讨论会议议题的一部分。