Pearson Talima, Furstenau Tara, Wood Colin, Rigas Vanessa, Drake Kylie, Sahl Jason, Maltinsky Sara, Currie Bart J, Mayo Mark, Hall Carina, Keim Paul, Fofanov Viacheslav
Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, AZ 86011.
Health and Bioinformatics, School of Informatics, Computing, and Cyber Systems, Northern Arizona University, Flagstaff, AZ 86011.
Proc Natl Acad Sci U S A. 2025 Jun 10;122(23):e2424797122. doi: 10.1073/pnas.2424797122. Epub 2025 Jun 3.
Genomic diversity in pathogen populations is foundational for evolution and adaptation. Understanding population-level diversity is also essential for tracking sources and revealing detailed pathways of transmission and spread. For bacteria, culturing, isolating, and sequencing the large number of individual colonies required to adequately sample diversity can be prohibitively time-consuming and expensive. While sequencing directly from a mixed population will show variants among reads, they cannot be linked to reveal allele combinations associated with phylogenetic inheritance patterns. Here, we describe the theory and method for using population sequencing directly from a mixed sample, along with a minimal number of individually sequenced colonies, to describe the phylogenetic diversity of a population without haplotype reconstruction. To demonstrate the utility of population sequencing in capturing phylogenetic diversity, we compared isogenic clones to population sequences of from sputum of a single patient. Our results point to the pathogen population being highly structured, suggesting that for some pathogens, sputum sampling may preserve structuring in the lungs and thus present a noninvasive alternative to understanding colonization, movement, and pathogen/host interactions. We also analyzed population sequences of derived from different people and different body sites to reveal directionality of transmission between hosts and across body sites, demonstrating the power and utility for characterizing the spread of disease and identification of reservoirs at the finest levels. We anticipate that population sequencing and analysis can be broadly applied to accelerate research in a wide range of fields reliant on a foundational understanding of population phylogenetic diversity.
病原体群体中的基因组多样性是进化和适应的基础。了解群体水平的多样性对于追踪来源以及揭示传播和扩散的详细途径也至关重要。对于细菌而言,培养、分离并对大量个体菌落进行测序以充分采样多样性可能极其耗时且昂贵。虽然直接从混合群体中测序会显示读段之间的变异,但无法将它们联系起来以揭示与系统发育遗传模式相关的等位基因组合。在这里,我们描述了直接从混合样本以及极少量单独测序的菌落进行群体测序的理论和方法,以描述群体的系统发育多样性而无需单倍型重建。为了证明群体测序在捕获系统发育多样性方面的效用,我们将同基因克隆与来自一名患者痰液的群体序列进行了比较。我们的结果表明病原体群体具有高度的结构,这表明对于某些病原体,痰液采样可能保留肺部的结构,从而为理解定植、移动以及病原体/宿主相互作用提供一种非侵入性的替代方法。我们还分析了来自不同人群和不同身体部位的群体序列,以揭示宿主之间以及跨身体部位的传播方向性,证明了在最精细层面表征疾病传播和识别储存库的能力和效用。我们预计群体测序和分析可广泛应用,以加速依赖对群体系统发育多样性有基本理解的广泛领域的研究。