Pekar Jonathan E, Moshiri Niema, Lemey Philippe, Crits-Christoph Alexander, Débarre Florence, Goldstein Stephen A, Hensel Zach, Rambaut Andrew, Worobey Michael, Holmes Edward C, Wertheim Joel O
Institute of Ecology and Evolution, University of Edinburgh, Ashworth Laboratories, Charlotte Auerbach Rd, Edinburgh EH9 3FL, United Kingdom.
Department of Medicine, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093, United States.
Virus Evol. 2025 Feb 22;11(1):veaf008. doi: 10.1093/ve/veaf008. eCollection 2025.
Understanding the genomic diversity of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) at the outset of the coronavirus disease 2019 pandemic can provide insight into the circumstances leading to its emergence. Early SARS-CoV-2 genomic diversity has been classified into two distinct viral lineages, denoted "A" and "B," which we hypothesized were separately introduced into humans. Recently published data contain two genomes with a haplotype suggested to be an evolutionary intermediate to these two lineages, known as "T/T." We used a phylodynamic approach to analyze SARS-CoV-2 genomes from early 2020 to determine whether these two T/T genomes represent an evolutionarily intermediate haplotype between lineages A and B, or if they are a later descendent of either of these two lineages. We find that these two recently published T/T genomes do not represent an evolutionarily intermediate haplotype and were, instead, derived from either lineage A or lineage B. However, we cannot conclusively determine from which lineage they were derived. After including additional data from the start of the pandemic, including these two T/T genomes, we again find a discrepancy in the molecular clock when inferring the ancestral haplotype of SARS-CoV-2, corroborating existing evidence for the separate introductions of SARS-CoV-2 lineages A and B into the human population in late 2019.
了解2019冠状病毒病大流行初期严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的基因组多样性,有助于深入了解其出现的情况。早期的SARS-CoV-2基因组多样性已被分为两个不同的病毒谱系,分别标记为“A”和“B”,我们推测它们是分别传入人类的。最近发表的数据包含两个基因组,其单倍型被认为是这两个谱系的进化中间体,称为“T/T”。我们采用系统动力学方法分析了2020年初的SARS-CoV-2基因组,以确定这两个T/T基因组是代表A和B谱系之间的进化中间单倍型,还是这两个谱系中任何一个的后代。我们发现,这两个最近发表的T/T基因组并不代表进化中间单倍型,而是来自A谱系或B谱系。然而,我们无法最终确定它们来自哪个谱系。在纳入大流行开始时的其他数据,包括这两个T/T基因组后,我们在推断SARS-CoV-2的祖先单倍型时再次发现分子钟存在差异,这证实了现有证据,即SARS-CoV-2的A和B谱系在2019年末分别传入人类群体。