Departamento de Fitopatologia, Universidade de Brasília, Brasília, DF, 70910-900, Brazil.
Departamento de Veterinária, Universidade Federal de Viçosa, Viçosa, MG, 36570-900, Brazil.
Sci Rep. 2020 Oct 26;10(1):18289. doi: 10.1038/s41598-020-74050-8.
The World Health Organization characterized COVID-19 as a pandemic in March 2020, the second pandemic of the twenty-first century. Expanding virus populations, such as that of SARS-CoV-2, accumulate a number of narrowly shared polymorphisms, imposing a confounding effect on traditional clustering methods. In this context, approaches that reduce the complexity of the sequence space occupied by the SARS-CoV-2 population are necessary for robust clustering. Here, we propose subdividing the global SARS-CoV-2 population into six well-defined subtypes and 10 poorly represented genotypes named tentative subtypes by focusing on the widely shared polymorphisms in nonstructural (nsp3, nsp4, nsp6, nsp12, nsp13 and nsp14) cistrons and structural (spike and nucleocapsid) and accessory (ORF8) genes. The six subtypes and the additional genotypes showed amino acid replacements that might have phenotypic implications. Notably, three mutations (one of them in the Spike protein) were responsible for the geographical segregation of subtypes. We hypothesize that the virus subtypes detected in this study are records of the early stages of SARS-CoV-2 diversification that were randomly sampled to compose the virus populations around the world. The genetic structure determined for the SARS-CoV-2 population provides substantial guidelines for maximizing the effectiveness of trials for testing candidate vaccines or drugs.
世界卫生组织于 2020 年 3 月将 COVID-19 定性为大流行,这是二十一世纪的第二次大流行。病毒种群的扩大,如 SARS-CoV-2 的种群,积累了许多狭窄共享的多态性,对传统的聚类方法产生了干扰效应。在这种情况下,需要减少 SARS-CoV-2 种群占据的序列空间复杂性的方法来进行稳健的聚类。在这里,我们通过关注非结构(nsp3、nsp4、nsp6、nsp12、nsp13 和 nsp14 顺式)基因和结构(刺突和核衣壳)和辅助(ORF8)基因中广泛共享的多态性,将全球 SARS-CoV-2 种群分为六个明确的亚型和 10 个代表性较差的基因型,命名为暂定亚型。六个亚型和额外的基因型显示出可能具有表型影响的氨基酸替换。值得注意的是,三个突变(其中一个在刺突蛋白中)导致了亚型的地理隔离。我们假设本研究中检测到的病毒亚型是 SARS-CoV-2 多样化的早期阶段的记录,这些记录是随机抽样的,用于组成世界各地的病毒种群。确定的 SARS-CoV-2 种群的遗传结构为最大限度地提高候选疫苗或药物试验的有效性提供了重要指导。