Miao Miao, Clercq Erik De, Li Guangdi
Hunan Provincial Key Laboratory of Clinical Epidemiology, Xiangya School of Public Health, Central South University, Changsha 410078, China.
Rega Institute for Medical Research, Department of Microbiology, Immunology and Transplantation, KU Leuven, Herestraat 49, B-3000 Leuven, Belgium.
Biomedicines. 2021 Apr 11;9(4):412. doi: 10.3390/biomedicines9040412.
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) caused a global pandemic of coronavirus disease in 2019 (COVID-19). Genome surveillance is a key method to track the spread of SARS-CoV-2 variants. Genetic diversity and evolution of SARS-CoV-2 were analyzed based on 260,673 whole-genome sequences, which were sampled from 62 countries between 24 December 2019 and 12 January 2021. We found that amino acid (AA) substitutions were observed in all SARS-CoV-2 proteins, and the top six proteins with the highest substitution rates were ORF10, nucleocapsid, ORF3a, spike glycoprotein, RNA-dependent RNA polymerase, and ORF8. Among 25,629 amino acid substitutions at 8484 polymorphic sites across the coding region of the SARS-CoV-2 genome, the D614G (93.88%) variant in spike and the P323L (93.74%) variant in RNA-dependent RNA polymerase were the dominant variants on six continents. As of January 2021, the genomic sequences of SARS-CoV-2 could be divided into at least 12 different clades. Distributions of SARS-CoV-2 clades were featured with temporal and geographical dynamics on six continents. Overall, this large-scale analysis provides a detailed mapping of SARS-CoV-2 variants in different geographic areas at different time points, highlighting the importance of evaluating highly prevalent variants in the development of SARS-CoV-2 antiviral drugs and vaccines.
严重急性呼吸综合征冠状病毒2(SARS-CoV-2)在2019年引发了全球冠状病毒病大流行(COVID-19)。基因组监测是追踪SARS-CoV-2变体传播的关键方法。基于2019年12月24日至2021年1月12日期间从62个国家采集的260,673个全基因组序列,分析了SARS-CoV-2的遗传多样性和进化情况。我们发现,在所有SARS-CoV-2蛋白中均观察到氨基酸(AA)替换,替换率最高的前六种蛋白分别是ORF10、核衣壳蛋白、ORF3a、刺突糖蛋白、RNA依赖性RNA聚合酶和ORF8。在SARS-CoV-2基因组编码区8484个多态性位点的25,629个氨基酸替换中,刺突蛋白中的D614G(93.88%)变体和RNA依赖性RNA聚合酶中的P323L(93.74%)变体是六大洲的主要变体。截至2021年1月,SARS-CoV-2的基因组序列可分为至少12个不同的进化枝。SARS-CoV-2进化枝的分布在六大洲呈现出时间和地理动态特征。总体而言,这项大规模分析提供了不同时间点不同地理区域SARS-CoV-2变体的详细图谱,突出了在SARS-CoV-2抗病毒药物和疫苗研发中评估高度流行变体的重要性。