University of Colorado Anschutz Medical Campus, 13001 East 17th Place, Aurora, CO, USA; Center for Genes, Environment, and Health, National Jewish Health, Smith Building, Room A651, 1400 Jackson Street, Denver, CO, USA.
University of Colorado Anschutz Medical Campus, 13001 East 17th Place, Aurora, CO, USA; Center for Genes, Environment, and Health, National Jewish Health, Smith Building, Room A651, 1400 Jackson Street, Denver, CO, USA.
Infect Genet Evol. 2022 Jan;97:105153. doi: 10.1016/j.meegid.2021.105153. Epub 2021 Nov 18.
Amid the ongoing COVID-19 pandemic, it has become increasingly important to monitor the mutations that arise in the SARS-CoV-2 virus, to prepare public health strategies and guide the further development of vaccines and therapeutics. The spike (S) protein and the proteins comprising the RNA-Dependent RNA Polymerase (RdRP) are key vaccine and drug targets, respectively, making mutation surveillance of these proteins of great importance. Full protein sequences were downloaded from the GISAID database, aligned, and the variants identified. 437,006 unique viral genomes were analyzed. Polymorphisms in the protein sequence were investigated and examined longitudinally to identify sequence and strain variants appearing between January 5th, 2020 and January 16th, 2021. A structural analysis was also performed to investigate mutations in the receptor binding domain and the N-terminal domain of the spike protein. Within the spike protein, there were 766 unique mutations observed in the N-terminal domain and 360 in the receptor binding domain. Four residues that directly contact ACE2 were mutated in more than 100 sequences, including positions K417, Y453, S494, and N501. Within the furin cleavage site of the spike protein, a high degree of conservation was observed, but the P681H mutation was observed in 10.47% of sequences analyzed. Within the RNA dependent RNA polymerase complex proteins, 327 unique mutations were observed in Nsp8, 166 unique mutations were observed in Nsp7, and 1157 unique mutations were observed in Nsp12. Only 4 sequences analyzed contained mutations in the 9 residues that directly interact with the therapeutic Remdesivir, suggesting limited mutations in drug interacting residues. The identification of new variants emphasizes the need for further study on the effects of the mutations and the implications of increased prevalence, particularly for vaccine or therapeutic efficacy.
在持续的 COVID-19 大流行期间,监测 SARS-CoV-2 病毒中出现的突变以制定公共卫生策略并指导疫苗和疗法的进一步发展变得越来越重要。刺突(S)蛋白和构成 RNA 依赖性 RNA 聚合酶(RdRP)的蛋白质分别是疫苗和药物的关键靶点,因此监测这些蛋白质的突变非常重要。从 GISAID 数据库下载完整的蛋白质序列,进行比对,并确定变体。分析了 437,006 个独特的病毒基因组。研究了蛋白质序列中的多态性,并进行了纵向研究,以确定 2020 年 1 月 5 日至 2021 年 1 月 16 日之间出现的序列和菌株变体。还进行了结构分析,以研究刺突蛋白的受体结合域和 N 端结构域中的突变。在刺突蛋白中,在 N 端结构域中观察到 766 个独特的突变,在受体结合域中观察到 360 个突变。有 4 个直接与 ACE2 结合的残基发生突变的序列超过 100 个,包括位置 K417、Y453、S494 和 N501。在刺突蛋白的弗林裂解位点中观察到高度的保守性,但在分析的 10.47%的序列中观察到 P681H 突变。在 RNA 依赖性 RNA 聚合酶复合物蛋白中,在 Nsp8 中观察到 327 个独特的突变,在 Nsp7 中观察到 166 个独特的突变,在 Nsp12 中观察到 1157 个独特的突变。在分析的 4 个序列中,只有 4 个序列包含与治疗药物瑞德西韦直接相互作用的 9 个残基的突变,这表明药物相互作用残基的突变有限。新变体的鉴定强调需要进一步研究突变的影响以及流行率增加的影响,特别是对疫苗或治疗效果的影响。