Department of Genetic Engineering & Biotechnology, Faculty of Biological Sciences, University of Chittagong, Chattogram 4331, Bangladesh.
Department of Pathology and Parasitology, Chittagong Veterinary and Animal Sciences University, Chattogram 4202, Bangladesh.
Infect Genet Evol. 2020 Oct;84:104389. doi: 10.1016/j.meegid.2020.104389. Epub 2020 Jun 2.
The newly identified SARS-CoV-2 has now been reported from around 185 countries with more than a million confirmed human cases including more than 120,000 deaths. The genomes of SARS-COV-2 strains isolated from different parts of the world are now available and the unique features of constituent genes and proteins need to be explored to understand the biology of the virus. Spike glycoprotein is one of the major targets to be explored because of its role during the entry of coronaviruses into host cells. We analyzed 320 whole-genome sequences and 320 spike protein sequences of SARS-CoV-2 using multiple sequence alignment. In this study, 483 unique variations have been identified among the genomes of SARS-CoV-2 including 25 nonsynonymous mutations and one deletion in the spike (S) protein. Among the 26 variations detected in S, 12 variations were located at the N-terminal domain (NTD) and 6 variations at the receptor-binding domain (RBD) which might alter the interaction of S protein with the host receptor angiotensin-converting enzyme 2 (ACE2). Besides, 22 amino acid insertions were identified in the spike protein of SARS-CoV-2 in comparison with that of SARS-CoV. Phylogenetic analyses of spike protein revealed that Bat coronavirus have a close evolutionary relationship with circulating SARS-CoV-2. The genetic variation analysis data presented in this study can help a better understanding of SARS-CoV-2 pathogenesis. Based on results reported herein, potential inhibitors against S protein can be designed by considering these variations and their impact on protein structure.
现已从全球约 185 个国家报告了新发现的 SARS-CoV-2,确诊病例超过 100 万例,死亡病例超过 12 万例。目前已经获得了来自世界不同地区的 SARS-CoV-2 毒株的基因组,需要探索其组成基因和蛋白质的独特特征,以了解病毒的生物学特性。刺突糖蛋白是需要探索的主要目标之一,因为它在冠状病毒进入宿主细胞的过程中起作用。我们使用多序列比对分析了 320 条全基因组序列和 320 条 SARS-CoV-2 的刺突蛋白序列。在这项研究中,我们在 SARS-CoV-2 的基因组中发现了 483 个独特的变异,包括 25 个非同义突变和刺突(S)蛋白中的一个缺失。在 S 中检测到的 26 个变异中,有 12 个变异位于 N 端结构域(NTD),6 个变异位于受体结合域(RBD),这可能改变 S 蛋白与宿主受体血管紧张素转换酶 2(ACE2)的相互作用。此外,与 SARS-CoV 相比,SARS-CoV-2 的刺突蛋白中还发现了 22 个氨基酸插入。刺突蛋白的系统发育分析表明,蝙蝠冠状病毒与流行的 SARS-CoV-2 具有密切的进化关系。本研究中提供的遗传变异分析数据有助于更好地了解 SARS-CoV-2 的发病机制。基于本文报道的结果,可以考虑这些变异及其对蛋白质结构的影响,设计针对 S 蛋白的潜在抑制剂。