Das Jayanta Kumar, Sengupta Antara, Choudhury Pabitra Pal, Roy Swarup
Department of Pediatrics, School of Medicine, Johns Hopkins University, MD 21205, USA.
Department of Computer Science and Engineering, University of Calcutta, India.
Gene Rep. 2021 Dec;25:101044. doi: 10.1016/j.genrep.2021.101044. Epub 2021 Feb 19.
SARS-CoV-2 is mutating and creating divergent variants by altering the composition of essential constituent proteins. Pharmacologically, it is crucial to understand the diverse mechanism of mutations for stable vaccine or anti-viral drug design. Our current study concentrates on all the constituent proteins of 469 SARS-CoV-2 genome samples, derived from Indian patients. However, the study may easily be extended to the samples across the globe. We perform clustering analysis towards identifying unique variants in each of the SARS-CoV-2 proteins. A total of 536 mutated positions within the coding regions of SARS-CoV-2 proteins are detected among the identified variants from Indian isolates. We quantify mutations by focusing on the unique variants of each SARS-CoV-2 protein. We report the average number of mutation per variant, percentage of mutated positions, synonymous and non-synonymous mutations, mutations occurring in three codon positions and so on. Our study reveals the most susceptible six (06) proteins, which are , , , , , and . Several non-synonymous substitutions are observed to be unique in different SARS-CoV-2 proteins. A total of 57 possible deleterious amino acid substitutions are predicted, which may impact on the protein functions. Several mutations show a large decrease in protein stability and are observed in putative functional domains of the proteins that might have some role in disease pathogenesis. We observe a good number of physicochemical property change during above deleterious substitutions.
严重急性呼吸综合征冠状病毒2(SARS-CoV-2)正在发生变异,并通过改变必需组成蛋白的组成来产生不同的变体。从药理学角度来看,了解突变的多种机制对于稳定的疫苗或抗病毒药物设计至关重要。我们目前的研究集中于来自印度患者的469个SARS-CoV-2基因组样本的所有组成蛋白。然而,该研究可以很容易地扩展到全球的样本。我们进行聚类分析以识别SARS-CoV-2每种蛋白中的独特变体。在从印度分离株中鉴定出的变体中,共检测到SARS-CoV-2蛋白编码区域内的536个突变位点。我们通过关注SARS-CoV-2每种蛋白的独特变体来量化突变。我们报告了每个变体的平均突变数、突变位点的百分比、同义突变和非同义突变、在三个密码子位置发生的突变等。我们的研究揭示了最易受影响的六种蛋白,即 、 、 、 、 和 。在不同的SARS-CoV-2蛋白中观察到几种非同义替换是独特的。总共预测了57种可能有害的氨基酸替换,这可能会影响蛋白质功能。在蛋白质的假定功能域中观察到几种突变导致蛋白质稳定性大幅下降,这些功能域可能在疾病发病机制中起一定作用。在上述有害替换过程中,我们观察到大量理化性质的变化。