Salami Ghaleh Samira, Rahimian Karim, Mahmanzar Mohammadamin, Mahdavi Bahar, Tokhanbigli Samaneh, Mollapour Sisakht Mahsa, Farhadi Amin, Mousakhan Bakhtiari Mahsa, Lee Kuehu Donna, Deng Youping
bioRxiv. 2022 Jul 25:2022.07.22.501212. doi: 10.1101/2022.07.22.501212.
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is an unsegmented positivesense single-stranded RNA virus that belongs to the . This virus was the cause of a novel severe acute respiratory syndrome in 2019 (COVID-19) that emerged in Wuhan, China at the early stage of the pandemic and rapidly spread around the world. Rapid transmission and reproduction of SARS-CoV-2 threaten worldwide health with a high mortality rate from the virus. According to the significant role of non-structural protein 1 (NSP1) in inhibiting host mRNA translation, this study focuses on the link between amino acid sequences of NSP1 and alterations of them spreading around the world. The SARS-CoV-2 NSP1 protein sequences were analyzed and FASTA files were processed by Python language programming libraries. Reference sequences compared with each NSP1 sample to identify every mutation and categorize them were based on continents and frequencies. NSP1 mutations rate divided into continents were different. Based on continental studies, E87D in global vision and also in Europe notably increased. The E87D mutation has significantly risen especially in the last months of the study as the first frequent mutation observed. The remarkable mutations, H110Y and R24C, have the second and third frequencies, respectively. Based on this mutational information, despite NSP1 being a conserved sequence occurrence, these mutations change the rate of flexibility and stability of the NSP1 protein, which can eventually affect inhibiting the host translation.
In this study, we analyzed 6,510,947 sequences of non-structural protein 1 as a conserved region of SARS-CoV-2. According to the obtained results, 93.4819% of samples had no mutant regions on their amino acid sequences. Heat map data of mutational samples demonstrated high percentages of mutations that occurred in the region of 72 to 126 amino acids indicating a hot spot region of the protein. Increased rates of E87D, H110Y, and R24C mutations in the timeline of our study were reported as significant compared to available mutant samples. Analyzing the details of replacing amino acids in the most frequent E87D mutation reveals the role of this alteration in increasing molecule flexibility and destabilizing the structure of the protein.
严重急性呼吸综合征冠状病毒2(SARS-CoV-2)是一种不分节段的正链单链RNA病毒,属于冠状病毒科。该病毒是2019年新型严重急性呼吸综合征(COVID-19)的病原体,疫情初期在中国武汉出现并迅速在全球传播。SARS-CoV-2的快速传播和复制以高病毒死亡率威胁全球健康。鉴于非结构蛋白1(NSP1)在抑制宿主mRNA翻译中的重要作用,本研究聚焦于NSP1氨基酸序列与在全球传播的序列变化之间的联系。对SARS-CoV-2 NSP1蛋白序列进行了分析,并使用Python语言编程库处理FASTA文件。将参考序列与每个NSP1样本进行比较,以识别每个突变并根据大洲和频率对其进行分类。按大洲划分的NSP1突变率各不相同。基于各大洲的研究,在全球范围内以及在欧洲,E87D突变显著增加。E87D突变尤其在研究的最后几个月显著上升,成为首个观察到的高频突变。显著突变H110Y和R24C分别位列第二和第三高频。基于这些突变信息,尽管NSP1是一个保守序列,但这些突变改变了NSP1蛋白的灵活性和稳定性,最终可能影响对宿主翻译的抑制。
在本研究中,我们分析了6510947条作为SARS-CoV-2保守区域的非结构蛋白1序列。根据所得结果,93.4819%的样本在其氨基酸序列上没有突变区域。突变样本的热图数据显示,72至126个氨基酸区域发生的突变比例很高,表明该区域是该蛋白的热点区域。与现有突变样本相比,在我们的研究时间线内,E87D、H110Y和R24C突变率的增加被报告为显著。分析最常见的E87D突变中氨基酸替换的细节,揭示了这种改变在增加分子灵活性和破坏蛋白质结构稳定性方面的作用。