Department of Mathematics, Pingla Thana Mahavidyalaya, Maligram, Paschim Medinipur, 721140, West Bengal, India.
Applied Statistics Unit, Indian Statistical Institute, Kolkata 700108, West Bengal, India.
Genomics. 2020 Nov;112(6):3890-3892. doi: 10.1016/j.ygeno.2020.07.001. Epub 2020 Jul 5.
In the NCBI database, as on June 6, 2020, total number of available complete genome sequences of SARS-CoV2 across the world is 3617. The envelope (E) protein of SARS-CoV2 possesses several non-synonymous mutations over the transmembrane and C-terminus domains in 15 (0.414%) genomes among 3617 SARS-CoV2 genomes, analyzed. More precisely, 10(0.386%) out of 2588 genomes from the USA, 3(0.806%) from Asia, 1 (0.348%) from Europe and 1 (0.274%) from Oceania contained the missense mutations over the E-protein of SARS-CoV2 genomes. The C-terminus motif DLLV has been to DFLV and YLLV in the proteins from QJR88103 (Australia: Victoria) and QKI36831 (China: Guangzhou) respectively, which might affect the binding of this motif with the host protein PALS1.
截至 2020 年 6 月 6 日,在 NCBI 数据库中,全世界已公布的 SARS-CoV-2 完整基因组序列共有 3617 个。分析发现,在 3617 个 SARS-CoV-2 基因组中,有 15 个(0.414%)基因组的 E 蛋白在跨膜和 C 末端结构域存在多个非同义突变。更确切地说,在来自美国的 2588 个基因组中,有 10 个(0.386%)、来自亚洲的 3 个(0.806%)、来自欧洲的 1 个(0.348%)和来自大洋洲的 1 个(0.274%)的基因组存在 E 蛋白的错义突变。来自 QJR88103(澳大利亚:维多利亚州)和 QKI36831(中国:广州市)的蛋白中,E 蛋白的 C 末端基序 DLLV 分别突变为 DFLV 和 YLLV,这可能会影响该基序与宿主蛋白 PALS1 的结合。