Suardana Ida B K, Mahardika Bayu K, Pharmawati Made, Sudipa Putu H, Sari Tri K, Mahendra Nyoman B, Mahardika Gusti N
Virology Laboratory, The Faculty of Veterinary Medicine, Udayana University, Denpasar, Bali, Indonesia.
The Animal Biomedical and Molecular Biology Laboratory, Udayana University, Jl. Sesetan-Markisa 6A, Denpasar 80223, Bali, Indonesia.
Adv Virol. 2023 Mar 9;2023:6476626. doi: 10.1155/2023/6476626. eCollection 2023.
Since its discovery at the end of 2019, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has rapidly evolved into many variants, including the subvariant BA.2 and the GKA clade. Genomic clarification is needed for better management of the current pandemic as well as the possible reemergence of novel variants. The sequence of the reference genome Wuhan-Hu-1 and approximately 20 representatives of each variant were downloaded from GenBank and GISAID. Two representatives with no track of in-definitive nucleotides were selected. The sequences were aligned using muscle. The location of insertion/deletion (indel) in the genome was mapped following the open reading frame (ORF) of Wuhan-Hu-1. The phylogeny of the spike protein coding region was constructed using the maximum likelihood method. Amino acid substitutions in all ORFs were analyzed separately. There are two indel sites in ORF1AB, eight in spike, and one each in ORF3A, matrix (MA), nucleoprotein (NP), and the 3'-untranslated regions (3'UTR). Some indel sites and residues/substitutions are not unique, and some are variant-specific. The phylogeny shows that Omicron, Deltacron, and BA2 are clustered together and separated from other variants with 100% bootstrap support. In conclusion, whole-genome comparison of representatives of all variants revealed indel patterns that are specific to SARS-CoV-2 variants or subvariants. Polymorphic amino acid comparison across all coding regions also showed amino acid residues shared by specific groups of variants. Finally, the higher transmissibility of BA.2 might be due at least in part to the 48 nucleotide deletions in the 3'UTR, while the seem-to-be extinction of GKA clade is due to the lack of genetic advantages as a consequence of amino acid substitutions in various genes.
自2019年底被发现以来,严重急性呼吸综合征冠状病毒2(SARS-CoV-2)迅速演变成许多变种,包括亚变种BA.2和GKA分支。为了更好地应对当前的疫情以及可能出现的新型变种,需要进行基因组澄清。参考基因组武汉-胡-1的序列以及每个变种的约20个代表序列从GenBank和GISAID下载。选择了两个没有不确定核苷酸痕迹的代表序列。使用muscle对序列进行比对。根据武汉-胡-1的开放阅读框(ORF)绘制基因组中插入/缺失(indel)的位置。使用最大似然法构建刺突蛋白编码区的系统发育树。分别分析所有ORF中的氨基酸替换。在ORF1AB中有两个indel位点,刺突蛋白中有八个,ORF3A、基质蛋白(MA)、核蛋白(NP)和3'非翻译区(3'UTR)中各有一个。一些indel位点和残基/替换并非独特,有些是变种特异性的。系统发育树显示,奥密克戎、德尔塔克戎和BA2聚集在一起,并在100%的自展支持下与其他变种分离。总之,对所有变种代表的全基因组比较揭示了SARS-CoV-2变种或亚变种特有的indel模式。对所有编码区的多态性氨基酸比较也显示了特定变种组共有的氨基酸残基。最后,BA.2较高的传播性可能至少部分归因于3'UTR中的48个核苷酸缺失,而GKA分支似乎灭绝是由于各种基因中氨基酸替换导致缺乏遗传优势。