Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi, India.
Brief Funct Genomics. 2024 May 15;23(3):256-264. doi: 10.1093/bfgp/elad027.
We identified 11 conserved stretches in over 6.3 million SARS-CoV-2 genomes including all the major variants of concerns. Each conserved stretch is ≥100 nucleotides in length with ≥99.9% conservation at each nucleotide position. Interestingly, six of the eight conserved stretches in ORF1ab overlapped significantly with well-folded experimentally verified RNA secondary structures. Furthermore, two of the conserved stretches were mapped to regions within the S2-subunit that undergo dynamic structural rearrangements during viral fusion. In addition, the conserved stretches were significantly depleted for zinc-finger antiviral protein (ZAP) binding sites, which facilitated the recognition and degradation of viral RNA. These highly conserved stretches in the SARS-CoV-2 genome were poorly conserved at the nucleotide level among closely related β-coronaviruses, thus representing ideal targets for highly specific and discriminatory diagnostic assays. Our findings highlight the role of structural constraints at both RNA and protein levels that contribute to the sequence conservation of specific genomic regions in SARS-CoV-2.
我们在超过 630 万份 SARS-CoV-2 基因组中鉴定出 11 个保守区域,其中包括所有主要关注的变体。每个保守区域的长度都≥100 个核苷酸,每个核苷酸位置的保守率都≥99.9%。有趣的是,ORF1ab 中的 8 个保守区域中有 6 个与经过充分折叠的实验验证的 RNA 二级结构显著重叠。此外,两个保守区域被映射到 S2 亚基的区域内,该区域在病毒融合过程中经历动态结构重排。此外,保守区域显著缺乏锌指抗病毒蛋白(ZAP)结合位点,这有助于病毒 RNA 的识别和降解。SARS-CoV-2 基因组中的这些高度保守区域在核苷酸水平上在密切相关的β冠状病毒中没有很好的保守,因此代表了高度特异性和有区别的诊断检测的理想靶标。我们的研究结果强调了在 RNA 和蛋白质水平上结构限制的作用,这些结构限制有助于 SARS-CoV-2 特定基因组区域的序列保守性。