Kelley James J, Grigoriev Andrey
Department of Biology, Center for Computational and Integrative Biology, Rutgers University, Camden, NJ 08102, USA.
Viruses. 2024 Dec 30;17(1):36. doi: 10.3390/v17010036.
The nucleocapsid (N) protein is the most expressed protein in later stages of SARS-CoV-2 infection with several important functions. It is translated from a subgenomic mRNA (sgmRNA) formed by template switching during transcription. A recently described translation initiation site (TIS) with a CTG codon in the leader sequence (TIS-L) is out of frame with most structural and accessory genes including the N gene and may act as a translation suppressor. We analyzed multiple sequenced samples infected by SARS-CoV-2 and found that any single variant of this virus produces multiple isoforms of the N sgmRNA. The main isoform starting at TIS-L is out of frame, but two secondary dominant isoforms (present in nearly all samples) were found to restore the reading frame and likely involved in the regulation of N protein production. Analysis of sequenced samples infected by other coronaviruses revealed that such isoforms are also produced in their transcriptomes. In SARS-CoV, they restore the reading frame for a putative TIS (also a CTG codon) in the same relative position as in SARS-CoV-2. Positions of junction breakpoints relative to stem loop 3 in the 5'-UTR suggest similar mechanisms in SARS-CoV, SARS-CoV-2, and OC43, but not in MERS-CoV. These observations may be pertinent for antisense-based antiviral strategies.
核衣壳(N)蛋白是严重急性呼吸综合征冠状病毒2(SARS-CoV-2)感染后期表达最多的蛋白,具有多种重要功能。它由转录过程中模板转换形成的亚基因组mRNA(sgmRNA)翻译而来。最近描述的位于前导序列(TIS-L)中带有CTG密码子的翻译起始位点(TIS)与包括N基因在内的大多数结构基因和辅助基因的读框不同,可能作为翻译抑制因子。我们分析了多个受SARS-CoV-2感染的测序样本,发现该病毒的任何单个变体都会产生多种N sgmRNA异构体。从TIS-L起始的主要异构体读框不同,但发现两种次要优势异构体(几乎存在于所有样本中)可恢复读框,并可能参与N蛋白产生的调控。对受其他冠状病毒感染的测序样本分析表明,其转录组中也会产生此类异构体。在严重急性呼吸综合征冠状病毒(SARS-CoV)中,它们可恢复与SARS-CoV-2中相同相对位置的一个假定TIS(也是CTG密码子)的读框。相对于5'-UTR中茎环3的连接断点位置表明,SARS-CoV、SARS-CoV-2和OC43具有相似机制,但中东呼吸综合征冠状病毒(MERS-CoV)并非如此。这些观察结果可能与基于反义的抗病毒策略相关。