Westergren Jakobsson Amanda, Segerman Bo, Wallerman Ola, Lind Sara Bergström, Zhao Hongxing, Rubin Carl-Johan, Pettersson Ulf, Akusjärvi Göran
Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden.
Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden.
J Virol. 2021 Feb 15;95(4). doi: 10.1128/JVI.01869-20. Epub 2020 Nov 25.
We have used the Nanopore long-read sequencing platform to demonstrate how amazingly complex the human adenovirus type 2 (Ad2) transcriptome is with a flexible splicing machinery producing a range of novel mRNAs both from the early and late transcription units. In total we report more than 900 alternatively spliced mRNAs produced from the Ad2 transcriptome whereof more than 850 are novel mRNAs. A surprising finding was that more than 50% of all E1A transcripts extended upstream of the previously defined transcriptional start site. The novel start sites mapped close to the inverted terminal repeat (ITR) and within the E1A enhancer region. We speculate that novel promoters or enhancer driven transcription, so-called eRNA transcription, is responsible for producing these novel mRNAs. Their existence was verified by a peptide in the Ad2 proteome that was unique for the E1A ITR mRNA. Although we show a high complexity of alternative splicing from most early and late regions, the E3 region was by far the most complex when expressed at late times of infection. More than 400 alternatively spliced mRNAs were observed in this region alone. These mRNAs included extended L4 mRNAs containing E3 and L5 sequences and readthrough mRNAs combining E3 and L5 sequences. Our findings demonstrate that the virus has a remarkable capacity to produce novel exon combinations, which will offer the virus an evolutionary advantage to change the gene expression repertoire and protein production in an evolving environment. Work in the adenovirus system led to the groundbreaking discovery of RNA splicing and alternative RNA splicing in 1977. These mechanisms are essential in mammalian evolution by increasing the coding capacity of a genome. Here, we have used a long-read sequencing technology to characterize the complexity of human adenovirus pre-mRNA splicing in detail. It is mindboggling that the viral genome, which only houses around 36,000 bp, not being much larger than a single cellular gene, generates more than 900 alternatively spliced mRNAs. Recently, adenoviruses have been used as the backbone in several promising SARS-CoV-2 vaccines. Further improvement of adenovirus-based vaccines demands that the virus can be tamed into an innocent carrier of foreign genes. This requires a full understanding of the components that govern adenovirus replication and gene expression.
我们利用纳米孔长读长测序平台,展示了人类2型腺病毒(Ad2)转录组惊人的复杂性,其灵活的剪接机制可从早期和晚期转录单元产生一系列新的mRNA。我们总共报告了Ad2转录组产生的900多种可变剪接mRNA,其中850多种是新的mRNA。一个惊人的发现是,所有E1A转录本中有超过50%延伸到先前定义的转录起始位点上游。新的起始位点定位于靠近反向末端重复序列(ITR)以及E1A增强子区域内。我们推测,新的启动子或增强子驱动的转录,即所谓的eRNA转录,负责产生这些新的mRNA。Ad2蛋白质组中的一种肽对E1A ITR mRNA具有独特性,从而证实了它们的存在。尽管我们显示大多数早期和晚期区域的可变剪接具有高度复杂性,但在感染后期表达时,E3区域是迄今为止最复杂的。仅在该区域就观察到400多种可变剪接mRNA。这些mRNA包括含有E3和L5序列的延伸L4 mRNA以及结合E3和L5序列的通读mRNA。我们的发现表明,该病毒具有产生新外显子组合的显著能力,这将为病毒在不断演变的环境中改变基因表达谱和蛋白质产生提供进化优势。腺病毒系统的研究在1977年导致了RNA剪接和可变RNA剪接的开创性发现。这些机制通过增加基因组的编码能力在哺乳动物进化中至关重要。在这里,我们使用长读长测序技术详细表征了人类腺病毒前体mRNA剪接的复杂性。令人难以置信的是,仅包含约36,000 bp、比单个细胞基因大不了多少的病毒基因组,却能产生900多种可变剪接mRNA。最近,腺病毒已被用作几种有前景的SARS-CoV-2疫苗的骨架。基于腺病毒的疫苗的进一步改进要求将病毒驯化为无害的外源基因载体。这需要全面了解控制腺病毒复制和基因表达的成分。