Shan Ke-Jia, Wei Changshuo, Wang Yu, Huan Qing, Qian Wenfeng
State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China.
University of Chinese Academy of Sciences, Beijing 100049, China.
Innovation (Camb). 2021 Nov 28;2(4):100159. doi: 10.1016/j.xinn.2021.100159. Epub 2021 Aug 30.
The capacity of RNA viruses to adapt to new hosts and rapidly escape the host immune system is largely attributable to genetic diversity that emerges through mutations in RNA. Although the molecular spectrum of mutations-the relative rates at which various base substitutions occur-are widely recognized as informative toward understanding the evolution of a viral genome, little attention has been paid to the possibility of using molecular spectra to infer the host origins of a virus. Here, we characterize the molecular spectrum of mutations for SARS-CoV-2 from transcriptomic data obtained from virus-infected cell lines, enabled by the use of sporadic junctions formed during discontinuous transcription as molecular barcodes. We find that mutations are generated in a replication-independent manner, typically on the genomic strand, and highly dependent on mutagenic mechanisms specific to the host cellular environment. mutations will then strongly influence the types of base substitutions accumulated during SARS-CoV-2 evolution, in an asymmetric manner favoring specific mutation types. Consequently, similarities between the mutation spectra of SARS-CoV-2 and the bat coronavirus RaTG13, which have accumulated since their divergence strongly suggest that SARS-CoV-2 evolved in a host cellular environment highly similar to that of bats before its zoonotic transfer into humans. Collectively, our findings provide data-driven support for the natural origin of SARS-CoV-2.
RNA病毒适应新宿主并迅速逃避免疫系统的能力很大程度上归因于RNA突变产生的遗传多样性。尽管突变的分子谱——各种碱基替换发生的相对速率——被广泛认为对理解病毒基因组的进化具有参考价值,但利用分子谱推断病毒宿主起源的可能性却很少受到关注。在这里,我们通过使用在不连续转录过程中形成的散在连接作为分子条形码,从病毒感染细胞系获得的转录组数据中表征了SARS-CoV-2的突变分子谱。我们发现,突变以与复制无关的方式产生,通常发生在基因组链上,并且高度依赖于宿主细胞环境特有的诱变机制。这些突变随后将以不对称的方式强烈影响SARS-CoV-2进化过程中积累的碱基替换类型,有利于特定的突变类型。因此,自SARS-CoV-2与蝙蝠冠状病毒RaTG13分化以来积累的突变谱之间的相似性强烈表明,SARS-CoV-2在人畜共患转移到人类之前,是在与蝙蝠高度相似的宿主细胞环境中进化的。总的来说,我们的研究结果为SARS-CoV-2的自然起源提供了数据驱动的支持。