Aix Marseille University, CNRS, IGS, Information Génomique & Structurale (UMR7256), Institut de Microbiologie de la Méditerranée (FR 3489), Marseille, France
Aix Marseille University, CNRS, IGS, Information Génomique & Structurale (UMR7256), Institut de Microbiologie de la Méditerranée (FR 3489), Marseille, France.
J Virol. 2019 Nov 13;93(23). doi: 10.1128/JVI.01206-19. Print 2019 Dec 1.
is a rapidly growing family of giant viruses, all of which have been isolated using laboratory strains of The genomes of 10 distinct strains have been fully characterized, reaching up to 2.5 Mb in size. These double-stranded DNA genomes encode the largest of all known viral proteomes and are propagated in oblate virions that are among the largest ever described (1.2 μm long and 0.5 μm wide). The evolutionary origin of these atypical viruses is the object of numerous speculations. Applying the chaos game representation to the pandoravirus genome sequences, we discovered that the tetranucleotide (4-mer) "AGCT" is totally absent from the genomes of 2 strains ( and ) and strongly underrepresented in others. Given the amazingly low probability of such an observation in the corresponding randomized sequences, we investigated its biological significance through a comprehensive study of the 4-mer compositions of all viral genomes. Our results indicate that AGCT was specifically eliminated during the evolution of the and that none of the previously proposed host-virus antagonistic relationships could explain this phenomenon. Unlike the three other families of giant viruses (, , and ) infecting the same host, the pandoraviruses exhibit a puzzling genomic anomaly suggesting a highly specific DNA editing in response to a new kind of strong evolutionary pressure. Recent years have seen the discovery of several families of giant DNA viruses infecting the ubiquitous amoebozoa of the genus With double-stranded DNA (dsDNA) genomes reaching 2.5 Mb in length packaged in oblate particles the size of a bacterium, the pandoraviruses are currently the most complex and largest viruses known. In addition to their spectacular dimensions, the pandoraviruses encode the largest proportion of proteins without homologs in other organisms, which is thought to result from a gene creation process. While using comparative genomics to investigate the evolutionary forces responsible for the emergence of such an unusual giant virus family, we discovered a unique bias in the tetranucleotide composition of the pandoravirus genomes that can result only from an undescribed evolutionary process not encountered in any other microorganism.
是一类快速增长的巨型病毒家族,所有这些病毒都是使用实验室培养的 菌株分离得到的。已经对 10 种不同菌株的基因组进行了全面特征描述,其大小可达 2.5Mb。这些双链 DNA 基因组编码了所有已知病毒中最大的病毒蛋白组,并且在扁球体病毒中进行繁殖,这些病毒是迄今描述过的最大病毒之一(长 1.2μm,宽 0.5μm)。这些非典型病毒的进化起源是许多推测的对象。我们应用混沌游戏表示法对潘多拉病毒基因组序列进行分析,发现四核苷酸(4 -mer)“AGCT”在 2 株( 和 )的基因组中完全缺失,而在其他株中则严重缺乏。考虑到在相应的随机序列中观察到这种情况的可能性非常低,我们通过对所有病毒基因组的 4-mer 组成进行全面研究来研究其生物学意义。我们的结果表明,AGCT 在 和 的进化过程中被特异性消除,以前提出的任何宿主-病毒拮抗关系都无法解释这一现象。与感染相同 宿主的其他三种巨型病毒家族(、、和)不同,潘多拉病毒表现出一种令人费解的基因组异常,这表明在新的强烈进化压力下,发生了高度特异性的 DNA 编辑。近年来,发现了几大家族的巨型 DNA 病毒感染普遍存在的变形虫属的 amoebozoa。这些病毒的双链 DNA(dsDNA)基因组长达 2.5Mb,包装在细菌大小的扁球体颗粒中,潘多拉病毒是目前已知的最复杂和最大的病毒。除了其壮观的尺寸外,潘多拉病毒编码的蛋白质比例最大,没有其他生物体的同源物,这被认为是由于基因创造过程所致。在使用比较基因组学研究导致出现这种异常巨型病毒家族的进化力量时,我们发现潘多拉病毒基因组中的四核苷酸组成存在独特的偏差,这种偏差只能来自于在其他微生物中未遇到的未描述的进化过程。