Institute for Experimental Virology, TWINCORE Centre for Experimental and Clinical Infection Research, a joint venture between the Hannover Medical School (MHH) and the Helmholtz Centre for Infection Research (HZI), Hannover, Germany.
Cluster of Excellence 2155 RESIST, Hannover, Germany.
PLoS Pathog. 2024 Apr 22;20(4):e1012163. doi: 10.1371/journal.ppat.1012163. eCollection 2024 Apr.
Virus discovery by genomics and metagenomics empowered studies of viromes, facilitated characterization of pathogen epidemiology, and redefined our understanding of the natural genetic diversity of viruses with profound functional and structural implications. Here we employed a data-driven virus discovery approach that directly queries unprocessed sequencing data in a highly parallelized way and involves a targeted viral genome assembly strategy in a wide range of sequence similarity. By screening more than 269,000 datasets of numerous authors from the Sequence Read Archive and using two metrics that quantitatively assess assembly quality, we discovered 40 nidoviruses from six virus families whose members infect vertebrate hosts. They form 13 and 32 putative viral subfamilies and genera, respectively, and include 11 coronaviruses with bisegmented genomes from fishes and amphibians, a giant 36.1 kilobase coronavirus genome with a duplicated spike glycoprotein (S) gene, 11 tobaniviruses and 17 additional corona-, arteri-, cremega-, nanhypo- and nangoshaviruses. Genome segmentation emerged in a single evolutionary event in the monophyletic lineage encompassing the subfamily Pitovirinae. We recovered the bisegmented genome sequences of two coronaviruses from RNA samples of 69 infected fishes and validated the presence of poly(A) tails at both segments using 3'RACE PCR and subsequent Sanger sequencing. We report a genetic linkage between accessory and structural proteins whose phylogenetic relationships and evolutionary distances are incongruent with the phylogeny of replicase proteins. We rationalize these observations in a model of inter-family S recombination involving at least five ancestral corona- and tobaniviruses of aquatic hosts. In support of this model, we describe an individual fish co-infected with members from the families Coronaviridae and Tobaniviridae. Our results expand the scale of the known extraordinary evolutionary plasticity in nidoviral genome architecture and call for revisiting fundamentals of genome expression, virus particle biology, host range and ecology of vertebrate nidoviruses.
通过基因组学和宏基因组学进行病毒发现,促进了病毒组学的研究,有助于描述病原体流行病学特征,并重新定义了我们对具有深远功能和结构意义的病毒自然遗传多样性的理解。在这里,我们采用了一种数据驱动的病毒发现方法,该方法直接以高度并行的方式查询未处理的测序数据,并在广泛的序列相似性范围内采用靶向病毒基因组组装策略。通过筛选来自 Sequence Read Archive 的 269000 多个数据集,我们使用了两种定量评估组装质量的指标,发现了来自六个病毒科的 40 种正粘病毒,它们的成员感染脊椎动物宿主。它们分别形成 13 个和 32 个假定的病毒亚科和属,包括来自鱼类和两栖类的具有双节段基因组的 11 种冠状病毒、一种具有重复刺突糖蛋白(S)基因的巨大 36.1kb 冠状病毒基因组、11 种托巴病毒和 17 种其他冠状病毒、动脉病毒、Cremegavirus、Nanovirus 和 Nanogavirus。基因组分段在单系谱系中单一进化事件中出现,该谱系包含亚科 Pitovirinae。我们从 69 种感染鱼类的 RNA 样本中回收了两种冠状病毒的双节段基因组序列,并使用 3'RACE PCR 和随后的 Sanger 测序验证了两个节段都存在 poly(A)尾巴。我们报告了辅助蛋白和结构蛋白之间的遗传联系,其系统发育关系和进化距离与复制酶蛋白的系统发育不一致。我们在一个涉及至少五种水生宿主的冠状病毒和托巴病毒祖先的家族间 S 重组模型中对这些观察结果进行了合理化。支持该模型,我们描述了一个个体鱼类同时感染冠状病毒科和托巴病毒科成员。我们的研究结果扩展了已知正粘病毒基因组结构的非凡进化可塑性的规模,并呼吁重新审视脊椎动物正粘病毒的基因组表达、病毒粒子生物学、宿主范围和生态学的基本原理。
J Gen Virol. 2015-8
Virus Res. 2006-4
PLoS Pathog. 2018-11-1
Nature. 2018-4-4
Proc Natl Acad Sci U S A. 2025-3-18
Proc Natl Acad Sci U S A. 2025-2-18
Virus Evol. 2025-1-24
Nucleic Acids Res. 2022-10-28
Biomolecules. 2022-8-4