Institute of Virology, Charité-Universitätsmedizin Berlin, Corporate Member of Free University, Humboldt-University and Berlin Institute of Health, Berlin, Germany.
Center for Molecular Biodiversity Research, Zoological Research Museum Alexander Koenig, Bonn, Germany.
PLoS Pathog. 2019 Dec 12;15(12):e1008224. doi: 10.1371/journal.ppat.1008224. eCollection 2019 Dec.
The spectrum of viruses in insects is important for subjects as diverse as public health, veterinary medicine, food production, and biodiversity conservation. The traditional interest in vector-borne diseases of humans and livestock has drawn the attention of virus studies to hematophagous insect species. However, these represent only a tiny fraction of the broad diversity of Hexapoda, the most speciose group of animals. Here, we systematically probed the diversity of negative strand RNA viruses in the largest and most representative collection of insect transcriptomes from samples representing all 34 extant orders of Hexapoda and 3 orders of Entognatha, as well as outgroups, altogether representing 1243 species. Based on profile hidden Markov models we detected 488 viral RNA-directed RNA polymerase (RdRp) sequences with similarity to negative strand RNA viruses. These were identified in members of 324 arthropod species. Selection for length, quality, and uniqueness left 234 sequences for analyses, showing similarity to genomes of viruses classified in Bunyavirales (n = 86), Articulavirales (n = 54), and several orders within Haploviricotina (n = 94). Coding-complete genomes or nearly-complete subgenomic assemblies were obtained in 61 cases. Based on phylogenetic topology and the availability of coding-complete genomes we estimate that at least 20 novel viral genera in seven families need to be defined, only two of them monospecific. Seven additional viral clades emerge when adding sequences from the present study to formerly monospecific lineages, potentially requiring up to seven additional genera. One long sequence may indicate a novel family. For segmented viruses, cophylogenies between genome segments were generally improved by the inclusion of viruses from the present study, suggesting that in silico misassembly of segmented genomes is rare or absent. Contrary to previous assessments, significant virus-host codivergence was identified in major phylogenetic lineages based on two different approaches of codivergence analysis in a hypotheses testing framework. In spite of these additions to the known spectrum of viruses in insects, we caution that basing taxonomic decisions on genome information alone is challenging due to technical uncertainties, such as the inability to prove integrity of complete genome assemblies of segmented viruses.
昆虫病毒的多样性对于公共卫生、兽医、食品生产和生物多样性保护等多个领域都非常重要。人们对人类和牲畜的媒介传播疾病的传统关注,使病毒研究的注意力集中在吸血昆虫物种上。然而,这些只是节肢动物门(Hexapoda)这一动物多样性最丰富的群体中极为微小的一部分。在这里,我们系统地探测了来自所有 34 个现存节肢动物目和 3 个内生节肢动物目的样本中最大和最具代表性的昆虫转录组中负链 RNA 病毒的多样性,以及外群,共代表了 1243 个物种。基于剖面隐马尔可夫模型,我们检测到了 488 种与负链 RNA 病毒具有相似性的病毒 RNA 依赖性 RNA 聚合酶(RdRp)序列。这些在 324 种节肢动物物种的成员中被识别出来。为了长度、质量和独特性而进行的选择,留下了 234 个用于分析的序列,这些序列与属于 Bunyavirales(n = 86)、Articulavirales(n = 54)和 Haploviricotina 几个目中的病毒基因组具有相似性。在 61 个案例中获得了编码完整基因组或近乎完整的亚基因组组装。基于系统发育拓扑结构和编码完整基因组的可用性,我们估计至少需要定义 7 个科中的 20 个新病毒属,其中只有 2 个是单种属。当将本研究中的序列添加到以前的单种谱系中时,会出现 7 个额外的病毒分支,可能需要多达 7 个额外的属。一个长序列可能表示一个新的家族。对于分段病毒,基因组片段之间的共进化分析通常通过包含本研究中的病毒而得到改善,这表明在分段基因组的计算机错误组装是罕见的或不存在的。与之前的评估相反,根据两种不同的共进化分析方法在假设检验框架中的应用,在主要的系统发育谱系中发现了显著的病毒-宿主共进化。尽管对昆虫中的病毒谱进行了这些补充,但我们警告说,仅基于基因组信息做出分类决策具有挑战性,因为存在技术不确定性,例如无法证明完整的分段病毒基因组组装的完整性。