Nasir Arshan, Caetano-Anollés Gustavo
Department of Crop Sciences, Evolutionary Bioinformatics Laboratory, University of Illinois at Urbana-ChampaignUrbana, IL, USA; Department of Biosciences, COMSATS Institute of Information TechnologyIslamabad, Pakistan.
Department of Crop Sciences, Evolutionary Bioinformatics Laboratory, University of Illinois at Urbana-Champaign Urbana, IL, USA.
Front Microbiol. 2017 Mar 10;8:380. doi: 10.3389/fmicb.2017.00380. eCollection 2017.
The viral supergroup includes the entire collection of known and unknown viruses that roam our planet and infect life forms. The supergroup is remarkably diverse both in its genetics and morphology and has historically remained difficult to study and classify. The accumulation of protein structure data in the past few years now provides an excellent opportunity to re-examine the classification and evolution of viruses. Here we scan completely sequenced viral proteomes from all genome types and identify protein folds involved in the formation of viral capsids and virion architectures. Viruses encoding similar capsid/coat related folds were pooled into lineages, after benchmarking against published literature. Remarkably, the exercise reproduced all previously described members of known structure-based viral lineages, along with several proposals for new additions, suggesting it could be a useful supplement to experimental approaches and to aid qualitative assessment of viral diversity in metagenome samples.
病毒超群包括在我们这个星球上传播并感染生命形式的所有已知和未知病毒。这个超群在遗传学和形态学上都极为多样,并且在历史上一直难以研究和分类。过去几年中蛋白质结构数据的积累现在为重新审视病毒的分类和进化提供了绝佳机会。在这里,我们扫描了来自所有基因组类型的完全测序的病毒蛋白质组,并识别出参与病毒衣壳和病毒体结构形成的蛋白质折叠。在对照已发表文献进行基准测试后,将编码相似衣壳/包膜相关折叠的病毒汇集到谱系中。值得注意的是,这项工作重现了所有先前描述的基于已知结构的病毒谱系成员,以及几个关于新增成员的提议,表明它可能是实验方法的有用补充,并有助于对宏基因组样本中的病毒多样性进行定性评估。