病毒分类学的生物信息学:基于序列的层次分类开发的基础与工具
Bioinformatics of virus taxonomy: foundations and tools for developing sequence-based hierarchical classification.
作者信息
Gorbalenya Alexander E, Lauber Chris
机构信息
Department of Medical Microbiology, Leiden University Medical Center, Leiden, The Netherlands; Faculty of Bioengineering and Bioinformatics and Belozersky, Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119899, Moscow, Russia.
Institute for Experimental Virology, TWINCORE Centre for Experimental and Clinical Infection Research, A Joint Venture between the Hannover Medical School (MHH) and the Helmholtz Centre for Infection Research (HZI), Hannover, Germany.
出版信息
Curr Opin Virol. 2022 Feb;52:48-56. doi: 10.1016/j.coviro.2021.11.003. Epub 2021 Dec 6.
The genome sequence is the only characteristic readily obtainable for all known viruses, underlying the growing role of comparative genomics in organizing knowledge about viruses in a systematic evolution-aware way, known as virus taxonomy. Overseen by the International Committee on Taxonomy of Viruses (ICTV), development of virus taxonomy involves taxa demarcation at 15 ranks of a hierarchical classification, often in host-specific manner. Outside the ICTV remit, researchers assess fitting numerous unclassified viruses into the established taxa. They employ different metrics of virus clustering, basing on conserved domain(s), separation of viruses in rooted phylogenetic trees and pair-wise distance space. Computational approaches differ further in respect to methodology, number of ranks considered, sensitivity to uneven virus sampling, and visualization of results. Advancing and using computational tools will be critical for improving taxa demarcation across the virosphere and resolving rank origins in research that may also inform experimental virology.
基因组序列是所有已知病毒唯一容易获得的特征,这使得比较基因组学在以系统的、具有进化意识的方式(即病毒分类学)组织病毒知识方面发挥着越来越重要的作用。在国际病毒分类委员会(ICTV)的监督下,病毒分类学的发展涉及到一个具有15个等级的层次分类中的分类单元划分,通常是以宿主特异性的方式进行。在ICTV的职责范围之外,研究人员评估如何将众多未分类的病毒归入已建立的分类单元。他们采用不同的病毒聚类指标,这些指标基于保守结构域、根系统发育树中病毒的分离以及成对距离空间。计算方法在方法、考虑的等级数量、对不均匀病毒采样的敏感性以及结果可视化方面进一步有所不同。改进和使用计算工具对于改善整个病毒圈的分类单元划分以及解决研究中的等级起源至关重要,这也可能为实验病毒学提供信息。