Hulo Chantal, Masson Patrick, de Castro Edouard, Auchincloss Andrea H, Foulger Rebecca, Poux Sylvain, Lomax Jane, Bougueleret Lydie, Xenarios Ioannis, Le Mercier Philippe
SIB Swiss Institute of Bioinformatics, CMU, University of Geneva Medical School, Geneva, Switzerland.
European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, United Kingdom.
PLoS One. 2017 Feb 16;12(2):e0171746. doi: 10.1371/journal.pone.0171746. eCollection 2017.
Viruses are genetically diverse, infect a wide range of tissues and host cells, and follow unique processes for replicating themselves. All these processes were investigated and indexed in the ViralZone knowledge base. To facilitate data standardization, a simple ontology of viral life-cycle terms was developed to provide a common vocabulary for annotating data sets. New terminology was developed to address unique viral replication cycle processes, and existing terminology was modified and adapted. The virus life-cycle is classically described by schematic pictures. Using this ontology, it can instead be represented by a combination of successive terms: "entry", "latency", "transcription", "replication" and "exit". Each of these parts is broken down into discrete steps. For example, Zika virus "entry" is broken down into the successive steps "Attachment", "Apoptotic mimicry", "Viral endocytosis/macropinocytosis", "Fusion with host endosomal membrane" and "Viral factory". To demonstrate the utility of a standard ontology for virus biology, this work was completed by annotating virus data in the ViralZone, UniProtKB and Gene Ontology databases.
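The two-level representation described in the abstract (ordered top-level terms, each broken into ordered steps) can be illustrated with a minimal Python sketch. The `LifeCycle` class and its methods are hypothetical names chosen for this illustration and do not reflect the actual ViralZone, UniProtKB or Gene Ontology data formats; only the term and step names are taken from the abstract.

```python
from __future__ import annotations

from dataclasses import dataclass, field

# The five top-level life-cycle terms named in the abstract, in order.
PHASES = ("entry", "latency", "transcription", "replication", "exit")


@dataclass
class LifeCycle:
    """Hypothetical container: a virus name plus ordered steps per phase."""

    virus: str
    steps: dict[str, list[str]] = field(default_factory=dict)

    def add_phase(self, phase: str, steps: list[str]) -> None:
        # Reject anything outside the five top-level terms.
        if phase not in PHASES:
            raise ValueError(f"unknown phase: {phase!r}")
        self.steps[phase] = list(steps)

    def ordered_terms(self) -> list[tuple[str, str]]:
        """Return (phase, step) pairs in life-cycle order."""
        return [(p, s) for p in PHASES for s in self.steps.get(p, [])]


# Zika virus "entry" steps, exactly as listed in the abstract.
zika = LifeCycle("Zika virus")
zika.add_phase(
    "entry",
    [
        "Attachment",
        "Apoptotic mimicry",
        "Viral endocytosis/macropinocytosis",
        "Fusion with host endosomal membrane",
        "Viral factory",
    ],
)

for phase, step in zika.ordered_terms():
    print(f"{phase}: {step}")
```

Keeping the phases in a fixed tuple means the life cycle is always read out in the same order regardless of the order in which phases were annotated; phases with no annotated steps are simply skipped.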