Truong Nguyen Phuoc, Garcia-Vallvé Santiago, Puigbò Pere
Department of Biology, University of Turku, 20500 Turku, Finland.
Department of Virology, Faculty of Medicine, University of Helsinki, 00290 Helsinki, Finland.
Life (Basel). 2021 May 14;11(5):442. doi: 10.3390/life11050442.
Early characterization of emerging viruses is essential to control their spread, such as the Zika Virus outbreak in 2014. Among other non-viral factors, host information is essential for the surveillance and control of virus spread. Flaviviruses (genus ), akin to other viruses, are modulated by high mutation rates and selective forces to adapt their codon usage to that of their hosts. However, a major challenge is the identification of potential hosts for novel viruses. Usually, potential hosts of emerging zoonotic viruses are identified after several confirmed cases. This is inefficient for deterring future outbreaks. In this paper, we introduce an algorithm to identify the host range of a virus from its raw genome sequences. The proposed strategy relies on comparing codon usage frequencies across viruses and hosts, by means of a normalized Codon Adaptation Index (CAI). We have tested our algorithm on 94 flaviviruses and 16 potential hosts. This novel method is able to distinguish between arthropod and vertebrate hosts for several flaviviruses with high values of accuracy (virus group 91.9% and host type 86.1%) and specificity (virus group 94.9% and host type 79.6%), in comparison to empirical observations. Overall, this algorithm may be useful as a complementary tool to current phylogenetic methods in monitoring current and future viral outbreaks by understanding host-virus relationships.
对新出现病毒进行早期特征描述对于控制其传播至关重要,例如2014年的寨卡病毒疫情。在其他非病毒因素中,宿主信息对于监测和控制病毒传播至关重要。黄病毒属与其他病毒一样,受到高突变率和选择压力的调节,以使其密码子使用适应宿主的密码子使用。然而,一个主要挑战是确定新型病毒的潜在宿主。通常,新发人畜共患病毒的潜在宿主是在几例确诊病例之后才被确定的。这对于阻止未来的疫情爆发效率低下。在本文中,我们介绍了一种从病毒原始基因组序列中识别病毒宿主范围的算法。所提出的策略依赖于通过标准化密码子适应指数(CAI)比较病毒和宿主之间的密码子使用频率。我们在94种黄病毒和16种潜在宿主上测试了我们的算法。与实证观察结果相比,这种新方法能够以较高的准确性(病毒组为91.9%,宿主类型为86.1%)和特异性(病毒组为94.9%,宿主类型为79.6%)区分几种黄病毒的节肢动物宿主和脊椎动物宿主。总体而言,通过理解宿主-病毒关系,该算法作为当前系统发育方法的补充工具,在监测当前和未来的病毒疫情方面可能会有所帮助。