The Conway Institute of Biomolecular and Biomedical Research, University College Dublin, Dublin, Ireland.
PLoS One. 2010 Dec 29;5(12):e14454. doi: 10.1371/journal.pone.0014454.
More and more nucleotide sequences of type A influenza virus are available in public databases. Although these sequences have been the focus of many molecular epidemiological and phylogenetic analyses, most studies only deal with a few representative sequences. In this paper, we present a complete analysis of all Haemagglutinin (HA) and Neuraminidase (NA) gene sequences available to allow large scale analyses of the evolution and epidemiology of type A influenza.
METHODOLOGY/PRINCIPAL FINDINGS: This paper describes an analysis and complete classification of all HA and NA gene sequences available in public databases using multivariate and phylogenetic methods.
CONCLUSIONS/SIGNIFICANCE: We analyzed 18,975 HA sequences and divided them into 280 subgroups according to multivariate and phylogenetic analyses. Similarly, we divided 11,362 NA sequences into 202 subgroups. Compared to previous analyses, this work is more detailed and comprehensive, especially for the bigger datasets. Therefore, it can be used to show the full and complex phylogenetic diversity and provides a framework for studying the molecular evolution and epidemiology of type A influenza virus. For more than 85% of type A influenza HA and NA sequences into GenBank, they are categorized in one unambiguous and unique group. Therefore, our results are a kind of genetic and phylogenetic annotation for influenza HA and NA sequences. In addition, sequences of swine influenza viruses come from 56 HA and 45 NA subgroups. Most of these subgroups also include viruses from other hosts indicating cross species transmission of the viruses between pigs and other hosts. Furthermore, the phylogenetic diversity of swine influenza viruses from Eurasia is greater than that of North American strains and both of them are becoming more diverse. Apart from viruses from human, pigs, birds and horses, viruses from other species show very low phylogenetic diversity. This might indicate that viruses have not become established in these species. Based on current evidence, there is no simple pattern of inter-hemisphere transmission of avian influenza viruses and it appears to happen sporadically. However, for H6 subtype avian influenza viruses, such transmissions might have happened very frequently and multiple and bidirectional transmission events might exist.
越来越多的甲型流感病毒核苷酸序列可在公共数据库中获得。尽管这些序列一直是许多分子流行病学和系统发育分析的焦点,但大多数研究仅涉及少数有代表性的序列。在本文中,我们对所有可用的血凝素(HA)和神经氨酸酶(NA)基因序列进行了全面分析,以允许对甲型流感的进化和流行病学进行大规模分析。
方法/主要发现:本文描述了使用多元和系统发育方法对公共数据库中所有可用的 HA 和 NA 基因序列进行分析和完整分类。
结论/意义:我们分析了 18975 个 HA 序列,并根据多元和系统发育分析将它们分为 280 个亚组。同样,我们将 11362 个 NA 序列分为 202 个亚组。与以前的分析相比,这项工作更详细和全面,特别是对于更大的数据集。因此,它可用于展示完整而复杂的系统发育多样性,并为研究甲型流感病毒的分子进化和流行病学提供框架。对于超过 85%的甲型流感 HA 和 NA 序列进入 GenBank,它们被归入一个明确而独特的组中。因此,我们的结果是对流感 HA 和 NA 序列的一种遗传和系统发育注释。此外,来自 56 个 HA 和 45 个 NA 亚组的猪流感病毒序列。这些亚组中的大多数还包括来自其他宿主的病毒,表明病毒在猪和其他宿主之间发生了跨物种传播。此外,欧亚大陆猪流感病毒的系统发育多样性大于北美毒株,而且两者都变得更加多样化。除了来自人类、猪、鸟类和马的病毒外,来自其他物种的病毒显示出非常低的系统发育多样性。这可能表明病毒在这些物种中尚未建立。根据目前的证据,禽流感病毒在半球间的传播没有简单的模式,似乎是零星发生的。然而,对于 H6 亚型禽流感病毒,这种传播可能非常频繁,并且可能存在多次双向传播事件。