Poon Art F Y
Department of Pathology & Laboratory Medicine, Western University, Dental Sciences Building, Rm. 4044, London, Ontario N6A 5C1, Canada.
Department of Microbiology & Immunology, Western University, 1151 Richmond Street, London, Ontario N6A 3K7, Canada.
Virus Evol. 2024 Aug 17;10(1):veae064. doi: 10.1093/ve/veae064. eCollection 2024.
Hemagglutinin (HA) and neuraminidase (NA) proteins are the primary antigenic targets of influenza A virus (IAV) infections. IAV infections are generally classified into subtypes of HA and NA proteins, e.g. H3N2. Most of the known subtypes were originally defined by a lack of antibody cross-reactivity. However, genetic sequencing has played an increasingly important role in characterizing the evolving diversity of IAV. Novel subtypes have recently been described solely by their genetic sequences, and IAV infections are routinely subtyped by molecular assays, or the comparison of sequences to references. In this study, I carry out a comparative analysis of all available IAV protein sequences in the Genbank database (over 1.1 million, reduced to 272,292 unique sequences prior to phylogenetic reconstruction) to determine whether the serologically defined subtypes can be reproduced with sequence-based criteria. I show that a robust genetic taxonomy of HA and NA subtypes can be obtained using a simple clustering method, namely, by progressively partitioning the phylogeny on its longest internal branches. However, this taxonomy also requires some amendments to the current nomenclature. For example, two IAV isolates from bats previously characterized as a divergent lineage of H9N2 should be separated into their own subtype. With the exception of these small and highly divergent lineages, the phylogenies relating each of the other six genomic segments do not support partitions into major subtypes.
血凝素(HA)和神经氨酸酶(NA)蛋白是甲型流感病毒(IAV)感染的主要抗原靶点。IAV感染通常根据HA和NA蛋白的亚型进行分类,例如H3N2。大多数已知亚型最初是根据缺乏抗体交叉反应来定义的。然而,基因测序在表征IAV不断演变的多样性方面发挥着越来越重要的作用。最近,新型亚型仅通过其基因序列来描述,并且IAV感染通常通过分子检测或与参考序列的比较来进行亚型分类。在本研究中,我对Genbank数据库中所有可用的IAV蛋白序列进行了比较分析(超过110万条,在进行系统发育重建之前减少到272,292条独特序列),以确定基于血清学定义的亚型是否可以通过基于序列的标准重现。我表明,使用一种简单的聚类方法,即通过在其最长的内部分支上逐步划分系统发育树,可以获得HA和NA亚型的稳健遗传分类法。然而,这种分类法也需要对当前的命名法进行一些修正。例如,先前被鉴定为H9N2不同谱系的两种来自蝙蝠的IAV分离株应被分为它们自己的亚型。除了这些小的且高度分化的谱系外,与其他六个基因组片段中的每一个相关的系统发育树不支持划分为主要亚型。