Center for Systems Medicine, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences & Peking Union Medical College, Suzhou Institute of Systems Medicine, Suzhou, China.
School of Computer Science and Technology, Soochow University, Suzhou, China.
Mol Biol Evol. 2019 Jun 1;36(6):1172-1186. doi: 10.1093/molbev/msz050.
Seasonal influenza viruses undergo frequent mutations on their surface hemagglutinin (HA) proteins to escape the host immune response. In these mutations, a few key amino acid sites are associated with significant antigenic cluster transitions. To recognize the cluster-transition determining sites of seasonal influenza A/H3N2 and A/H1N1 viruses systematically and quickly, we developed a computational model named RECDS (recognition of cluster-transition determining sites) to evaluate the contribution of a specific amino acid site on the HA protein in the whole history of antigenic evolution. In RECDS, we ranked all of the HA sites by calculating the contribution scores derived from the forest of gradient boosting classifiers trained by various sequence- and structure-based features. With the RECDS model, we found out that the sites determining influenza antigenicity were mostly around the receptor-binding domain both for the influenza A/H3N2 and A/H1N1 viruses. Specifically, half of the cluster-transition determining sites of the influenza A/H1N1 virus were located in the vestigial esterase domain and basic path area on the HA, which indicated that the differential driving force of the antigenic evolution of the A/H1N1 virus refers to the A/H3N2 virus. Beyond that, the footprints of substitutions responsible for antigenic evolution were inferred according to the phylogenetic trees for the cluster-transition determining sites. The monitoring of genetic variation occurring at these cluster-transition determining sites in circulating influenza viruses on a large scale will potentially reduce current assay workloads in influenza surveillance and the selection of new influenza vaccine strains.
季节性流感病毒在其表面血凝素 (HA) 蛋白上经常发生突变,以逃避宿主的免疫反应。在这些突变中,少数关键氨基酸位点与显著的抗原簇转变有关。为了系统而快速地识别季节性甲型流感 A/H3N2 和 A/H1N1 病毒的簇转变决定位点,我们开发了一种名为 RECDS(识别簇转变决定位点)的计算模型,用于评估 HA 蛋白上特定氨基酸位点在整个抗原进化历史中的贡献。在 RECDS 中,我们通过计算基于梯度提升分类器森林的贡献得分来对所有 HA 位点进行排序,这些分类器是由各种序列和结构特征训练得到的。使用 RECDS 模型,我们发现决定流感抗原性的位点主要位于流感 A/H3N2 和 A/H1N1 病毒的受体结合域周围。具体来说,流感 A/H1N1 病毒的一半簇转变决定位点位于 HA 的残余酯酶域和碱性路径区域,这表明 A/H1N1 病毒的抗原进化的差异驱动力与 A/H3N2 病毒有关。除此之外,根据簇转变决定位点的系统发育树推断了导致抗原进化的取代足迹。对流行流感病毒中这些簇转变决定位点发生的遗传变异进行大规模监测,可能会降低流感监测中当前检测工作量,并选择新的流感疫苗株。