School of Computer Science and Engineering, Nanyang Technological University, 50 Nanyang Avenue, Singapore 639798, Singapore.
J Bioinform Comput Biol. 2020 Feb;18(1):2040002. doi: 10.1142/S0219720020400028.
Influenza viruses are persistently threatening public health, causing annual epidemics and sporadic pandemics due to rapid viral evolution. Vaccines are used to prevent influenza infections but the composition of the influenza vaccines have to be updated regularly to ensure its efficacy. Computational tools and analyses have become increasingly important in guiding the process of vaccine selection. By constructing time-series training samples with splittings and embeddings, we develop a computational method for predicting suitable strains as the recommendation of the influenza vaccines using recurrent neural networks (RNNs). The Encoder-decoder architecture of RNN model enables us to perform sequence-to-sequence prediction. We employ this model to predict the prevalent sequence of the H3N2 viruses sampled from 2006 to 2017. The identity between our predicted sequence and recommended vaccines is greater than 98% and the indicates their antigenic similarity. The multi-step vaccine prediction further demonstrates the robustness of our method which achieves comparable results in contrast to single step prediction. The results show significant matches of the recommended vaccine strains to the circulating strains. We believe it would facilitate the process of vaccine selection and surveillance of seasonal influenza epidemics.
流感病毒持续威胁着公众健康,由于病毒的快速进化,每年都会引发流行疫情和偶发性大流行。疫苗被用于预防流感感染,但为了确保其疗效,流感疫苗的成分必须定期更新。计算工具和分析在指导疫苗选择过程中变得越来越重要。通过使用分裂和嵌入构建时间序列训练样本,我们使用递归神经网络 (RNN) 开发了一种计算方法,通过该方法可以预测合适的流感疫苗候选株。RNN 模型的编码器-解码器架构使我们能够进行序列到序列的预测。我们使用该模型预测了 2006 年至 2017 年期间采样的 H3N2 病毒的流行序列。我们预测的序列与推荐疫苗之间的一致性大于 98%,这表明它们具有抗原相似性。多步疫苗预测进一步证明了我们方法的稳健性,与单步预测相比,我们的方法取得了可比的结果。结果表明,推荐的疫苗株与流行株有显著的匹配。我们相信这将有助于疫苗选择和季节性流感流行的监测过程。