U. S. Geological Survey, National Wildlife Health Center, Madison, Wisconsin.
Department of Statistics and Department of Entomology, University of Wisconsin - Madison, Madison, Wisconsin.
Transbound Emerg Dis. 2019 Nov;66(6):2537-2545. doi: 10.1111/tbed.13318. Epub 2019 Aug 19.
Influenza A viruses are one of the most significant viral groups globally with substantial impacts on human, domestic animal and wildlife health. Wild birds are the natural reservoirs for these viruses, and active surveillance within wild bird populations provides critical information about viral evolution forming the basis of risk assessments and countermeasure development. Unfortunately, active surveillance programs are often resource-intensive, and thus, enhancing programs for increased efficiency is paramount. Machine learning, a branch of artificial intelligence applications, provides statistical learning procedures that can be used to gain novel insights into disease surveillance systems. We use a form of machine learning, gradient boosted trees, to estimate the probability of isolating avian influenza viruses (AIV) from wild bird samples collected during surveillance for AIVs from 2006 to 2011 in the United States. We examined several predictive features including age, sex, bird type, geographic location and matrix gene rRT-PCR results. Our final model had high predictive power and only included geographic location and rRT-PCR results as important predictors. The highest predicted viral isolation probability was for samples collected from the north-central states and the south-eastern region of Alaska. Lower rRT-PCR Ct-values are associated with increased likelihood of AIV isolation, and the model estimated 16% probability of isolating AIV from samples declared negative (i.e., ≥35 Ct-value) using the rRT-PCR screening test and standard protocols. Our model can be used to prioritize previously collected samples for isolation and rapidly evaluate AIV surveillance designs to maximize the probability of viral isolation given limited resources and laboratory capacity.
甲型流感病毒是全球最重要的病毒群之一,对人类、家畜和野生动物的健康有重大影响。野生鸟类是这些病毒的天然宿主,对野生鸟类种群进行主动监测可以提供有关病毒进化的关键信息,为风险评估和对策制定提供依据。不幸的是,主动监测计划通常需要大量资源,因此,提高计划的效率至关重要。机器学习是人工智能应用的一个分支,它提供了统计学习程序,可以用于深入了解疾病监测系统。我们使用一种机器学习方法,梯度提升树,来估计从美国 2006 年至 2011 年期间针对甲型流感病毒(AIV)的监测中收集的野生鸟类样本中分离出 AIV 的概率。我们检查了几个预测特征,包括年龄、性别、鸟类类型、地理位置和基质基因 rRT-PCR 结果。我们的最终模型具有很高的预测能力,仅包括地理位置和 rRT-PCR 结果作为重要的预测因素。预测的病毒分离概率最高的是从中部各州和阿拉斯加东南部采集的样本。较低的 rRT-PCR Ct 值与 AIV 分离的可能性增加有关,该模型估计使用 rRT-PCR 筛选试验和标准方案从宣布为阴性(即 rRT-PCR 检测值≥35)的样本中分离出 AIV 的概率为 16%。我们的模型可用于优先对以前收集的样本进行分离,并快速评估 AIV 监测设计,以在资源有限和实验室能力有限的情况下最大限度地提高病毒分离的可能性。