Kieran Troy J, Sun Xiangjie, Maines Taronna R, Belser Jessica A
Influenza Division, Centers for Disease Control and Prevention, Atlanta, GA, USA.
Commun Biol. 2024 Aug 1;7(1):927. doi: 10.1038/s42003-024-06629-0.
In vivo assessments of influenza A virus (IAV) pathogenicity and transmissibility in ferrets represent a crucial component of many pandemic risk assessment rubrics, but few systematic efforts to identify which data from in vivo experimentation are most useful for predicting pathogenesis and transmission outcomes have been conducted. To this aim, we aggregated viral and molecular data from 125 contemporary IAV (H1, H2, H3, H5, H7, and H9 subtypes) evaluated in ferrets under a consistent protocol. Three overarching predictive classification outcomes (lethality, morbidity, transmissibility) were constructed using machine learning (ML) techniques, employing datasets emphasizing virological and clinical parameters from inoculated ferrets, limited to viral sequence-based information, or combining both data types. Among 11 different ML algorithms tested and assessed, gradient boosting machines and random forest algorithms yielded the highest performance, with models for lethality and transmission consistently better performing than models predicting morbidity. Comparisons of feature selection among models was performed, and highest performing models were validated with results from external risk assessment studies. Our findings show that ML algorithms can be used to summarize complex in vivo experimental work into succinct summaries that inform and enhance risk assessment criteria for pandemic preparedness that take in vivo data into account.
雪貂体内甲型流感病毒(IAV)致病性和传播性的评估是许多大流行风险评估标准的关键组成部分,但很少有人系统地努力确定体内实验中的哪些数据对预测发病机制和传播结果最有用。为此,我们汇总了按照一致方案在雪貂体内评估的125种当代IAV(H1、H2、H3、H5、H7和H9亚型)的病毒和分子数据。使用机器学习(ML)技术构建了三个总体预测分类结果(致死率、发病率、传播性),采用的数据集强调接种雪貂的病毒学和临床参数,仅限于基于病毒序列的信息,或结合两种数据类型。在测试和评估的11种不同ML算法中,梯度提升机和随机森林算法表现最佳,致死率和传播模型的表现始终优于预测发病率的模型。对模型之间的特征选择进行了比较,并使用外部风险评估研究的结果对表现最佳的模型进行了验证。我们的研究结果表明,ML算法可用于将复杂的体内实验工作总结为简洁的摘要,为考虑体内数据的大流行防范风险评估标准提供信息并加以完善。