利用基因组数据和机器学习预测宿主物种对流感病毒和冠状病毒的易感性：一项范围综述

Predicting host species susceptibility to influenza viruses and coronaviruses using genome data and machine learning: a scoping review.

作者信息

Alberts Famke, Berke Olaf, Rocha Leilani, Keay Sheila, Maboni Grazieli, Poljak Zvonimir

机构信息

Department of Population Medicine, Ontario Veterinary College, University of Guelph, Guelph, ON, Canada.

Centre for Public Health and Zoonoses, University of Guelph, Guelph, ON, Canada.

出版信息

Front Vet Sci. 2024 Sep 25;11:1358028. doi: 10.3389/fvets.2024.1358028. eCollection 2024.

DOI:10.3389/fvets.2024.1358028

PMID:39386249

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11462629/

Abstract

INTRODUCTION

Predicting which species are susceptible to viruses (i.e., host range) is important for understanding and developing effective strategies to control viral outbreaks in both humans and animals. The use of machine learning and bioinformatic approaches to predict viral hosts has been expanded with advancements in techniques. We conducted a scoping review to identify the breadth of machine learning methods applied to influenza and coronavirus genome data for the identification of susceptible host species.

METHODS

The protocol for this scoping review is available at https://hdl.handle.net/10214/26112. Five online databases were searched, and 1,217 citations, published between January 2000 and May 2022, were obtained, and screened in duplicate for English language and research, covering the use of machine learning to identify susceptible species to viruses.

RESULTS

Fifty-three relevant publications were identified for data charting. The breadth of research was extensive including 32 different machine learning algorithms used in combination with 29 different feature selection methods and 43 different genome data input formats. There were 20 different methods used by authors to assess accuracy. Authors mostly used influenza viruses ( = 31/53 publications, 58.5%), however, more recent publications focused on coronaviruses and other viruses in combination with influenza viruses ( = 22/53, 41.5%). The susceptible animal groups authors most used were humans ( = 57/77 analyses, 74.0%), avian ( = 35/77 45.4%), and swine ( = 28/77, 36.4%). In total, 53 different hosts were used and, in most publications, data from multiple hosts was used.

DISCUSSION

The main gaps in research were a lack of standardized reporting of methodology and the use of broad host categories for classification. Overall, approaches to viral host identification using machine learning were diverse and extensive.

摘要

引言

预测哪些物种易感染病毒（即宿主范围）对于理解和制定控制人类和动物病毒爆发的有效策略至关重要。随着技术的进步，利用机器学习和生物信息学方法预测病毒宿主的应用得到了扩展。我们进行了一项范围综述，以确定应用于流感和冠状病毒基因组数据以识别易感宿主物种的机器学习方法的广度。

方法

本范围综述的方案可在https://hdl.handle.net/10214/26112获取。检索了五个在线数据库，获得了2000年1月至2022年5月发表的1217篇引文，并对其进行了重复筛选，以确保语言为英语且属于研究范畴，涵盖了使用机器学习识别病毒易感物种的内容。

结果

确定了53篇相关出版物用于数据图表绘制。研究范围广泛，包括32种不同的机器学习算法与29种不同的特征选择方法以及43种不同的基因组数据输入格式相结合。作者使用了20种不同的方法来评估准确性。作者大多使用流感病毒（31/53篇出版物，58.5%），然而，最近的出版物则侧重于冠状病毒和其他病毒与流感病毒的联合研究（22/53篇，41.5%）。作者最常使用的易感动物群体是人类（57/77项分析，74.0%）、禽类（35/77项，45.4%）和猪（28/77项，36.4%）。总共使用了53种不同的宿主，并且在大多数出版物中使用了来自多个宿主的数据。

讨论

研究中的主要差距在于缺乏方法学的标准化报告以及使用宽泛的宿主类别进行分类。总体而言，使用机器学习进行病毒宿主识别的方法多样且广泛。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/86db/11462629/7cf3e7d2e20f/fvets-11-1358028-g003.jpg

相似文献

Predicting host species susceptibility to influenza viruses and coronaviruses using genome data and machine learning: a scoping review.利用基因组数据和机器学习预测宿主物种对流感病毒和冠状病毒的易感性：一项范围综述

Front Vet Sci. 2024 Sep 25;11:1358028. doi: 10.3389/fvets.2024.1358028. eCollection 2024.

Utilizing machine learning and hemagglutinin sequences to identify likely hosts of influenza H3Nx viruses.利用机器学习和血凝素序列识别可能的 H3Nx 流感病毒宿主。

Prev Vet Med. 2024 Dec;233:106351. doi: 10.1016/j.prevetmed.2024.106351. Epub 2024 Sep 26.

Novel approach for identification of influenza virus host range and zoonotic transmissible sequences by determination of host-related associative positions in viral genome segments.通过确定病毒基因组片段中与宿主相关的关联位置来鉴定流感病毒宿主范围和人畜共患传播序列的新方法。

BMC Genomics. 2016 Nov 16;17(1):925. doi: 10.1186/s12864-016-3250-9.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Variation in the ACE2 receptor has limited utility for SARS-CoV-2 host prediction.ACE2 受体的变异性对预测 SARS-CoV-2 的宿主有限用性。

Elife. 2022 Nov 23;11:e80329. doi: 10.7554/eLife.80329.

Prediction of virus-host infectious association by supervised learning methods.通过监督学习方法预测病毒-宿主感染关联。

BMC Bioinformatics. 2017 Mar 14;18(Suppl 3):60. doi: 10.1186/s12859-017-1473-7.

Application of Machine Learning in Multimorbidity Research: Protocol for a Scoping Review.机器学习在多病种研究中的应用：系统评价方案。

JMIR Res Protoc. 2024 May 20;13:e53761. doi: 10.2196/53761.

Misclassified: identification of zoonotic transition biomarker candidates for influenza A viruses using deep neural network.错误分类：使用深度神经网络识别甲型流感病毒人畜共患病转变生物标志物候选物

Front Genet. 2023 Jul 27;14:1145166. doi: 10.3389/fgene.2023.1145166. eCollection 2023.

Predicting Zoonotic Risk of Influenza A Viruses from Host Tropism Protein Signature Using Random Forest.利用随机森林从宿主嗜性蛋白特征预测甲型流感病毒的人畜共患病风险

Int J Mol Sci. 2017 May 25;18(6):1135. doi: 10.3390/ijms18061135.

Predicting host-pathogen interactions with machine learning algorithms: A scoping review.使用机器学习算法预测宿主与病原体的相互作用：一项综述。

Infect Genet Evol. 2025 Jun;130:105751. doi: 10.1016/j.meegid.2025.105751. Epub 2025 Apr 10.

本文引用的文献

Machine learning and artificial intelligence: applications in healthcare epidemiology.机器学习与人工智能：在医疗保健流行病学中的应用

Antimicrob Steward Healthc Epidemiol. 2021 Oct 7;1(1):e28. doi: 10.1017/ash.2021.192. eCollection 2021.

A contemporary review on the important role of approaches for managing different aspects of COVID-19 crisis.一篇关于应对新冠疫情危机不同方面的方法所起重要作用的当代综述。

Inform Med Unlocked. 2022;28:100862. doi: 10.1016/j.imu.2022.100862. Epub 2022 Jan 21.

Tracking the amino acid changes of spike proteins across diverse host species of severe acute respiratory syndrome coronavirus 2.追踪严重急性呼吸综合征冠状病毒2在不同宿主物种间刺突蛋白的氨基酸变化。

iScience. 2022 Jan 21;25(1):103560. doi: 10.1016/j.isci.2021.103560. Epub 2021 Dec 2.

Predicting Cross-Species Infection of Swine Influenza Virus with Representation Learning of Amino Acid Features.基于氨基酸特征表示学习预测猪流感病毒的跨种感染

Comput Math Methods Med. 2021 Oct 11;2021:6985008. doi: 10.1155/2021/6985008. eCollection 2021.

Identifying and prioritizing potential human-infecting viruses from their genome sequences.从基因组序列中识别和确定潜在的感染人类的病毒，并对其进行优先级排序。

PLoS Biol. 2021 Sep 28;19(9):e3001390. doi: 10.1371/journal.pbio.3001390. eCollection 2021 Sep.

Influenza virus genotype to phenotype predictions through machine learning: a systematic review.通过机器学习进行流感病毒基因型到表型的预测：系统评价。

Emerg Microbes Infect. 2021 Dec;10(1):1896-1907. doi: 10.1080/22221751.2021.1978824.

Predicting hosts based on early SARS-CoV-2 samples and analyzing the 2020 pandemic.基于早期 SARS-CoV-2 样本预测宿主并分析 2020 年大流行。

Sci Rep. 2021 Aug 31;11(1):17422. doi: 10.1038/s41598-021-96903-6.

Compositional biases in RNA viruses: Causes, consequences and applications.RNA病毒中的组成性偏差：原因、后果及应用

Wiley Interdiscip Rev RNA. 2022 Mar;13(2):e1679. doi: 10.1002/wrna.1679. Epub 2021 Jun 21.

Alignment free sequence comparison methods and reservoir host prediction.无比对序列比较方法与宿主预测

Bioinformatics. 2021 Oct 11;37(19):3337-3342. doi: 10.1093/bioinformatics/btab338.

Virus Detection: A Review of the Current and Emerging Molecular and Immunological Methods.病毒检测：当前及新兴分子和免疫方法综述

Front Mol Biosci. 2021 Apr 20;8:637559. doi: 10.3389/fmolb.2021.637559. eCollection 2021.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

利用基因组数据和机器学习预测宿主物种对流感病毒和冠状病毒的易感性：一项范围综述

Predicting host species susceptibility to influenza viruses and coronaviruses using genome data and machine learning: a scoping review.

作者信息

机构信息

出版信息

INTRODUCTION

METHODS

RESULTS

DISCUSSION

引言

方法

结果

讨论

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献