• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用人工神经网络预测 SARS-CoV-2 的复发性突变。

Prediction of Recurrent Mutations in SARS-CoV-2 Using Artificial Neural Networks.

机构信息

Research Group in Cheminformatics & Nutrition, Departament de Bioquímica i Biotecnologia, Campus de Sescelades, Universitat Rovira i Virgili, 43007 Tarragona, Spain.

Department of Biology, University of Turku, 20500 Turku, Finland.

出版信息

Int J Mol Sci. 2022 Nov 24;23(23):14683. doi: 10.3390/ijms232314683.

DOI:10.3390/ijms232314683
PMID:36499005
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9736107/
Abstract

Predicting SARS-CoV-2 mutations is difficult, but predicting recurrent mutations driven by the host, such as those caused by host deaminases, is feasible. We used machine learning to predict which positions from the SARS-CoV-2 genome will hold a recurrent mutation and which mutations will be the most recurrent. We used data from April 2021 that we separated into three sets: a training set, a validation set, and an independent test set. For the test set, we obtained a specificity value of 0.69, a sensitivity value of 0.79, and an Area Under the Curve (AUC) of 0.8, showing that the prediction of recurrent SARS-CoV-2 mutations is feasible. Subsequently, we compared our predictions with updated data from January 2022, showing that some of the false positives in our prediction model become true positives later on. The most important variables detected by the model's Shapley Additive exPlanation (SHAP) are the nucleotide that mutates and RNA reactivity. This is consistent with the SARS-CoV-2 mutational bias pattern and the preference of some host deaminases for specific sequences and RNA secondary structures. We extend our investigation by analyzing the mutations from the variants of concern Alpha, Beta, Delta, Gamma, and Omicron. Finally, we analyzed amino acid changes by looking at the predicted recurrent mutations in the M-pro and spike proteins.

摘要

预测 SARS-CoV-2 突变是困难的,但预测由宿主驱动的复发性突变,如由宿主脱氨酶引起的突变,是可行的。我们使用机器学习来预测 SARS-CoV-2 基因组中的哪些位置将发生复发性突变,以及哪些突变将是最常见的。我们使用了 2021 年 4 月的数据,将其分为三组:训练集、验证集和独立测试集。对于测试集,我们获得了特异性值为 0.69、敏感性值为 0.79 和曲线下面积(AUC)为 0.8,表明预测 SARS-CoV-2 的复发性突变是可行的。随后,我们将我们的预测与 2022 年 1 月的更新数据进行了比较,结果表明我们的预测模型中的一些假阳性后来变成了真正的阳性。模型的 Shapley Additive exPlanation (SHAP) 检测到的最重要变量是发生突变的核苷酸和 RNA 反应性。这与 SARS-CoV-2 的突变偏向模式以及一些宿主脱氨酶对特定序列和 RNA 二级结构的偏好一致。我们通过分析关注变体 Alpha、Beta、Delta、Gamma 和 Omicron 的突变来扩展我们的研究。最后,我们通过研究 M 蛋白和刺突蛋白中的预测复发性突变来分析氨基酸变化。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20a5/9736107/e3b2cbeb271b/ijms-23-14683-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20a5/9736107/d5895060ad66/ijms-23-14683-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20a5/9736107/d68fbf6d1940/ijms-23-14683-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20a5/9736107/a0e931fd4a33/ijms-23-14683-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20a5/9736107/b124dfa4b578/ijms-23-14683-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20a5/9736107/6c50d055af31/ijms-23-14683-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20a5/9736107/e3b2cbeb271b/ijms-23-14683-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20a5/9736107/d5895060ad66/ijms-23-14683-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20a5/9736107/d68fbf6d1940/ijms-23-14683-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20a5/9736107/a0e931fd4a33/ijms-23-14683-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20a5/9736107/b124dfa4b578/ijms-23-14683-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20a5/9736107/6c50d055af31/ijms-23-14683-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20a5/9736107/e3b2cbeb271b/ijms-23-14683-g006.jpg

相似文献

1
Prediction of Recurrent Mutations in SARS-CoV-2 Using Artificial Neural Networks.利用人工神经网络预测 SARS-CoV-2 的复发性突变。
Int J Mol Sci. 2022 Nov 24;23(23):14683. doi: 10.3390/ijms232314683.
2
Mutation prediction in the SARS-CoV-2 genome using attention-based neural machine translation.利用基于注意力的神经机器翻译预测 SARS-CoV-2 基因组中的突变。
Math Biosci Eng. 2024 May 20;21(5):5996-6018. doi: 10.3934/mbe.2024264.
3
Developing an Amplification Refractory Mutation System-Quantitative Reverse Transcription-PCR Assay for Rapid and Sensitive Screening of SARS-CoV-2 Variants of Concern.开发扩增抑制突变系统-实时荧光定量 RT-PCR 检测方法,快速灵敏地筛查关注的 SARS-CoV-2 变异株。
Microbiol Spectr. 2022 Feb 23;10(1):e0143821. doi: 10.1128/spectrum.01438-21. Epub 2022 Jan 5.
4
Machine Learning Algorithms Associate Case Numbers with SARS-CoV-2 Variants Rather Than with Impactful Mutations.机器学习算法将病例编号与 SARS-CoV-2 变体相关联,而不是与有影响的突变相关联。
Viruses. 2023 May 24;15(6):1226. doi: 10.3390/v15061226.
5
Impact of ADAR-induced editing of minor viral RNA populations on replication and transmission of SARS-CoV-2.ADAR 诱导的小病毒 RNA 群体编辑对 SARS-CoV-2 复制和传播的影响。
Proc Natl Acad Sci U S A. 2022 Feb 8;119(6). doi: 10.1073/pnas.2112663119.
6
Molecular epidemiology of SARS-CoV-2 isolated from COVID-19 family clusters.新冠病毒家族聚集性病例中分离的 SARS-CoV-2 的分子流行病学研究。
BMC Med Genomics. 2021 Jun 1;14(1):144. doi: 10.1186/s12920-021-00990-3.
7
From delta to Omicron: S1-RBD/S2 mutation/deletion equilibrium in SARS-CoV-2 defined variants.从德尔塔到奥密克戎:SARS-CoV-2 定义变体中的 S1-RBD/S2 突变/缺失平衡。
Gene. 2022 Mar 10;814:146134. doi: 10.1016/j.gene.2021.146134. Epub 2022 Jan 4.
8
Temporal-Geographical Dispersion of SARS-CoV-2 Spike Glycoprotein Variant Lineages and Their Functional Prediction Using Approach.利用 方法预测 SARS-CoV-2 刺突糖蛋白变异株的时空地理分布及其功能。
mBio. 2021 Oct 26;12(5):e0268721. doi: 10.1128/mBio.02687-21.
9
Emergency SARS-CoV-2 Variants of Concern: Novel Multiplex Real-Time RT-PCR Assay for Rapid Detection and Surveillance.关注的紧急 SARS-CoV-2 变异株:用于快速检测和监测的新型多重实时 RT-PCR 检测方法。
Microbiol Spectr. 2022 Feb 23;10(1):e0251321. doi: 10.1128/spectrum.02513-21.
10
Convolutional Neural Networks Based on Sequential Spike Predict the High Human Adaptation of SARS-CoV-2 Omicron Variants.基于序列尖峰的卷积神经网络预测 SARS-CoV-2 奥密克戎变异株的高人类适应性。
Viruses. 2022 May 17;14(5):1072. doi: 10.3390/v14051072.

引用本文的文献

1
Transfer learning towards predicting viral missense mutations: A case study on SARS-CoV-2.面向预测病毒错义突变的迁移学习:以严重急性呼吸综合征冠状病毒2为例的研究
Comput Struct Biotechnol J. 2025 Apr 22;27:1686-1692. doi: 10.1016/j.csbj.2025.04.029. eCollection 2025.
2
Paying attention to the SARS-CoV-2 dialect : a deep neural network approach to predicting novel protein mutations.关注新冠病毒变体:一种预测新型蛋白质突变的深度神经网络方法。
Commun Biol. 2025 Jan 21;8(1):98. doi: 10.1038/s42003-024-07262-7.
3
A prediction of mutations in infectious viruses using artificial intelligence.

本文引用的文献

1
Quantifying Mutational Response to Track the Evolution of SARS-CoV-2 Spike Variants: Introducing a Statistical-Mechanics-Guided Machine Learning Method.量化突变反应以追踪 SARS-CoV-2 刺突变异株的进化:引入一种基于统计力学指导的机器学习方法。
J Phys Chem B. 2022 Oct 13;126(40):7895-7905. doi: 10.1021/acs.jpcb.2c04574. Epub 2022 Sep 30.
2
The roles of APOBEC-mediated RNA editing in SARS-CoV-2 mutations, replication and fitness.APOBEC 介导的 RNA 编辑在 SARS-CoV-2 突变、复制和适应性中的作用。
Sci Rep. 2022 Sep 13;12(1):14972. doi: 10.1038/s41598-022-19067-x.
3
Comprehensive fitness landscape of SARS-CoV-2 M reveals insights into viral resistance mechanisms.
利用人工智能预测传染性病毒中的突变。
Genomics Inform. 2024 Oct 8;22(1):15. doi: 10.1186/s44342-024-00019-y.
4
Properties and Mechanisms of Deletions, Insertions, and Substitutions in the Evolutionary History of SARS-CoV-2.SARS-CoV-2 进化史中缺失、插入和替换的特性和机制。
Int J Mol Sci. 2024 Mar 26;25(7):3696. doi: 10.3390/ijms25073696.
5
Broad Epitope Coverage of Therapeutic Multi-Antibody Combinations Targeting SARS-CoV-2 Boosts In Vivo Protection and Neutralization Potency to Corner an Immune-Evading Virus.靶向严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的治疗性多抗体组合的广泛表位覆盖增强了体内保护作用和中和效力,以遏制一种免疫逃逸病毒。
Biomedicines. 2024 Mar 13;12(3):642. doi: 10.3390/biomedicines12030642.
6
The Mutational Landscape of SARS-CoV-2.SARS-CoV-2 的突变景观。
Int J Mol Sci. 2023 May 22;24(10):9072. doi: 10.3390/ijms24109072.
7
A Simple Epidemiologic Model for Predicting Impaired Neutralization of New SARS-CoV-2 Variants.一种预测新型SARS-CoV-2变体中和能力受损的简单流行病学模型。
Vaccines (Basel). 2023 Jan 5;11(1):128. doi: 10.3390/vaccines11010128.
SARS-CoV-2 M 的综合适应景观揭示了病毒耐药机制的见解。
Elife. 2022 Jun 20;11:e77433. doi: 10.7554/eLife.77433.
4
Where did Omicron come from? Three key theories.奥密克戎毒株源自何处?三个关键理论。
Nature. 2022 Feb;602(7895):26-28. doi: 10.1038/d41586-022-00215-2.
5
Predicting the mutational drivers of future SARS-CoV-2 variants of concern.预测未来引起关注的 SARS-CoV-2 变异株的突变驱动因素。
Sci Transl Med. 2022 Feb 23;14(633):eabk3445. doi: 10.1126/scitranslmed.abk3445.
6
A Review of the Current Landscape of SARS-CoV-2 Main Protease Inhibitors: Have We Hit the Bullseye Yet?SARS-CoV-2 主蛋白酶抑制剂的现状综述:我们是否击中了靶心?
Int J Mol Sci. 2021 Dec 27;23(1):259. doi: 10.3390/ijms23010259.
7
GISAID's Role in Pandemic Response.全球流感共享数据库(GISAID)在大流行应对中的作用。
China CDC Wkly. 2021 Dec 3;3(49):1049-1051. doi: 10.46234/ccdcw2021.255.
8
Functional evaluation of the P681H mutation on the proteolytic activation of the SARS-CoV-2 variant B.1.1.7 (Alpha) spike.P681H突变对严重急性呼吸综合征冠状病毒2变异株B.1.1.7(阿尔法)刺突蛋白的蛋白水解激活的功能评估。
iScience. 2022 Jan 21;25(1):103589. doi: 10.1016/j.isci.2021.103589. Epub 2021 Dec 10.
9
Recent progress in the development of potential drugs against SARS-CoV-2.抗新型冠状病毒潜在药物研发的最新进展。
Curr Res Pharmacol Drug Discov. 2021;2:100057. doi: 10.1016/j.crphar.2021.100057. Epub 2021 Sep 14.
10
The N501Y spike substitution enhances SARS-CoV-2 infection and transmission.N501Y 刺突突变增强了 SARS-CoV-2 的感染和传播。
Nature. 2022 Feb;602(7896):294-299. doi: 10.1038/s41586-021-04245-0. Epub 2021 Nov 24.