• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于探索病毒适应性景观的蛋白质语言模型。

A protein language model for exploring viral fitness landscapes.

作者信息

Ito Jumpei, Strange Adam, Liu Wei, Joas Gustav, Lytras Spyros, Sato Kei

机构信息

Division of Systems Virology, Department of Microbiology and Immunology, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan.

International Research Center for Infectious Diseases, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan.

出版信息

Nat Commun. 2025 May 13;16(1):4236. doi: 10.1038/s41467-025-59422-w.

DOI:10.1038/s41467-025-59422-w
PMID:40360496
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12075601/
Abstract

Successively emerging SARS-CoV-2 variants lead to repeated epidemic surges through escalated fitness (i.e., relative effective reproduction number between variants). Modeling the genotype-fitness relationship enables us to pinpoint the mutations boosting viral fitness and flag high-risk variants immediately after their detection. Here, we present CoVFit, a protein language model adapted from ESM-2, designed to predict variant fitness based solely on spike protein sequences. CoVFit was trained on genotype-fitness data derived from viral genome surveillance and functional mutation assays related to immune evasion. CoVFit successively ranked the fitness of unknown future variants harboring nearly 15 mutations with informative accuracy. CoVFit identified 959 fitness elevation events throughout SARS-CoV-2 evolution until late 2023. Furthermore, we show that CoVFit is applicable for predicting viral evolution through single amino acid mutations. Our study gives insight into the SARS-CoV-2 fitness landscape and provides a tool for efficiently identifying SARS-CoV-2 variants with higher epidemic risk.

摘要

不断出现的新冠病毒(SARS-CoV-2)变种通过提升适应性(即变种之间的相对有效繁殖数)导致疫情反复激增。对基因型-适应性关系进行建模使我们能够确定增强病毒适应性的突变,并在检测到高风险变种后立即对其发出警示。在此,我们展示了CoVFit,这是一种从ESM-2改编而来的蛋白质语言模型,旨在仅根据刺突蛋白序列预测变种的适应性。CoVFit基于源自病毒基因组监测和与免疫逃逸相关的功能突变试验的基因型-适应性数据进行训练。CoVFit连续对携带近15个突变的未知未来变种的适应性进行了具有信息性准确性的排名。CoVFit在整个新冠病毒进化过程中直至2023年末识别出959个适应性提升事件。此外,我们表明CoVFit适用于通过单氨基酸突变预测病毒进化。我们的研究深入了解了新冠病毒的适应性格局,并提供了一种有效识别具有更高疫情风险的新冠病毒变种的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce5/12075601/bc5646cecd17/41467_2025_59422_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce5/12075601/f18275d3a797/41467_2025_59422_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce5/12075601/c0fe48a2767f/41467_2025_59422_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce5/12075601/8066268634f7/41467_2025_59422_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce5/12075601/e481559ca8f3/41467_2025_59422_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce5/12075601/bf898d5ec078/41467_2025_59422_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce5/12075601/bc5646cecd17/41467_2025_59422_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce5/12075601/f18275d3a797/41467_2025_59422_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce5/12075601/c0fe48a2767f/41467_2025_59422_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce5/12075601/8066268634f7/41467_2025_59422_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce5/12075601/e481559ca8f3/41467_2025_59422_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce5/12075601/bf898d5ec078/41467_2025_59422_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3ce5/12075601/bc5646cecd17/41467_2025_59422_Fig6_HTML.jpg

相似文献

1
A protein language model for exploring viral fitness landscapes.一种用于探索病毒适应性景观的蛋白质语言模型。
Nat Commun. 2025 May 13;16(1):4236. doi: 10.1038/s41467-025-59422-w.
2
Evolutionary and Phenotypic Characterization of Two Spike Mutations in European Lineage 20E of SARS-CoV-2.两种 SARS-CoV-2 欧洲谱系 20E 刺突突变的进化和表型特征。
mBio. 2021 Dec 21;12(6):e0231521. doi: 10.1128/mBio.02315-21. Epub 2021 Nov 16.
3
Molecular and structural insights into SARS-CoV-2 evolution: from BA.2 to XBB subvariants.SARS-CoV-2 进化的分子和结构洞察:从 BA.2 到 XBB 亚系。
mBio. 2024 Oct 16;15(10):e0322023. doi: 10.1128/mbio.03220-23. Epub 2024 Sep 16.
4
Role of glycosylation mutations at the N-terminal domain of SARS-CoV-2 XEC variant in immune evasion, cell-cell fusion, and spike stability.新型冠状病毒XEC变异株N端结构域糖基化突变在免疫逃逸、细胞间融合及刺突蛋白稳定性中的作用
J Virol. 2025 Apr 15;99(4):e0024225. doi: 10.1128/jvi.00242-25. Epub 2025 Mar 26.
5
The emerging plasticity of SARS-CoV-2.严重急性呼吸综合征冠状病毒2(SARS-CoV-2)新出现的可塑性。
Science. 2021 Mar 26;371(6536):1306-1308. doi: 10.1126/science.abg4493.
6
Analysis of 6.4 million SARS-CoV-2 genomes identifies mutations associated with fitness.分析 640 万例 SARS-CoV-2 基因组,鉴定与适应性相关的突变。
Science. 2022 Jun 17;376(6599):1327-1332. doi: 10.1126/science.abm1208. Epub 2022 May 24.
7
Decoding omicron: Genetic insight into its transmission dynamics, severity spectrum and ever-evolving strategies of immune escape in comparison with other SARS-CoV-2 variants.解读奥密克戎:与其他新冠病毒变体相比,对其传播动态、严重程度范围及不断演变的免疫逃逸策略的遗传学见解
Diagn Microbiol Infect Dis. 2025 Mar;111(3):116705. doi: 10.1016/j.diagmicrobio.2025.116705. Epub 2025 Jan 23.
8
Circulating SARS-CoV-2 spike N439K variants maintain fitness while evading antibody-mediated immunity.循环 Sars-CoV-2 刺突 N439K 变体在保持适应性的同时逃避抗体介导的免疫。
Cell. 2021 Mar 4;184(5):1171-1187.e20. doi: 10.1016/j.cell.2021.01.037. Epub 2021 Jan 28.
9
Biophysical principles predict fitness of SARS-CoV-2 variants.生物物理原理预测 SARS-CoV-2 变体的适应性。
Proc Natl Acad Sci U S A. 2024 Jun 4;121(23):e2314518121. doi: 10.1073/pnas.2314518121. Epub 2024 May 31.
10
Spread of Gamma (P.1) Sub-Lineages Carrying Spike Mutations Close to the Furin Cleavage Site and Deletions in the N-Terminal Domain Drives Ongoing Transmission of SARS-CoV-2 in Amazonas, Brazil.携带接近弗林裂解位点的刺突突变和 N 端结构域缺失的伽马(P.1)亚谱系的传播推动了巴西亚马逊州 SARS-CoV-2 的持续传播。
Microbiol Spectr. 2022 Feb 23;10(1):e0236621. doi: 10.1128/spectrum.02366-21.

引用本文的文献

1
SARITA: a large language model for generating the S1 subunit of the SARS-CoV-2 spike protein.SARITA:一种用于生成严重急性呼吸综合征冠状病毒2刺突蛋白S1亚基的大语言模型。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf384.
2
Evolving fitness and immune escape: a retrospective analysis of SARS-CoV-2 spike protein (2020-2024) using protein language model.不断演变的适应性与免疫逃逸:使用蛋白质语言模型对严重急性呼吸综合征冠状病毒2刺突蛋白(2020 - 2024年)的回顾性分析
Front Immunol. 2025 Jun 18;16:1576414. doi: 10.3389/fimmu.2025.1576414. eCollection 2025.
3
Predicting high-fitness viral protein variants with Bayesian active learning and biophysics.

本文引用的文献

1
Virological characteristics of the SARS-CoV-2 LP.8.1 variant.严重急性呼吸综合征冠状病毒2型LP.8.1变体的病毒学特征
Lancet Infect Dis. 2025 Apr;25(4):e193. doi: 10.1016/S1473-3099(25)00079-9. Epub 2025 Feb 10.
2
Structural basis for receptor-binding domain mobility of the spike in SARS-CoV-2 BA.2.86 and JN.1.刺突在 SARS-CoV-2 BA.2.86 和 JN.1 中受体结合域的结构基础:动态性
Nat Commun. 2024 Oct 7;15(1):8574. doi: 10.1038/s41467-024-52808-2.
3
Molecular and structural insights into SARS-CoV-2 evolution: from BA.2 to XBB subvariants.
利用贝叶斯主动学习和生物物理学预测高适应性病毒蛋白变体
Proc Natl Acad Sci U S A. 2025 Jun 17;122(24):e2503742122. doi: 10.1073/pnas.2503742122. Epub 2025 Jun 9.
4
Few-Shot Viral Variant Detection via Bayesian Active Learning and Biophysics.通过贝叶斯主动学习和生物物理学进行少样本病毒变体检测
bioRxiv. 2025 Mar 13:2025.03.12.642881. doi: 10.1101/2025.03.12.642881.
SARS-CoV-2 进化的分子和结构洞察:从 BA.2 到 XBB 亚系。
mBio. 2024 Oct 16;15(10):e0322023. doi: 10.1128/mbio.03220-23. Epub 2024 Sep 16.
4
Virological characteristics of the SARS-CoV-2 Omicron EG.5.1 variant.奥密克戎变异株 EG.5.1 病毒学特征。
Microbiol Immunol. 2024 Sep;68(9):305-330. doi: 10.1111/1348-0421.13165. Epub 2024 Jul 4.
5
Spike deep mutational scanning helps predict success of SARS-CoV-2 clades.刺突深度突变扫描有助于预测新冠病毒进化枝的成功情况。
Nature. 2024 Jul;631(8021):617-626. doi: 10.1038/s41586-024-07636-1. Epub 2024 Jul 3.
6
Distinct evolution of SARS-CoV-2 Omicron XBB and BA.2.86/JN.1 lineages combining increased fitness and antibody evasion.奥密克戎 XBB 和 BA.2.86/JN.1 分支的 SARS-CoV-2 具有不同的进化特征,表现为适应性增强和抗体逃避能力提高。
Nat Commun. 2024 Mar 13;15(1):2254. doi: 10.1038/s41467-024-46490-7.
7
Virological characteristics of the SARS-CoV-2 BA.2.86 variant.SARS-CoV-2 BA.2.86 变异株的病毒学特征。
Cell Host Microbe. 2024 Feb 14;32(2):170-180.e12. doi: 10.1016/j.chom.2024.01.001. Epub 2024 Jan 26.
8
Characteristics of the SARS-CoV-2 omicron HK.3 variant harbouring the FLip substitution.携带FLip替代的新冠病毒奥密克戎HK.3变异株的特征
Lancet Microbe. 2024 Apr;5(4):e313. doi: 10.1016/S2666-5247(23)00373-7. Epub 2024 Jan 11.
9
Virological characteristics of the SARS-CoV-2 JN.1 variant.严重急性呼吸综合征冠状病毒2 JN.1变体的病毒学特征。
Lancet Infect Dis. 2024 Feb;24(2):e82. doi: 10.1016/S1473-3099(23)00813-7. Epub 2024 Jan 3.
10
Deep mutational scans of XBB.1.5 and BQ.1.1 reveal ongoing epistatic drift during SARS-CoV-2 evolution.深度突变扫描显示,XBB.1.5 和 BQ.1.1 期间 SARS-CoV-2 进化持续发生上位性漂移。
PLoS Pathog. 2023 Dec 29;19(12):e1011901. doi: 10.1371/journal.ppat.1011901. eCollection 2023 Dec.