• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于结构的自监督学习能够在突变时实现超快速蛋白质稳定性预测。

Structure-based self-supervised learning enables ultrafast protein stability prediction upon mutation.

作者信息

Sun Jinyuan, Zhu Tong, Cui Yinglu, Wu Bian

机构信息

AIM Center, College of Life Sciences and Technology, Beijing University of Chemical Technology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China.

University of Chinese Academy of Sciences, Beijing, China.

出版信息

Innovation (Camb). 2025 Jan 6;6(1):100750. doi: 10.1016/j.xinn.2024.100750.

DOI:10.1016/j.xinn.2024.100750
PMID:39872490
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11763918/
Abstract

Predicting free energy changes (ΔΔG) is essential for enhancing our understanding of protein evolution and plays a pivotal role in protein engineering and pharmaceutical development. While traditional methods offer valuable insights, they are often constrained by computational speed and reliance on biased training datasets. These constraints become particularly evident when aiming for accurate ΔΔG predictions across a diverse array of protein sequences. Herein, we introduce Pythia, a self-supervised graph neural network specifically designed for zero-shot ΔΔG predictions. Our comparative benchmarks demonstrate that Pythia outperforms other self-supervised pretraining models and force field-based approaches while also exhibiting competitive performance with fully supervised models. Notably, Pythia shows strong correlations and achieves a remarkable increase in computational speed of up to 10-fold. We further validated Pythia's performance in predicting the thermostabilizing mutations of limonene epoxide hydrolase, leading to higher experimental success rates. This exceptional efficiency has enabled us to explore 26 million high-quality protein structures, marking a significant advancement in our ability to navigate the protein sequence space and enhance our understanding of the relationships between protein genotype and phenotype. In addition, we established a web server at https://pythia.wulab.xyz to allow users to easily perform such predictions.

摘要

预测自由能变化(ΔΔG)对于加深我们对蛋白质进化的理解至关重要,并且在蛋白质工程和药物开发中起着关键作用。虽然传统方法提供了有价值的见解,但它们常常受到计算速度和对有偏差训练数据集的依赖的限制。当旨在对各种蛋白质序列进行准确的ΔΔG预测时,这些限制变得尤为明显。在此,我们介绍了Pythia,这是一种专门为零样本ΔΔG预测设计的自监督图神经网络。我们的比较基准表明,Pythia优于其他自监督预训练模型和基于力场的方法,同时与全监督模型相比也表现出有竞争力的性能。值得注意的是,Pythia显示出很强的相关性,并且计算速度显著提高了高达10倍。我们进一步验证了Pythia在预测柠檬烯环氧水解酶的热稳定突变方面的性能,从而提高了实验成功率。这种卓越的效率使我们能够探索2600万个高质量的蛋白质结构,标志着我们在探索蛋白质序列空间以及加深对蛋白质基因型和表型之间关系的理解方面取得了重大进展。此外,我们在https://pythia.wulab.xyz建立了一个网络服务器,以便用户能够轻松地进行此类预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8270/11763918/2cda60cd9678/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8270/11763918/cb0c38e4a8e3/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8270/11763918/aa84a4b996ac/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8270/11763918/030193e2da5c/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8270/11763918/048916c6aa3b/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8270/11763918/fd6528521bd4/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8270/11763918/5a96c429c43f/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8270/11763918/2cda60cd9678/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8270/11763918/cb0c38e4a8e3/fx1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8270/11763918/aa84a4b996ac/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8270/11763918/030193e2da5c/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8270/11763918/048916c6aa3b/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8270/11763918/fd6528521bd4/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8270/11763918/5a96c429c43f/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8270/11763918/2cda60cd9678/gr6.jpg

相似文献

1
Structure-based self-supervised learning enables ultrafast protein stability prediction upon mutation.基于结构的自监督学习能够在突变时实现超快速蛋白质稳定性预测。
Innovation (Camb). 2025 Jan 6;6(1):100750. doi: 10.1016/j.xinn.2024.100750.
2
Prediction of mutation-induced protein stability changes based on the geometric representations learned by a self-supervised method.基于自监督方法学习到的几何表示来预测突变诱导的蛋白质稳定性变化。
BMC Bioinformatics. 2024 Aug 28;25(1):282. doi: 10.1186/s12859-024-05876-6.
3
Rewiring protein sequence and structure generative models to enhance protein stability prediction.重新调整蛋白质序列和结构生成模型以增强蛋白质稳定性预测。
bioRxiv. 2025 Feb 18:2025.02.13.638154. doi: 10.1101/2025.02.13.638154.
4
Semantical and geometrical protein encoding toward enhanced bioactivity and thermostability.面向增强生物活性和热稳定性的语义和几何蛋白质编码
Elife. 2025 May 2;13:RP98033. doi: 10.7554/eLife.98033.
5
GDMol: Generative Double-Masking Self-Supervised Learning for Molecular Property Prediction.GDMol:用于分子性质预测的生成式双掩码自监督学习
Mol Inform. 2025 Jan;44(1):e202400146. doi: 10.1002/minf.202400146. Epub 2024 Oct 24.
6
Comparing Supervised Learning and Rigorous Approach for Predicting Protein Stability upon Point Mutations in Difficult Targets.比较监督学习和严格方法用于预测难处理靶点点突变后的蛋白质稳定性
J Chem Inf Model. 2023 Nov 13;63(21):6778-6788. doi: 10.1021/acs.jcim.3c00750. Epub 2023 Oct 28.
7
A Multi-Task Self-Supervised Strategy for Predicting Molecular Properties and FGFR1 Inhibitors.一种用于预测分子性质和FGFR1抑制剂的多任务自监督策略
Adv Sci (Weinh). 2025 Apr;12(13):e2412987. doi: 10.1002/advs.202412987. Epub 2025 Feb 8.
8
A veracity dissemination consistency-based few-shot fake news detection framework by synergizing adversarial and contrastive self-supervised learning.一种基于真实性传播一致性的少样本假新闻检测框架,通过协同对抗性和对比性自监督学习实现。
Sci Rep. 2024 Aug 22;14(1):19470. doi: 10.1038/s41598-024-70039-9.
9
OmeDDG: Improved Protein Mutation Stability Prediction Based on Predicted 3D Structures.OmeDDG:基于预测的三维结构改进蛋白质突变稳定性预测。
J Phys Chem B. 2024 Jan 11;128(1):67-76. doi: 10.1021/acs.jpcb.3c05601. Epub 2023 Dec 21.
10
Cross-shaped windows transformer with self-supervised pretraining for clinically significant prostate cancer detection in bi-parametric MRI.用于双参数磁共振成像中具有临床意义的前列腺癌检测的带自监督预训练的十字形窗口变换器
Med Phys. 2025 Feb;52(2):993-1004. doi: 10.1002/mp.17546. Epub 2024 Nov 26.

引用本文的文献

1
Reliable prediction of protein-protein binding affinity changes upon mutations with Pythia-PPI.使用Pythia-PPI对突变后蛋白质-蛋白质结合亲和力变化进行可靠预测。
Natl Sci Rev. 2025 Jun 10;12(6):nwaf231. doi: 10.1093/nsr/nwaf231. eCollection 2025 Jun.
2
Mass balance approximation of unfolding boosts potential-based protein stability predictions.去折叠的质量平衡近似提高了基于势能的蛋白质稳定性预测。
Protein Sci. 2025 May;34(5):e70134. doi: 10.1002/pro.70134.

本文引用的文献

1
Improving the prediction of protein stability changes upon mutations by geometric learning and a pre-training strategy.通过几何学习和预训练策略提高突变后蛋白质稳定性变化的预测。
Nat Comput Sci. 2024 Nov;4(11):840-850. doi: 10.1038/s43588-024-00716-2. Epub 2024 Oct 25.
2
Stability Oracle: a structure-based graph-transformer framework for identifying stabilizing mutations.稳定性预测器:一种基于结构的图变换框架,用于识别稳定化突变。
Nat Commun. 2024 Jul 23;15(1):6170. doi: 10.1038/s41467-024-49780-2.
3
Transfer learning to leverage larger datasets for improved prediction of protein stability changes.
利用更大的数据集进行迁移学习,以提高蛋白质稳定性变化预测的准确性。
Proc Natl Acad Sci U S A. 2024 Feb 6;121(6):e2314853121. doi: 10.1073/pnas.2314853121. Epub 2024 Jan 29.
4
Rotamer-free protein sequence design based on deep learning and self-consistency.基于深度学习和自一致性的无旋转异构体蛋白质序列设计
Nat Comput Sci. 2022 Jul;2(7):451-462. doi: 10.1038/s43588-022-00273-6. Epub 2022 Jul 21.
5
Masked inverse folding with sequence transfer for protein representation learning.用于蛋白质表示学习的带序列转移的掩码反向折叠
Protein Eng Des Sel. 2023 Jan 21;36. doi: 10.1093/protein/gzad015.
6
Accurate proteome-wide missense variant effect prediction with AlphaMissense.使用 AlphaMissense 进行精确的全蛋白质错义变异效应预测。
Science. 2023 Sep 22;381(6664):eadg7492. doi: 10.1126/science.adg7492.
7
ProS-GNN: Predicting effects of mutations on protein stability using graph neural networks.ProS-GNN:使用图神经网络预测突变对蛋白质稳定性的影响。
Comput Biol Chem. 2023 Dec;107:107952. doi: 10.1016/j.compbiolchem.2023.107952. Epub 2023 Aug 26.
8
Mega-scale experimental analysis of protein folding stability in biology and design.大规模实验分析生物学和设计中的蛋白质折叠稳定性。
Nature. 2023 Aug;620(7973):434-444. doi: 10.1038/s41586-023-06328-6. Epub 2023 Jul 19.
9
DDMut: predicting effects of mutations on protein stability using deep learning.DDMut:使用深度学习预测突变对蛋白质稳定性的影响。
Nucleic Acids Res. 2023 Jul 5;51(W1):W122-W128. doi: 10.1093/nar/gkad472.
10
Rapid protein stability prediction using deep learning representations.利用深度学习表示进行快速蛋白质稳定性预测。
Elife. 2023 May 15;12:e82593. doi: 10.7554/eLife.82593.