• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过几何学习和预训练策略提高突变后蛋白质稳定性变化的预测。

Improving the prediction of protein stability changes upon mutations by geometric learning and a pre-training strategy.

机构信息

MOE Key Laboratory of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing, China.

Beijing Frontier Research Center for Biological Structure, Tsinghua University, Beijing, China.

出版信息

Nat Comput Sci. 2024 Nov;4(11):840-850. doi: 10.1038/s43588-024-00716-2. Epub 2024 Oct 25.

DOI:10.1038/s43588-024-00716-2
PMID:39455825
Abstract

Accurate prediction of protein mutation effects is of great importance in protein engineering and design. Here we propose GeoStab-suite, a suite of three geometric learning-based models-GeoFitness, GeoDDG and GeoDTm-for the prediction of fitness score, ΔΔG and ΔT of a protein upon mutations, respectively. GeoFitness engages a specialized loss function to allow supervised training of a unified model using the large amount of multi-labeled fitness data in the deep mutational scanning database. To further improve the downstream tasks of ΔΔG and ΔT prediction, the encoder of GeoFitness is reutilized as a pre-trained module in GeoDDG and GeoDTm to overcome the challenge of lacking sufficient labeled data. This pre-training strategy, in combination with data expansion, markedly improves model performance and generalizability. In the benchmark test, GeoDDG and GeoDTm outperform the other state-of-the-art methods by at least 30% and 70%, respectively, in terms of the Spearman correlation coefficient.

摘要

准确预测蛋白质突变效应在蛋白质工程和设计中具有重要意义。在这里,我们提出了 GeoStab-suite,这是一套基于几何学习的三个模型——GeoFitness、GeoDDG 和 GeoDTm——分别用于预测蛋白质突变后的适合度得分、ΔΔG 和 ΔT。GeoFitness 采用了专门的损失函数,允许使用深度突变扫描数据库中大量多标签适合度数据对统一模型进行有监督训练。为了进一步提高 ΔΔG 和 ΔT 预测的下游任务,GeoFitness 的编码器被重新用作 GeoDDG 和 GeoDTm 的预训练模块,以克服缺乏足够标记数据的挑战。这种预训练策略,结合数据扩展,显著提高了模型的性能和泛化能力。在基准测试中,GeoDDG 和 GeoDTm 在斯皮尔曼相关系数方面分别比其他最先进的方法至少高出 30%和 70%。

相似文献

1
Improving the prediction of protein stability changes upon mutations by geometric learning and a pre-training strategy.通过几何学习和预训练策略提高突变后蛋白质稳定性变化的预测。
Nat Comput Sci. 2024 Nov;4(11):840-850. doi: 10.1038/s43588-024-00716-2. Epub 2024 Oct 25.
2
Prediction of mutation-induced protein stability changes based on the geometric representations learned by a self-supervised method.基于自监督方法学习到的几何表示来预测突变诱导的蛋白质稳定性变化。
BMC Bioinformatics. 2024 Aug 28;25(1):282. doi: 10.1186/s12859-024-05876-6.
3
Protein multi-level structure feature-integrated deep learning method for mutational effect prediction.基于蛋白质多层次结构特征的深度学习基因突变效应预测方法。
Biotechnol J. 2024 Aug;19(8):e2400203. doi: 10.1002/biot.202400203.
4
Assessing computational methods for predicting protein stability upon mutation: good on average but not in the details.评估预测突变后蛋白质稳定性的计算方法:总体良好但细节欠佳。
Protein Eng Des Sel. 2009 Sep;22(9):553-60. doi: 10.1093/protein/gzp030. Epub 2009 Jun 26.
5
Assessing the performance of computational predictors for estimating protein stability changes upon missense mutations.评估用于估计错义突变后蛋白质稳定性变化的计算预测器的性能。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab184.
6
Flattening the curve-How to get better results with small deep-mutational-scanning datasets.拉平曲线——如何从小规模深度突变扫描数据集获得更好的结果。
Proteins. 2024 Jul;92(7):886-902. doi: 10.1002/prot.26686. Epub 2024 Mar 19.
7
Reviewing Challenges of Predicting Protein Melting Temperature Change Upon Mutation Through the Full Analysis of a Highly Detailed Dataset with High-Resolution Structures.通过对具有高分辨率结构的高度详细数据集进行全面分析来预测蛋白质突变时的熔融温度变化的挑战综述。
Mol Biotechnol. 2021 Oct;63(10):863-884. doi: 10.1007/s12033-021-00349-0. Epub 2021 Jun 8.
8
An end-to-end framework for the prediction of protein structure and fitness from single sequence.从单序列预测蛋白质结构和适应性的端到端框架。
Nat Commun. 2024 Aug 27;15(1):7400. doi: 10.1038/s41467-024-51776-x.
9
PON-tstab: Protein Variant Stability Predictor. Importance of Training Data Quality.PON-tstab:蛋白变体稳定性预测器。训练数据质量的重要性。
Int J Mol Sci. 2018 Mar 28;19(4):1009. doi: 10.3390/ijms19041009.
10
iSEE: Interface structure, evolution, and energy-based machine learning predictor of binding affinity changes upon mutations.iSEE:界面结构、进化和基于能量的机器学习预测突变引起的结合亲和力变化。
Proteins. 2019 Feb;87(2):110-119. doi: 10.1002/prot.25630. Epub 2018 Dec 3.

引用本文的文献

1
Predicting protein stability changes upon mutations with dual-view ensemble learning from single sequence.利用单序列的双视角集成学习预测突变后蛋白质稳定性的变化。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf319.
2
VenusMutHub: A systematic evaluation of protein mutation effect predictors on small-scale experimental data.金星突变中心:基于小规模实验数据对蛋白质突变效应预测因子的系统评估。
Acta Pharm Sin B. 2025 May;15(5):2454-2467. doi: 10.1016/j.apsb.2025.03.028. Epub 2025 Mar 14.
3
Fine-tuning of conditional Transformers improves enzyme prediction and generation.
条件Transformer的微调改进了酶的预测和生成。
Comput Struct Biotechnol J. 2025 Mar 26;27:1318-1334. doi: 10.1016/j.csbj.2025.03.037. eCollection 2025.
4
A Novel Missense Variant of in Juvenile Polyposis Syndrome: Assessment of Structural and Functional Alternations.青少年息肉病综合征中一种新的错义变异:结构和功能改变的评估。
Hum Mutat. 2025 Feb 18;2025:7317429. doi: 10.1155/humu/7317429. eCollection 2025.
5
Rewiring protein sequence and structure generative models to enhance protein stability prediction.重新调整蛋白质序列和结构生成模型以增强蛋白质稳定性预测。
bioRxiv. 2025 Feb 18:2025.02.13.638154. doi: 10.1101/2025.02.13.638154.
6
Decoding the effects of mutation on protein interactions using machine learning.利用机器学习解码突变对蛋白质相互作用的影响。
Biophys Rev (Melville). 2025 Feb 21;6(1):011307. doi: 10.1063/5.0249920. eCollection 2025 Mar.
7
Structure-based self-supervised learning enables ultrafast protein stability prediction upon mutation.基于结构的自监督学习能够在突变时实现超快速蛋白质稳定性预测。
Innovation (Camb). 2025 Jan 6;6(1):100750. doi: 10.1016/j.xinn.2024.100750.
8
EvoAI enables extreme compression and reconstruction of the protein sequence space.EvoAI能够对蛋白质序列空间进行极致的压缩和重建。
Res Sq. 2024 Feb 23:rs.3.rs-3930833. doi: 10.21203/rs.3.rs-3930833/v1.