• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ProSTAGE:利用蛋白质嵌入和图卷积网络预测突变对蛋白质稳定性的影响。

ProSTAGE: Predicting Effects of Mutations on Protein Stability by Using Protein Embeddings and Graph Convolutional Networks.

机构信息

Production and R&D Center I of LSS, GenScript (Shanghai) Biotech Co., Ltd., Shanghai 200131, China.

出版信息

J Chem Inf Model. 2024 Jan 22;64(2):340-347. doi: 10.1021/acs.jcim.3c01697. Epub 2024 Jan 2.

DOI:10.1021/acs.jcim.3c01697
PMID:38166383
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10806799/
Abstract

Protein thermodynamic stability is essential to clarify the relationships among structure, function, and interaction. Therefore, developing a faster and more accurate method to predict the impact of the mutations on protein stability is helpful for protein design and understanding the phenotypic variation. Recent studies have shown that protein embedding will be particularly powerful at modeling sequence information with context dependence, such as subcellular localization, variant effect, and secondary structure prediction. Herein, we introduce a novel method, ProSTAGE, which is a deep learning method that fuses structure and sequence embedding to predict protein stability changes upon single point mutations. Our model combines graph-based techniques and language models to predict stability changes. Moreover, ProSTAGE is trained on a larger data set, which is almost twice as large as the most used S2648 data set. It consistently outperforms all existing state-of-the-art methods on mutation-affected problems as benchmarked on several independent data sets. The protein embedding as the prediction input achieves better results than the previous results, which shows the potential of protein language models in predicting the effect of mutations on proteins. ProSTAGE is implemented as a user-friendly web server.

摘要

蛋白质热力学稳定性对于阐明结构、功能和相互作用之间的关系至关重要。因此,开发一种更快、更准确的方法来预测突变对蛋白质稳定性的影响,有助于蛋白质设计和理解表型变异。最近的研究表明,蛋白质嵌入在建模具有上下文依赖性的序列信息方面将特别强大,例如亚细胞定位、变体效应和二级结构预测。在这里,我们引入了一种新的方法 ProSTAGE,这是一种融合结构和序列嵌入的深度学习方法,用于预测单点突变对蛋白质稳定性变化的影响。我们的模型结合了基于图的技术和语言模型来预测稳定性变化。此外,ProSTAGE 是在一个更大的数据集上进行训练的,这个数据集几乎是最常用的 S2648 数据集的两倍大。它在几个独立的数据集中作为基准,在受突变影响的问题上始终优于所有现有的最先进的方法。将蛋白质嵌入作为预测输入可以获得比以前更好的结果,这表明蛋白质语言模型在预测突变对蛋白质的影响方面具有潜力。ProSTAGE 被实现为一个用户友好的网络服务器。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de12/10806799/86e507c81e6c/ci3c01697_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de12/10806799/123a1f31d2ed/ci3c01697_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de12/10806799/518d3b584773/ci3c01697_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de12/10806799/86e507c81e6c/ci3c01697_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de12/10806799/123a1f31d2ed/ci3c01697_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de12/10806799/518d3b584773/ci3c01697_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de12/10806799/86e507c81e6c/ci3c01697_0003.jpg

相似文献

1
ProSTAGE: Predicting Effects of Mutations on Protein Stability by Using Protein Embeddings and Graph Convolutional Networks.ProSTAGE:利用蛋白质嵌入和图卷积网络预测突变对蛋白质稳定性的影响。
J Chem Inf Model. 2024 Jan 22;64(2):340-347. doi: 10.1021/acs.jcim.3c01697. Epub 2024 Jan 2.
2
ELASPIC2 (EL2): Combining Contextualized Language Models and Graph Neural Networks to Predict Effects of Mutations.ELASPIC2(EL2):结合语境化语言模型和图神经网络来预测突变的影响。
J Mol Biol. 2021 May 28;433(11):166810. doi: 10.1016/j.jmb.2021.166810. Epub 2021 Jan 13.
3
EnzyACT: A Novel Deep Learning Method to Predict the Impacts of Single and Multiple Mutations on Enzyme Activity.EnzyACT:一种预测单突变和多突变对酶活性影响的新型深度学习方法。
J Chem Inf Model. 2024 Aug 12;64(15):5912-5921. doi: 10.1021/acs.jcim.4c00920. Epub 2024 Jul 22.
4
Transfer learning to leverage larger datasets for improved prediction of protein stability changes.利用更大的数据集进行迁移学习,以提高蛋白质稳定性变化预测的准确性。
Proc Natl Acad Sci U S A. 2024 Feb 6;121(6):e2314853121. doi: 10.1073/pnas.2314853121. Epub 2024 Jan 29.
5
Predicting changes in protein thermodynamic stability upon point mutation with deep 3D convolutional neural networks.利用深度 3D 卷积神经网络预测点突变对蛋白质热力学稳定性的影响。
PLoS Comput Biol. 2020 Nov 30;16(11):e1008291. doi: 10.1371/journal.pcbi.1008291. eCollection 2020 Nov.
6
Modeling aspects of the language of life through transfer-learning protein sequences.通过转移学习蛋白质序列来模拟生命语言的各个方面。
BMC Bioinformatics. 2019 Dec 17;20(1):723. doi: 10.1186/s12859-019-3220-8.
7
SAAFEC-SEQ: A Sequence-Based Method for Predicting the Effect of Single Point Mutations on Protein Thermodynamic Stability.SAAFEC-SEQ:一种基于序列的方法,用于预测单点突变对蛋白质热力学稳定性的影响。
Int J Mol Sci. 2021 Jan 9;22(2):606. doi: 10.3390/ijms22020606.
8
Graph-based prediction of Protein-protein interactions with attributed signed graph embedding.基于属性有向图嵌入的蛋白质-蛋白质相互作用的图预测。
BMC Bioinformatics. 2020 Jul 21;21(1):323. doi: 10.1186/s12859-020-03646-8.
9
Assessing protein model quality based on deep graph coupled networks using protein language model.基于蛋白质语言模型的深度图耦合网络评估蛋白质模型质量。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad420.
10
MAMF-GCN: Multi-scale adaptive multi-channel fusion deep graph convolutional network for predicting mental disorder.MAMF-GCN:用于预测精神障碍的多尺度自适应多通道融合深度图卷积网络。
Comput Biol Med. 2022 Sep;148:105823. doi: 10.1016/j.compbiomed.2022.105823. Epub 2022 Jul 6.

引用本文的文献

1
Graph neural network integrated with pretrained protein language model for predicting human-virus protein-protein interactions.结合预训练蛋白质语言模型的图神经网络用于预测人-病毒蛋白质-蛋白质相互作用
Brief Bioinform. 2025 Aug 31;26(5). doi: 10.1093/bib/bbaf461.
2
Predicting protein stability changes upon mutations with dual-view ensemble learning from single sequence.利用单序列的双视角集成学习预测突变后蛋白质稳定性的变化。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf319.
3
Shared-weight graph framework for comprehensive protein stability prediction across diverse mutation types.

本文引用的文献

1
DDMut: predicting effects of mutations on protein stability using deep learning.DDMut:使用深度学习预测突变对蛋白质稳定性的影响。
Nucleic Acids Res. 2023 Jul 5;51(W1):W122-W128. doi: 10.1093/nar/gkad472.
2
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
3
PROST: AlphaFold2-aware Sequence-Based Predictor to Estimate Protein Stability Changes upon Missense Mutations.
用于跨多种突变类型进行全面蛋白质稳定性预测的共享权重图框架。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf190.
4
Rewiring protein sequence and structure generative models to enhance protein stability prediction.重新调整蛋白质序列和结构生成模型以增强蛋白质稳定性预测。
bioRxiv. 2025 Feb 18:2025.02.13.638154. doi: 10.1101/2025.02.13.638154.
5
Decoding the effects of mutation on protein interactions using machine learning.利用机器学习解码突变对蛋白质相互作用的影响。
Biophys Rev (Melville). 2025 Feb 21;6(1):011307. doi: 10.1063/5.0249920. eCollection 2025 Mar.
6
Beyond digital twins: the role of foundation models in enhancing the interpretability of multiomics modalities in precision medicine.超越数字孪生:基础模型在提高精准医学中多组学模式的可解释性方面的作用。
FEBS Open Bio. 2025 Aug;15(8):1192-1208. doi: 10.1002/2211-5463.70003. Epub 2025 Feb 24.
7
ProCeSa: Contrast-Enhanced Structure-Aware Network for Thermostability Prediction with Protein Language Models.ProCeSa:用于蛋白质语言模型热稳定性预测的对比增强结构感知网络。
J Chem Inf Model. 2025 Mar 10;65(5):2304-2313. doi: 10.1021/acs.jcim.4c01752. Epub 2025 Feb 23.
8
ProG-SOL: Predicting Protein Solubility Using Protein Embeddings and Dual-Graph Convolutional Networks.ProG-SOL:利用蛋白质嵌入和双图卷积网络预测蛋白质溶解度
ACS Omega. 2025 Jan 24;10(4):3910-3916. doi: 10.1021/acsomega.4c09688. eCollection 2025 Feb 4.
9
SSEmb: A joint embedding of protein sequence and structure enables robust variant effect predictions.SSEmb:一种蛋白质序列和结构的联合嵌入方法,可实现稳健的变体效应预测。
Nat Commun. 2024 Nov 7;15(1):9646. doi: 10.1038/s41467-024-53982-z.
10
Variant Impact Predictor database (VIPdb), version 2: trends from three decades of genetic variant impact predictors.变异影响预测器数据库(VIPdb),版本 2:三十年来遗传变异影响预测器的趋势。
Hum Genomics. 2024 Aug 28;18(1):90. doi: 10.1186/s40246-024-00663-z.
PROST:基于序列的 AlphaFold2 感知预测器,用于预测错义突变后蛋白质稳定性的变化。
J Chem Inf Model. 2022 Sep 12;62(17):4270-4282. doi: 10.1021/acs.jcim.2c00799. Epub 2022 Aug 16.
4
Prediction of protein-protein interaction using graph neural networks.基于图神经网络的蛋白质-蛋白质相互作用预测。
Sci Rep. 2022 May 19;12(1):8360. doi: 10.1038/s41598-022-12201-9.
5
DDGun: an untrained predictor of protein stability changes upon amino acid variants.DDGun:一种未经训练的预测蛋白质稳定性变化的方法,针对氨基酸变体。
Nucleic Acids Res. 2022 Jul 5;50(W1):W222-W227. doi: 10.1093/nar/gkac325.
6
Predicting protein stability changes upon single-point mutation: a thorough comparison of the available tools on a new dataset.预测单点突变后蛋白质稳定性的变化:在新数据集上对现有工具的全面比较。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab555.
7
Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。
Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.
8
ProtTrans: Toward Understanding the Language of Life Through Self-Supervised Learning.ProtTrans:通过自监督学习理解生命语言。
IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):7112-7127. doi: 10.1109/TPAMI.2021.3095381. Epub 2022 Sep 14.
9
A Deep-Learning Sequence-Based Method to Predict Protein Stability Changes Upon Genetic Variations.基于深度学习的序列方法预测遗传变异后蛋白质稳定性的变化。
Genes (Basel). 2021 Jun 12;12(6):911. doi: 10.3390/genes12060911.
10
Structure-based protein function prediction using graph convolutional networks.基于结构的蛋白质功能预测使用图卷积网络。
Nat Commun. 2021 May 26;12(1):3168. doi: 10.1038/s41467-021-23303-9.