• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

迁移学习以利用更大的数据集来改进对蛋白质稳定性变化的预测。

Transfer learning to leverage larger datasets for improved prediction of protein stability changes.

作者信息

Dieckhaus Henry, Brocidiacono Michael, Randolph Nicholas, Kuhlman Brian

机构信息

Department of Biochemistry and Biophysics, University of North Carolina School of Medicine, Chapel Hill, North Carolina, USA.

Division of Chemical Biology and Medicinal Chemistry, University of North Carolina Eshelman School of Pharmacy, Chapel Hill, North Carolina, USA.

出版信息

bioRxiv. 2023 Jul 30:2023.07.27.550881. doi: 10.1101/2023.07.27.550881.

DOI:10.1101/2023.07.27.550881
PMID:37547004
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10402116/
Abstract

Amino acid mutations that lower a protein's thermodynamic stability are implicated in numerous diseases, and engineered proteins with enhanced stability are important in research and medicine. Computational methods for predicting how mutations perturb protein stability are therefore of great interest. Despite recent advancements in protein design using deep learning, prediction of stability changes has remained challenging, in part due to a lack of large, high-quality training datasets for model development. Here we introduce ThermoMPNN, a deep neural network trained to predict stability changes for protein point mutations given an initial structure. In doing so, we demonstrate the utility of a newly released mega-scale stability dataset for training a robust stability model. We also employ transfer learning to leverage a second, larger dataset by using learned features extracted from a deep neural network trained to predict a protein's amino acid sequence given its three-dimensional structure. We show that our method achieves competitive performance on established benchmark datasets using a lightweight model architecture that allows for rapid, scalable predictions. Finally, we make ThermoMPNN readily available as a tool for stability prediction and design.

摘要

降低蛋白质热力学稳定性的氨基酸突变与多种疾病有关,而具有增强稳定性的工程蛋白在研究和医学中具有重要意义。因此,预测突变如何扰乱蛋白质稳定性的计算方法备受关注。尽管最近在利用深度学习进行蛋白质设计方面取得了进展,但稳定性变化的预测仍然具有挑战性,部分原因是缺乏用于模型开发的大规模、高质量训练数据集。在这里,我们介绍了ThermoMPNN,这是一种深度神经网络,经过训练可以在给定初始结构的情况下预测蛋白质点突变的稳定性变化。在此过程中,我们展示了新发布的大规模稳定性数据集在训练强大的稳定性模型方面的效用。我们还采用迁移学习,通过使用从经过训练以根据蛋白质的三维结构预测其氨基酸序列的深度神经网络中提取的学习特征,来利用第二个更大的数据集。我们表明,我们的方法使用轻量级模型架构在既定的基准数据集上实现了有竞争力的性能,该架构允许进行快速、可扩展的预测。最后,我们将ThermoMPNN作为一种稳定性预测和设计工具提供。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf6/10402116/5aec3f17a481/nihpp-2023.07.27.550881v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf6/10402116/a49e13577e46/nihpp-2023.07.27.550881v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf6/10402116/f607f4174c97/nihpp-2023.07.27.550881v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf6/10402116/c3182e6f7ce9/nihpp-2023.07.27.550881v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf6/10402116/5aec3f17a481/nihpp-2023.07.27.550881v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf6/10402116/a49e13577e46/nihpp-2023.07.27.550881v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf6/10402116/f607f4174c97/nihpp-2023.07.27.550881v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf6/10402116/c3182e6f7ce9/nihpp-2023.07.27.550881v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbf6/10402116/5aec3f17a481/nihpp-2023.07.27.550881v1-f0004.jpg

相似文献

1
Transfer learning to leverage larger datasets for improved prediction of protein stability changes.迁移学习以利用更大的数据集来改进对蛋白质稳定性变化的预测。
bioRxiv. 2023 Jul 30:2023.07.27.550881. doi: 10.1101/2023.07.27.550881.
2
Transfer learning to leverage larger datasets for improved prediction of protein stability changes.利用更大的数据集进行迁移学习,以提高蛋白质稳定性变化预测的准确性。
Proc Natl Acad Sci U S A. 2024 Feb 6;121(6):e2314853121. doi: 10.1073/pnas.2314853121. Epub 2024 Jan 29.
3
Protein stability models fail to capture epistatic interactions of double point mutations.蛋白质稳定性模型无法捕捉双点突变的上位性相互作用。
bioRxiv. 2024 Aug 21:2024.08.20.608844. doi: 10.1101/2024.08.20.608844.
4
MaTPIP: A deep-learning architecture with eXplainable AI for sequence-driven, feature mixed protein-protein interaction prediction.MaTPIP:一种具有可解释 AI 的深度学习架构,用于序列驱动、特征混合的蛋白质-蛋白质相互作用预测。
Comput Methods Programs Biomed. 2024 Feb;244:107955. doi: 10.1016/j.cmpb.2023.107955. Epub 2023 Nov 30.
5
Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices.在新合成数据集上训练的集成机器学习模型,对于使用可穿戴设备进行压力预测具有良好的泛化能力。
J Biomed Inform. 2023 Dec;148:104556. doi: 10.1016/j.jbi.2023.104556. Epub 2023 Dec 2.
6
Off-target predictions in CRISPR-Cas9 gene editing using deep learning.使用深度学习进行 CRISPR-Cas9 基因编辑中的脱靶预测。
Bioinformatics. 2018 Sep 1;34(17):i656-i663. doi: 10.1093/bioinformatics/bty554.
7
Prediction of mutation-induced protein stability changes based on the geometric representations learned by a self-supervised method.基于自监督方法学习到的几何表示来预测突变诱导的蛋白质稳定性变化。
BMC Bioinformatics. 2024 Aug 28;25(1):282. doi: 10.1186/s12859-024-05876-6.
8
OmeDDG: Improved Protein Mutation Stability Prediction Based on Predicted 3D Structures.OmeDDG:基于预测的三维结构改进蛋白质突变稳定性预测。
J Phys Chem B. 2024 Jan 11;128(1):67-76. doi: 10.1021/acs.jpcb.3c05601. Epub 2023 Dec 21.
9
Predicting the Effect of Single and Multiple Mutations on Protein Structural Stability.预测单突变和多突变对蛋白质结构稳定性的影响。
Molecules. 2018 Jan 27;23(2):251. doi: 10.3390/molecules23020251.
10
Improved Prediction of Stabilizing Mutations in Proteins by Incorporation of Mutational Effects on Ligand Binding.通过纳入突变对配体结合的影响改进蛋白质中稳定突变的预测。
Proteins. 2025 Jan;93(1):384-395. doi: 10.1002/prot.26738. Epub 2024 Aug 21.

本文引用的文献

1
Stability Oracle: a structure-based graph-transformer framework for identifying stabilizing mutations.稳定性预测器:一种基于结构的图变换框架,用于识别稳定化突变。
Nat Commun. 2024 Jul 23;15(1):6170. doi: 10.1038/s41467-024-49780-2.
2
PROSTATA: a framework for protein stability assessment using transformers.前列腺:使用变压器进行蛋白质稳定性评估的框架。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad671.
3
ProS-GNN: Predicting effects of mutations on protein stability using graph neural networks.ProS-GNN:使用图神经网络预测突变对蛋白质稳定性的影响。
Comput Biol Chem. 2023 Dec;107:107952. doi: 10.1016/j.compbiolchem.2023.107952. Epub 2023 Aug 26.
4
Mega-scale experimental analysis of protein folding stability in biology and design.大规模实验分析生物学和设计中的蛋白质折叠稳定性。
Nature. 2023 Aug;620(7973):434-444. doi: 10.1038/s41586-023-06328-6. Epub 2023 Jul 19.
5
Rapid protein stability prediction using deep learning representations.利用深度学习表示进行快速蛋白质稳定性预测。
Elife. 2023 May 15;12:e82593. doi: 10.7554/eLife.82593.
6
Engineering protein-based therapeutics through structural and chemical design.通过结构和化学设计工程蛋白质类治疗药物。
Nat Commun. 2023 Apr 27;14(1):2411. doi: 10.1038/s41467-023-38039-x.
7
Light attention predicts protein location from the language of life.轻注意力从生命语言中预测蛋白质位置。
Bioinform Adv. 2021 Nov 19;1(1):vbab035. doi: 10.1093/bioadv/vbab035. eCollection 2021.
8
BayeStab: Predicting effects of mutations on protein stability with uncertainty quantification.BayeStab:使用不确定性量化预测突变对蛋白质稳定性的影响。
Protein Sci. 2022 Nov;31(11):e4467. doi: 10.1002/pro.4467.
9
Stabilizing proteins, simplified: A Rosetta-based webtool for predicting favorable mutations.稳定蛋白质,简化版:基于 Rosetta 的预测有利突变的网络工具。
Protein Sci. 2022 Oct;31(10):e4428. doi: 10.1002/pro.4428.
10
Robust deep learning-based protein sequence design using ProteinMPNN.使用 ProteinMPNN 进行健壮的基于深度学习的蛋白质序列设计。
Science. 2022 Oct 7;378(6615):49-56. doi: 10.1126/science.add2187. Epub 2022 Sep 15.