• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蛋白质稳定性模型无法捕捉双点突变的上位性相互作用。

Protein stability models fail to capture epistatic interactions of double point mutations.

作者信息

Dieckhaus Henry, Kuhlman Brian

机构信息

Department of Biochemistry and Biophysics, University of North Carolina School of Medicine, Chapel Hill, North Carolina, USA.

Division of Chemical Biology and Medicinal Chemistry, University of North Carolina Eshelman School of Pharmacy, Chapel Hill, North Carolina, USA.

出版信息

bioRxiv. 2024 Aug 21:2024.08.20.608844. doi: 10.1101/2024.08.20.608844.

DOI:10.1101/2024.08.20.608844
PMID:39229177
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11370451/
Abstract

There is strong interest in accurate methods for predicting changes in protein stability resulting from amino acid mutations to the protein sequence. Recombinant proteins must often be stabilized to be used as therapeutics or reagents, and destabilizing mutations are implicated in a variety of diseases. Due to increased data availability and improved modeling techniques, recent studies have shown advancements in predicting changes in protein stability when a single point mutation is made. Less focus has been directed toward predicting changes in protein stability when there are two or more mutations, despite the significance of mutation clusters for disease pathways and protein design studies. Here, we analyze the largest available dataset of double point mutation stability and benchmark several widely used protein stability models on this and other datasets. We identify a blind spot in how predictors are typically evaluated on multiple mutations, finding that, contrary to assumptions in the field, current stability models are unable to consistently capture epistatic interactions between double mutations. We observe one notable deviation from this trend, which is that epistasis-aware models provide marginally better predictions on stabilizing double point mutations. We develop an extension of the ThermoMPNN framework for double mutant modeling as well as a novel data augmentation scheme which mitigates some of the limitations in available datasets. Collectively, our findings indicate that current protein stability models fail to capture the nuanced epistatic interactions between concurrent mutations due to several factors, including training dataset limitations and insufficient model sensitivity.

摘要

人们对预测氨基酸突变导致蛋白质稳定性变化的准确方法有着浓厚兴趣。重组蛋白通常必须进行稳定化处理才能用作治疗药物或试剂,而不稳定突变与多种疾病有关。由于数据可用性的提高和建模技术的改进,最近的研究表明在预测单点突变时蛋白质稳定性的变化方面取得了进展。尽管突变簇对疾病途径和蛋白质设计研究具有重要意义,但较少关注预测两个或更多突变时蛋白质稳定性的变化。在此,我们分析了最大的双点突变稳定性可用数据集,并在该数据集及其他数据集上对几种广泛使用的蛋白质稳定性模型进行了基准测试。我们发现了预测器在多个突变上通常评估方式中的一个盲点,即与该领域的假设相反,当前的稳定性模型无法一致地捕捉双突变之间的上位性相互作用。我们观察到一个明显偏离此趋势的情况,即考虑上位性的模型对稳定双点突变的预测略好一些。我们开发了用于双突变体建模的ThermoMPNN框架扩展以及一种新颖的数据增强方案,该方案减轻了现有数据集中的一些局限性。总体而言,我们的研究结果表明,由于包括训练数据集限制和模型敏感性不足等几个因素,当前的蛋白质稳定性模型无法捕捉并发突变之间细微的上位性相互作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8afb/11370451/96a5db8066f5/nihpp-2024.08.20.608844v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8afb/11370451/a7a6abe9d294/nihpp-2024.08.20.608844v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8afb/11370451/c2792218ad7e/nihpp-2024.08.20.608844v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8afb/11370451/96a5db8066f5/nihpp-2024.08.20.608844v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8afb/11370451/a7a6abe9d294/nihpp-2024.08.20.608844v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8afb/11370451/c2792218ad7e/nihpp-2024.08.20.608844v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8afb/11370451/96a5db8066f5/nihpp-2024.08.20.608844v1-f0003.jpg

相似文献

1
Protein stability models fail to capture epistatic interactions of double point mutations.蛋白质稳定性模型无法捕捉双点突变的上位性相互作用。
bioRxiv. 2024 Aug 21:2024.08.20.608844. doi: 10.1101/2024.08.20.608844.
2
Protein stability models fail to capture epistatic interactions of double point mutations.蛋白质稳定性模型无法捕捉双点突变的上位性相互作用。
Protein Sci. 2025 Jan;34(1):e70003. doi: 10.1002/pro.70003.
3
Transfer learning to leverage larger datasets for improved prediction of protein stability changes.利用更大的数据集进行迁移学习,以提高蛋白质稳定性变化预测的准确性。
Proc Natl Acad Sci U S A. 2024 Feb 6;121(6):e2314853121. doi: 10.1073/pnas.2314853121. Epub 2024 Jan 29.
4
Transfer learning to leverage larger datasets for improved prediction of protein stability changes.迁移学习以利用更大的数据集来改进对蛋白质稳定性变化的预测。
bioRxiv. 2023 Jul 30:2023.07.27.550881. doi: 10.1101/2023.07.27.550881.
5
Assessing the performance of computational predictors for estimating protein stability changes upon missense mutations.评估用于估计错义突变后蛋白质稳定性变化的计算预测器的性能。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab184.
6
Assessing computational tools for predicting protein stability changes upon missense mutations using a new dataset.评估使用新数据集预测错义突变导致的蛋白质稳定性变化的计算工具。
Protein Sci. 2024 Jan;33(1):e4861. doi: 10.1002/pro.4861.
7
Improved Prediction of Stabilizing Mutations in Proteins by Incorporation of Mutational Effects on Ligand Binding.通过纳入突变对配体结合的影响改进蛋白质中稳定突变的预测。
Proteins. 2025 Jan;93(1):384-395. doi: 10.1002/prot.26738. Epub 2024 Aug 21.
8
Coupled molecular dynamics mediate long- and short-range epistasis between mutations that affect stability and aggregation kinetics.偶联分子动力学介导影响稳定性和聚集动力学的突变之间的长程和短程上位性。
Proc Natl Acad Sci U S A. 2018 Nov 20;115(47):E11043-E11052. doi: 10.1073/pnas.1810324115. Epub 2018 Nov 7.
9
Quantification of biases in predictions of protein stability changes upon mutations.量化预测蛋白质突变稳定性变化时的偏差。
Bioinformatics. 2018 Nov 1;34(21):3659-3665. doi: 10.1093/bioinformatics/bty348.
10
Predicting the Effect of Amino Acid Single-Point Mutations on Protein Stability-Large-Scale Validation of MD-Based Relative Free Energy Calculations.预测氨基酸单点突变对蛋白质稳定性的影响——基于分子动力学的相对自由能计算的大规模验证
J Mol Biol. 2017 Apr 7;429(7):948-963. doi: 10.1016/j.jmb.2016.12.007. Epub 2016 Dec 10.

本文引用的文献

1
An end-to-end framework for the prediction of protein structure and fitness from single sequence.从单序列预测蛋白质结构和适应性的端到端框架。
Nat Commun. 2024 Aug 27;15(1):7400. doi: 10.1038/s41467-024-51776-x.
2
Stability Oracle: a structure-based graph-transformer framework for identifying stabilizing mutations.稳定性预测器:一种基于结构的图变换框架,用于识别稳定化突变。
Nat Commun. 2024 Jul 23;15(1):6170. doi: 10.1038/s41467-024-49780-2.
3
Protein design using structure-based residue preferences.基于结构的残基偏好的蛋白质设计。
Nat Commun. 2024 Feb 22;15(1):1639. doi: 10.1038/s41467-024-45621-4.
4
Machine learning for functional protein design.用于功能性蛋白质设计的机器学习
Nat Biotechnol. 2024 Feb;42(2):216-228. doi: 10.1038/s41587-024-02127-0. Epub 2024 Feb 15.
5
Transfer learning to leverage larger datasets for improved prediction of protein stability changes.利用更大的数据集进行迁移学习,以提高蛋白质稳定性变化预测的准确性。
Proc Natl Acad Sci U S A. 2024 Feb 6;121(6):e2314853121. doi: 10.1073/pnas.2314853121. Epub 2024 Jan 29.
6
The energetic and allosteric landscape for KRAS inhibition.KRAS抑制的能量和变构格局。
Nature. 2024 Feb;626(7999):643-652. doi: 10.1038/s41586-023-06954-0. Epub 2023 Dec 18.
7
Mega-scale experimental analysis of protein folding stability in biology and design.大规模实验分析生物学和设计中的蛋白质折叠稳定性。
Nature. 2023 Aug;620(7973):434-444. doi: 10.1038/s41586-023-06328-6. Epub 2023 Jul 19.
8
Epistasis and evolution: recent advances and an outlook for prediction.上位性与进化:最新进展与预测展望。
BMC Biol. 2023 May 24;21(1):120. doi: 10.1186/s12915-023-01585-3.
9
Understanding Mutations in Human SARS-CoV-2 Spike Glycoprotein: A Systematic Review & Meta-Analysis.理解人类 SARS-CoV-2 刺突糖蛋白中的突变:系统评价和荟萃分析。
Viruses. 2023 Mar 27;15(4):856. doi: 10.3390/v15040856.
10
Neural network-derived Potts models for structure-based protein design using backbone atomic coordinates and tertiary motifs.基于神经网络的 Potts 模型,使用骨架原子坐标和三级基序进行基于结构的蛋白质设计。
Protein Sci. 2023 Feb;32(2):e4554. doi: 10.1002/pro.4554.